14 January 2025

Deep generative modeling of annotated bacterial biofilm images

Biofilms are critical for understanding environmental processes, developing biotechnology applications, and progressing in medical treatments of various infections. Nowadays, a key limiting factor for biofilm analysis is the difficulty in obtaining large datasets with fully annotated images. This study introduces a versatile approach for creating synthetic datasets of annotated biofilm images with employing deep generative modeling techniques, including VAEs, GANs, diffusion models, and CycleGAN. Synthetic datasets can significantly improve the training of computer vision models for automated biofilm analysis, as demonstrated with the application of Mask R-CNN detection model. The approach represents a key advance in the field of biofilm research, offering a scalable solution for generating high-quality training data and working with different strains of microorganisms at different stages of formation. Terabyte-scale datasets can be easily generated on personal computers. A web application is provided for the on-demand generation of biofilm images.

Reference: npj Biofilms Microbiomes, 2024, 11, 16.

DOI: 10.1038/s41522-025-00647-4

>