← Back to Research Radar
Scientific Literature Scientific Literature

Improving rare-class detection in deep-sea imagery via generative augmentation with stable diffusion

Junlan Deng, Mi Duan, Dingbang Wei, Wei Song, Xuebao He, Jianxin Xia
April 3, 2026
Published Date

Research Abstract & Technology Focus

Megabenthos play a critical role in maintaining deep-sea ecosystem stability, making accurate detection important for deep-sea conservation. However, the high cost of deep-sea exploration and the long-tailed distribution of available datasets lead to severe data scarcity for rare species, limiting deep-sea benthos detection. To address this challenge, we propose a data augmentation framework based on Stable Diffusion (SD) and ControlNet. Specifically, we fine-tune a pretrained SD model using Low-Rank Adaptation (LoRA) to synthesize images of rare benthos, and leverage ControlNet to composite the generated targets into deep-sea backgrounds with controllable layouts and automatic bounding-box annotation. We constructed two megabenthos datasets collected using an optically tethered underwater vehicle (OTV) and an autonomous underwater vehicle (AUV), covering 16 biological categories; data augmentation was applied to 7 rare species with the fewest samples. The generated images achieved a Fréchet Inception Distance (FID) of 117.11 and an Inception Score (IS) of 4.97. When combined with real data for RT-DETR training, the augmentation strategy increased the AP50-95 and AP50 on the OTV dataset to 45.2% and 75.2%, representing improvements of 3.7% and 6.1% over the baseline. Similarly, on the AUV dataset, it increased the AP50-95 and AP50 to 36.8% and 64.7%, yielding enhancements of 2.2% and 4.2% over the baseline. Gains were especially pronounced for tail classes, with AP50-95 increased by 23.6% and 21.9% for Octopus and Bryozoa on the OTV dataset, and by 15.1% and 14.6% for Bryozoa and Hydrozoa on the AUV dataset. Moreover, the proposed approach outperforms traditional augmented methods by 1.6% and 0.8% in AP50-95 on the OTV and AUV datasets, respectively, indicating its utility for improving detection in deep-sea megabenthic surveys.
Read Full Literature

Correlated Market Trend: Artificial Intelligence

Bridging academia to market: The 60-day public search velocity mapping directly to the core technology of this paper. Dashed line represents 7-day moving average.

AI Semantic Synergy Context

Connecting this academic literature to real-world market discussions and products.

openalex.org › research concept
48%
🔥

Improving rare-class detection in deep-sea imagery via generative augmentation with stable diffusion

Megabenthos play a critical role in maintaining deep-sea ecosystem stability, making accurate detection important for deep-sea conservation. However, the high cost of deep-sea exploration and the l...

crossref.org › academic paper
0%

AI models collapse when trained on recursively generated data

Abstract Stable diffusion revolutionized image creation from descriptive text. GPT-2 (ref. 1), GPT-3(.5) (ref. 2) and GPT-4 (ref. 3) demonstrated high performance across a variety of lang...

openalex.org › research concept
0%

Adaptive energy-efficient and secure clustering-based routing architecture for underwater wireless sensor networks in marine environmental and ecosystem monitoring

Introduction Reliable long-term monitoring of coral reefs and other marine ecosystems is limited by the harsh underwater environment, restricted battery capacity of sensor nodes, and the high energ...

openalex.org › research concept
0%

Physical prior-guided SAM adaptation for underwater scene segmentation

Underwater image segmentation is fundamental to marine exploration and autonomous underwater vehicle navigation, yet its accuracy is severely compromised by wavelength-selective absorption and scat...

openalex.org › research concept
0%

Exploiting Phase Memory in Multicarrier Waveforms for Robust Underwater Acoustic Communication

Reliable underwater acoustic (UWA) communication is fundamental to marine sensing applications, including environmental monitoring, underwater sensor networks, and autonomous platforms, yet remains...

Frequently Asked Questions (FAQ)

Curated market intelligence mapped to this research.

What is the core focus of the research titled 'Improving rare-class detection in deep-sea imagery via generative augmentation with stable diffusion'?

This literature focuses on: Megabenthos play a critical role in maintaining deep-sea ecosystem stability, making accurate detection important for deep-sea conservation. However, the high cost of deep-sea exploration and the long-tailed distribution of available datasets lead...

What other academic literature is closely related to 'Improving rare-class detection in deep-sea imagery via generative augmentation with stable diffusion'?

Yes, highly correlated activity was mapped. An entry titled 'Improving rare-class detection in deep-sea imagery via generative augmentation with stable diffusion' discusses this: Megabenthos play a critical role in maintaining deep-sea ecosystem stability, making accurate detection important for deep-sea conservation. Howeve...

Cite this Market Intelligence Report

Reference our AI-mapped synergy between this research and the commercial market to instantly build authority.