Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/5598
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSharma, Deepanjali-
dc.contributor.authorDasgupta, Anirban-
dc.contributor.authorSengupta, Anwesha-
dc.contributor.authorBhattacharya, Shubhobrata-
dc.date.accessioned2026-01-15T06:32:03Z-
dc.date.available2026-01-15T06:32:03Z-
dc.date.issued2025-12-
dc.identifier.citation10th International Conference on Computer Vision and Image Processing (CVIP), IIT Ropar, Punjab, 10-13 December 2025en_US
dc.identifier.urihttp://hdl.handle.net/2080/5598-
dc.descriptionCopyright belongs to the proceeding publisher.en_US
dc.description.abstractTraining robust underwater gesture recognition systems is limited by the scarcity and cost of annotated underwater data. Attempts to generate synthetic data by blending overwater images to introduce turbidity, color distortion, and illumination variability often fail to produce realistic images. This paper proposes AquaGestureSynth, a two-stage generative blending framework based on a Gaussian-Poisson Generative Adversarial Network (GP-GAN), to synthesize realistic underwater gesture images. The proposed method blends segmented overwater hand gestures with underwater backgrounds, introducing turbidity, light attenuation, and chromatic shifts while preserving gesture semantics. The first stage employs a Blending GAN for coarse realism, followed by the application of Gaussian-Poisson gradient-domain refinement within a Laplacian pyramid in the second stage. Further, a loss that combines style, content, histogram, and total variation losses to enhance texture consistency and visual realism has also been introduced. AquaGestureSynth achieves superior results across multiple metrics, with Structural Similarity Index of 0.91, Fr´echet Inception Distance (FID) of 24.8, and Learned Perceptual Image Patch Similarity (LPIPS) of 0.101, thereby outperforming Cycle-GAN and Poisson blending. Downstream gesture classification models trained on our synthetic data achieve up to 96% F1-score, demonstrating the model’s utility in augmenting underwater gesture datasets for human-robot interaction.en_US
dc.subjectUnderwater Gesture Recognitionen_US
dc.subjectGenerative Adversarial Networken_US
dc.subjectGPGANen_US
dc.subjectData Augmentationen_US
dc.subjectLaplacian Pyramiden_US
dc.titleAquaGestureSynth: Generating Realistic Underwater Hand Gesture Images with Gaussian-Poisson GANen_US
dc.typeArticleen_US
Appears in Collections:Conference Papers

Files in This Item:
File Description SizeFormat 
2025_CVIP_ASengupta_Aqua.pdf3.54 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.