new

Get trending papers in your email inbox!

Subscribe

Daily Papers

byAK and the research community

Apr 13

CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding

Coral reefs are vital yet vulnerable ecosystems that require continuous monitoring to support conservation. While coral reef images provide essential information in coral monitoring, interpreting such images remains challenging due to the need for domain expertise. Visual Question Answering (VQA), powered by Large Vision-Language Models (LVLMs), has great potential in user-friendly interaction with coral reef images. However, applying VQA to coral imagery demands a dedicated dataset that addresses two key challenges: domain-specific annotations and multidimensional questions. In this work, we introduce CoralVQA, the first large-scale VQA dataset for coral reef analysis. It contains 12,805 real-world coral images from 67 coral genera collected from 3 oceans, along with 277,653 question-answer pairs that comprehensively assess ecological and health-related conditions. To construct this dataset, we develop a semi-automatic data construction pipeline in collaboration with marine biologists to ensure both scalability and professional-grade data quality. CoralVQA presents novel challenges and provides a comprehensive benchmark for studying vision-language reasoning in the context of coral reef images. By evaluating several state-of-the-art LVLMs, we reveal key limitations and opportunities. These insights form a foundation for future LVLM development, with a particular emphasis on supporting coral conservation efforts.

  • 5 authors
·
Jul 14, 2025

The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs

Coral reefs are declining worldwide due to climate change and local stressors. To inform effective conservation or restoration, monitoring at the highest possible spatial and temporal resolution is necessary. Conventional coral reef surveying methods are limited in scalability due to their reliance on expert labor time, motivating the use of computer vision tools to automate the identification and abundance estimation of live corals from images. However, the design and evaluation of such tools has been impeded by the lack of large high quality datasets. We release the Coralscapes dataset, the first general-purpose dense semantic segmentation dataset for coral reefs, covering 2075 images, 39 benthic classes, and 174k segmentation masks annotated by experts. Coralscapes has a similar scope and the same structure as the widely used Cityscapes dataset for urban scene segmentation, allowing benchmarking of semantic segmentation models in a new challenging domain which requires expert knowledge to annotate. We benchmark a wide range of semantic segmentation models, and find that transfer learning from Coralscapes to existing smaller datasets consistently leads to state-of-the-art performance. Coralscapes will catalyze research on efficient, scalable, and standardized coral reef surveying methods based on computer vision, and holds the potential to streamline the development of underwater ecological robotics.

  • 6 authors
·
Mar 25, 2025

A Kiloparsec-Scale Stellar Cavity in the Center of Abell402-BCG May be Caused by Dynamic Interactions with an Ultramassive Black Hole

We present new observations from JWST NIRCam that reveal a striking kpc-wide cavity in the stellar distribution of the central galaxy in the cluster Abell402. Supporting data from HST allow us to rule out extinction due to dust as an explanation and, instead, suggest that this is a localized depression in the stellar density field corresponding to ~2x10^9 Msun in missing stars within a volume of 0.5kpc^3. On larger scales, both the JWST and HST data show evidence for a 2.2kpc flattened core in the stellar distribution (on which the smaller-scale cavity is superimposed), which implies the presence of a central ultra-massive black hole with M_BH = 6 +/- 4 x10^10 Msun. We report evidence for a mid-IR-bright point source at one edge of the cavity, suggesting that this black hole is actively accreting. MUSE spectroscopy reveal that this source is a LINER AGN and that there is a second candidate AGN on the opposite side of the cavity with a relative velocity of 370km/s -- if real, this implies the presence of a kpc-separation dual AGN with a total binary mass of 6 +/- 2 x10^10 Msun, which would make this the most massive binary black hole system discovered to date. We propose that this unique stellar cavity is the result of a short-lived dynamical interaction between at least one supermassive black hole and the background stellar density field, caused either by three-body scattering during binary hardening or the induction of a dipole instability in the stellar density field.

  • 21 authors
·
Mar 10