Data gaps in Earth Science
71 open data research questions in Earth Science — gaps in available data, datasets, benchmarks, or measurements — extracted from 51 papers in our local library. Below are representative open questions, each linked to the paper that raised it.
Representative open questions
Showing 30 of 71 — one per source paper, highest-quality first.
- Integrative evidence reveals adaptive divergence and speciation in gentoo penguins (2026) · doi
The paper integrates species distribution modeling (biomod2) with climate data (Turner et al. 2005, 2017) to project Gentoo Penguin range shifts, but does not incorporate fine-scale oceanographic variables (sea surface temperature, ice concentration, prey availability) specific to foraging habitats during breeding versus non-breeding seasons. Dynamic species distribution models coupling breeding and non-breeding niche requirements with decadal-resolution oceanographic datasets are required to improve predictive accuracy.
- The impact of aerosol-ice nuclei-cloud interactions on a typical spring dust-precipitation event in China (2026) · doi
The study evaluated aerosol-IN-cloud interactions for only a single spring dust-precipitation event in China. Multiple case studies across different seasons (summer, autumn, winter) and varying dust concentration levels are needed to validate the generalizability of the online aerosol-IN nucleation scheme and its precipitation simulation improvements.
- Revisiting the global budget of atmospheric glyoxal: updates on terrestrial and marine precursor emissions, chemistry, and impacts on atmospheric oxidation capacity (2026) · doi
The Henry's law constants for glyoxal and its precursors (formaldehyde, isoprene oxidation products) need refinement across different temperature and salinity regimes relevant to marine boundary layer conditions, as current values may not accurately represent air-sea partitioning in tropical and polar regions.
- Dynamics of island mass effect – Part 2: Phytoplankton physiological responses (2026) · doi
The comparison of in situ bio-optical properties (chlorophyll a, backscattering coefficient, iron stress index) between inbound and outbound transects is confounded by different water masses being sampled, particularly for the Society Islands where 38 days separated the two transects; concurrent temporal sampling or Lagrangian tracking of the same water masses is needed to isolate island mass effect processes.
- CMIP7 Data Request: atmosphere priorities and opportunities (2026) · doi
The mismatch between instantaneous ozone sampling (first day of every month) and monthly-mean climatological ozone fields for radiation scheme diagnostics creates temporal discontinuities that require evaluation. The impact of this sampling frequency mismatch on the accuracy of methane and ozone radiative forcing estimates in CMIP7 models has not been quantified.
- AdaBoost-powered multi-class classification of pre-earthquake ionospheric anomalies using GNSS network in Türkiye: A comparison with random forest (2026) · doi
The paper notes that marine boundary layer effects (high humidity >75%, sea spray aerosols, enhanced evaporation) increase atmospheric conductivity 2-3 times and elevate TEC variability in offshore events, but does not investigate whether incorporating meteorological variables (humidity, aerosol optical depth, sea surface temperature) as additional features would improve AdaBoost or Random Forest performance for maritime earthquake precursor detection.
- Unraveling non-monotonic responses of the El Niño–Southern Oscillation to post-2100 global warming (2026) · doi
The first two EOF modes, used to separate Eastern Pacific (EP) from Central Pacific (CP) ENSO flavors, explained only 88.3% of the variance; the contribution of higher EOF modes (3+) to ENSO diversity, and their potential shifts under post-2100 warming levels, have not been characterized.
- The interplay of habitat quality and temperature shape demographic patterns of mule deer (Odocoileus hemionus) in North America (2026) · doi
The study documented that herd units experienced an average of 9.5 months of severe drought (scPDSI < -3) over five-year periods, but did not quantify how cumulative drought duration versus intensity affects mule deer age ratios, nor did it model potential non-linear or lagged demographic responses to prolonged drought conditions.
- Marine chemistry special issue: Aerosol chemistry and impacts on the ocean (2010) · doi
Phosphorus bioavailability from aerosol deposition is understudied relative to iron and aluminum. Anderson et al. (2010) raise the question of what aerosol phosphorus associations can reveal about P bioavailability, but quantitative data on fractional solubility of different aerosol phosphorus forms (organic, inorganic, particulate, dissolved) and their biological uptake efficiency are sparse.
- Climate Change, Food Security, and Scheduled Caste Women’s Livelihoods: A Field-Based Study of Rajnagar Block, Birbhum, West Bengal (2026) · doi
The paper identifies that climate change has affected agricultural production leading to crop failures but does not quantify specific climate variables (rainfall patterns, temperature anomalies, drought frequency) or their temporal correlation with food security deterioration in Rajnagar Block during the study period.
- Developmental Biology: Embryonic Movement Influences Sex Determination in a Turtle (2019) · doi
Intra- and inter-individual variation in thermal responses and pivotal temperatures for sex determination in turtle embryos has not been comprehensively characterized. Quantitative studies measuring variation in embryonic thermoregulation capacity and temperature-dependent sex determination within and between individuals are needed to predict population-level climate change responses.
- Turf Algae Monitoring on Artificial Reefs Using Underwater Photography and ImageJ (2026) · doi
Environmental covariates (light intensity, hydrodynamic flow, water temperature) were not measured during the study; future monitoring workflows must integrate concurrent measurement of these physical parameters with ImageJ-based turf algae segmentation to determine which factors drive the observed differences in algal cover among sediment categories.
- Earthquake Induced Landslide Identification to Support Landuse Planning in Rapid Growing Settlements Area of Imogiri and Dlingo Sub-District, Yogyakarta (2026) · doi
The study uses Landsat-based classification for landuse change detection between 2013-2022, but does not specify the minimum mapping unit or validation accuracy achieved for distinguishing settlement encroachment into medium-susceptibility areas (64.08 km², 57% of study area). Finer-resolution multispectral or SAR data validation would be needed to refine settlement boundary delineation in steep terrain where shadow and slope distortion affect classification.
- Atmospheric Simulation Chambers in the ACTRIS Research Infrastructure (2026) · doi
Multiple studies document secondary organic aerosol formation from α-pinene ozonolysis in atmospheric chambers, but direct systematic comparison of hygroscopicity and optical properties across different chamber designs, wall materials, and operating conditions (humidity, temperature, residence time) is lacking, preventing chamber-independent validated datasets for climate modeling.
- Seasonal variability of physico-chemical parameters and their effects on fish diversity in artificial reefs of the Sea of Oman (2026) · doi
The study recommends long-term monitoring of physico-chemical parameters (temperature, salinity, dissolved oxygen) and fish assemblages, but does not specify the monitoring frequency, duration, sampling grid resolution, or multi-year datasets needed to establish baseline conditions and detect climate-driven shifts in reef fish communities.
- Atmospheric oxidation chemistry of hexafluoroisobutylene initiated by OH radical: Kinetics and mechanism (2021) · doi
The atmospheric lifetime of (CF3)2C=CH2 was calculated at 298 K only; temperature-dependent kinetic measurements across the full range of tropospheric conditions (200-320 K) are needed to accurately predict how long this persistent organic pollutant remains in different atmospheric layers and seasons.
- Long-term nitrogen burial exceeds denitrification in global fjords (2026) · doi
The study compiled a global dataset of sediment burial and microbial N2 production rates from fjords, but does not specify whether temporal dynamics of nitrogen removal were captured at seasonal or interannual scales, limiting understanding of how climate-driven oxygen fluctuations affect long-term nitrogen burial versus denitrification partitioning.
- Agrivoltaic System Designing for Sustainability and Smart Irrigation (2026) · doi
The paper claims that partial shading from solar panels reduces soil evaporation and creates a favorable microclimate for certain crops, but provides no empirical data comparing evapotranspiration rates, soil temperature profiles, or crop yield outcomes between agrivoltaic zones (under panels), inter-panel zones, and open-field control areas under identical paddy cultivation conditions.
- The chemistry of climate change (2012) · doi
The mechanism by which stratospheric water vapor concentrations increased 50% in the latter half of the 20th century is incompletely understood. The paper identifies two potential sources—methane oxidation and water vapor transport via tropical convective storms—but lacks quantitative partitioning between these mechanisms and does not specify which datasets or atmospheric models could resolve this ambiguity.
- Chemistry of the atmosphere (2012) · doi
The text states that ozone lifetime ranges from minutes to hours in the upper stratosphere to years at the bottom, enabling long-distance transport and storage in the lower stratosphere. However, no specific measurements of ozone lifetime as a function of altitude, temperature, or pressure conditions are provided, nor are there quantitative models reconciling this transport-driven accumulation with predicted steady-state concentrations.
- Collapse of the Atlantic meridional overturning circulation would lead to substantial oceanic carbon release and additional global warming (2026) · doi
The paper addresses global warming from carbon release driven by the Atlantic Meridional Overturning Circulation (AMOC) but does not specify quantitative thresholds of AMOC weakening that would trigger substantial ocean carbon outgassing. Paleo-oceanographic data from Dansgaard-Oeschger cycles and glacial rapid climate change events should be systematically analyzed to constrain carbon release magnitudes under different AMOC reduction scenarios.
- Integrated pollution and carbon mitigation delivers major health and economic co-benefits in China (2026) · doi
The study applies a constant background PM2.5 concentration term (k0,m) in the source-receptor model (Eq. 6) without specifying how this background varies temporally or responds to transboundary pollution transport; the robustness of health and economic co-benefit estimates to alternative background concentration assumptions across different meteorological conditions and seasons remains unexamined.
- Spatiotemporal evolution and drivers of grassland ecosystem service value in Inner Mongolia (2026) · doi
The paper identifies interaction effects between annual average temperature and annual precipitation as relatively strong drivers of grassland ESV but does not specify quantitative thresholds or non-linear response curves for these climate variables across different grassland types in Inner Mongolia. Research should map the specific temperature-precipitation combinations that maximize ESV for alpine meadows, steppe, and desert grassland ecosystems separately.
- Climate change-induced agricultural disasters and policy-based agricultural insurance: empirical findings from China (2026) · doi
Region 1 (low disaster severity, low insurance payouts) shows weak association between agricultural disasters and insurance indemnity payments, but the paper does not investigate whether this weak linkage reflects genuine low-loss events or represents uninsured agricultural areas due to insufficient coverage during the early expansion phase of China's insurance system.
- Human-induced intensification of sea surface temperature regime shifts threatens global Large Marine Ecosystems (2026) · doi
The paper cites CMIP6 models for detecting abrupt shifts but does not specify which climate model ensemble members, emissions scenarios, or spatial resolutions were used to assess the nonstationary response of pelagic species recruitment to SST regime shifts. Standardized comparisons across CMIP6 configurations for predicting LME-specific regime shift timing and biological consequences are absent.
- Inferring drivers of tropical isoprene: competing effects of emissions and chemistry (2026) · doi
The study uses CrIS monthly retrievals (2012-2020) but does not address how seasonal or interannual variability in NOx sources (especially lightning and soil emissions parameterized by OTD/LIS and BDSNP) modulates the inferred isoprene drivers across different tropical regions.
- Cloud chemistry (2011) · doi
The measured rate coefficients for O3 reactions with S(IV) species (k0, k1, k2 in Eq. 13.53) are provided at 298 K only; temperature dependence of these rate coefficients across the range of cloud temperatures (268-288 K) necessary for modeling in-cloud sulfate formation is not addressed.
- Reassessing Recent Tectonic Activity in Western Svalbard: Fault Kinematics and Crustal Deformation Patterns (2026) · doi
The paper relies on GPS and VLBI observations from limited stations (notably Ny-Ålesund) to characterize crustal deformation patterns across Western Svalbard; the spatial resolution of geodetic monitoring networks is insufficient to resolve fault-scale kinematics along individual fault segments, particularly for the microseismicity associated with ultraslow-spreading ridge systems.
- Shrub Age and Water Dynamics Influence Primary Production, Carbon, and Nitrogen Stocks in a Coastal Environment (2024) · doi
The study examines shrub age effects on carbon and nitrogen stocks along a coastal chronosequence, but does not quantify how interannual variation in precipitation intensity affects the relationship between shrub encroachment and soil carbon sequestration rates across different water table depths in coastal environments.
- Leveraging artificial intelligence (AI) techniques for sustainable marine resources (2026) · doi
High-quality, standardized, and interoperable marine datasets remain fragmented with inconsistencies in data formats, classification schemes, and annotation practices that hinder cross-platform integration; ocean regions in the Global South and deep-sea environments are critically undersampled, and the lack of long-term time series datasets limits capacity to model temporal dynamics and climate-driven ecosystem changes in AI models.
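Several of the gaps above (the hexafluoroisobutylene and cloud chemistry entries) come down to rate coefficients reported at 298 K only. The sketch below illustrates why that matters, using the standard relations k(T) = A·exp(−Ea/RT) and lifetime τ = 1/(k·[OH]). The Arrhenius parameters and the OH concentration are hypothetical placeholders chosen for illustration, not values from any of the cited papers. The sign of the activation energy is also an assumption, since OH-addition reactions can show the opposite temperature trend.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1


def arrhenius_k(temp_k, a_factor, ea_j_per_mol):
    """Rate coefficient from the Arrhenius form k(T) = A * exp(-Ea / (R * T))."""
    return a_factor * math.exp(-ea_j_per_mol / (R * temp_k))


def oh_lifetime_days(k_oh, oh_conc):
    """Lifetime against OH loss, tau = 1 / (k * [OH]), converted to days."""
    return 1.0 / (k_oh * oh_conc) / 86400.0


# Hypothetical inputs, for illustration only (not from the cited papers).
A_FACTOR = 1.0e-12  # cm^3 molecule^-1 s^-1 (assumed)
EA = 2000.0         # J mol^-1 (assumed; real values may even be negative)
OH_CONC = 1.0e6     # molecules cm^-3 (assumed order of magnitude)

if __name__ == "__main__":
    for temp in (200.0, 250.0, 298.0, 320.0):
        k = arrhenius_k(temp, A_FACTOR, EA)
        tau = oh_lifetime_days(k, OH_CONC)
        print(f"T = {temp:5.1f} K  k = {k:.3e}  lifetime = {tau:.1f} days")
```

With any non-zero activation energy, the lifetime computed at 298 K differs from the lifetime at upper-tropospheric temperatures, which is why a single-temperature measurement cannot constrain persistence across atmospheric layers.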