The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) has released its entire dataset to the public, a collection of more than 600 million spectra and a catalog of over one million galaxies. The data, gathered between 2017 and 2024 from a sky area equal to 2,000 full Moons, maps the universe back to an era just 1.8 billion years after the Big Bang. This release gives astronomers, students, and citizen researchers a direct view of Cosmic Noon - the period 10 to 12 billion years ago when most stars were forming - and removes long-standing barriers to working with data of this scale.
Cosmic Noon mapped through 600 million spectra
HETDEX used spectroscopy to break light from tens of thousands of objects at once into its component wavelengths. Each spectrum acts as a record of an object's chemistry, motion, and distance. "This is a spectral map of the universe. It turns every point of light into a barcode of physics," said Erin Mentuch Cooper, HETDEX data manager and lead author on the release paper. "The real excitement is what happens when thousands of astronomers start exploring it." The survey did not focus on individual targets; instead, it mapped both galaxies and the vast spaces between them, creating a three-dimensional atlas of the distant cosmos.
Untargeted survey reveals a diverse catalog
"We aren't picking and choosing specific objects to observe. Instead, we're pointing one of the world's largest telescopes at the sky and seeing what's out there," said Karl Gebhardt, HETDEX principal investigator. "We fully expect to find some really cool, wild stuff hiding in the data." The catalog compiled from the survey includes:
- over one million distant galaxies
- half a million nearby star-forming galaxies
- 18,000 supermassive black holes
- over 150,000 stars
Gebhardt added, "HETDEX gives us an unusually wide and detailed spectroscopic view of the universe at a time when most stars were being formed. There's a lot of potential here."
Supercomputers and cloud hubs lower the barrier to entry
The raw archive totals half a petabyte, but the team processed it down to a more manageable 10 terabytes. Through a collaboration with the Texas Advanced Computing Center, the release includes cloud-based Jupyter Hubs that let anyone run large-scale analyses in a web browser, with all data and software pre-loaded. Users can also download customized subsets by sky location. The HETDEX Jupyter Hubs are available at https://jupyter.tacc.cloud/. "We've turned more than half a billion spectra into something you can actually explore. It's like compressing a universe of information into something you can hold in your hands," Cooper said.
AI already integrated into data processing
Software from RAIC Labs automatically removed contamination from satellites and meteors crossing the telescope's field of view. Automated methods also combed through observations to flag candidate early galaxies, while more than 24,000 citizen scientists confirmed the findings through the Dark Energy Explorers program. The project's reliance on automated detection and cleaning tools illustrates how AI for Science & Research can accelerate the processing of massive astronomical surveys. Today's release marks the first time the full HETDEX dataset and survey catalog have been made available together. Observations continue, calibrations are still improving, and supplementary releases are planned.
Why this matters for Science and Research professionals
This dataset and the accompanying cloud tools remove the technical barriers that once kept large-scale astronomical research locked inside major institutions. Researchers can now query a billion-year-old spectral map without owning supercomputing infrastructure, and the AI-ready formats invite new classification and anomaly detection work. For anyone in data-intensive science, the HETDEX release is both a rare scientific resource and a practical model for making complex, petabyte-scale data publicly usable.
Your membership also unlocks: