legacysurveys-dr10-embeddings
收藏Legacy Survey DR10 Embeddings 数据集概述
数据集基本信息
- 数据集名称: Legacy Survey DR10 Embeddings for 100M+ Galaxies
- 数据集地址: https://huggingface.co/datasets/astronolan/legacysurveys-dr10-embeddings
- 许可证: MIT
- 数据规模: 10M < n < 100M
- 标签: astronomy, galaxy-embeddings, clip, aion, semantic-search
数据集内容
该数据集包含来自“多模态宇宙”中超过1亿个Legacy DR10星系的AION-Search和AION-1-B嵌入向量。具体包含110,204,390个星系。
许可证与数据来源
- 嵌入向量及本仓库的打包内容根据MIT许可证发布。
- 底层目录数据源自Legacy Survey DR10,仍需遵守原始的Legacy Survey数据使用政策及必要的致谢要求。
引用信息
嵌入向量相关引用
会议论文引用:
@inproceedings{koblischke2025why, title={Why wait for human annotations when you have AI? Semantic searching scientific images with synthetic labels}, author={Nolan Koblischke and Liam Parker and Francois Lanusse and Irina Espejo Morales and Jo Bovy and Shirley Ho}, booktitle={NeurIPS 2025 AI for Science Workshop}, year={2025}, url={https://openreview.net/forum?id=j8Qxvb37HQ} }
预印本引用:
@misc{koblischke2025semantic, title={Semantic search for 100M+ galaxy images using AI-generated captions}, author={Nolan Koblischke and Liam Parker and Francois Lanusse and Irina Espejo Morales and Jo Bovy and Shirley Ho}, year={2025}, eprint={2512.11982}, archivePrefix={arXiv}, primaryClass={astro-ph.IM}, url={https://arxiv.org/abs/2512.11982}, }
预印本链接:https://arxiv.org/abs/2512.11982
原始Legacy Survey数据集引用
数据发布10(DR10)是Legacy Surveys的第十次公共数据发布。在论文中使用Legacy Surveys的数据时,请使用README中提供的完整致谢文本。
学术文献引用:
@ARTICLE{2019AJ....157..168D, author = {{Dey}, Arjun and {Schlegel}, David J. and {Lang}, Dustin and {Blum}, Robert and {Burleigh}, Kaylan and {Fan}, Xiaohui and {Findlay}, Joseph R. and {Finkbeiner}, Doug and {Herrera}, David and {Juneau}, St{e}phanie and {Landriau}, Martin and {Levi}, Michael and {McGreer}, Ian and {Meisner}, Aaron and {Myers}, Adam D. and {Moustakas}, John and {Nugent}, Peter and {Patej}, Anna and {Schlafly}, Edward F. and {Walker}, Alistair R. and {Valdes}, Francisco and {Weaver}, Benjamin A. and {Y{`e}che}, Christophe and {Zou}, Hu and {Zhou}, Xu and {Abareshi}, Behzad and {Abbott}, T.~M.~C. and {Abolfathi}, Bela and {Aguilera}, C. and {Alam}, Shadab and {Allen}, Lori and {Alvarez}, A. and {Annis}, James and {Ansarinejad}, Behzad and {Aubert}, Marie and {Beechert}, Jacqueline and {Bell}, Eric F. and {BenZvi}, Segev Y. and {Beutler}, Florian and {Bielby}, Richard M. and {Bolton}, Adam S. and {Brice{~n}o}, C{e}sar and {Buckley-Geer}, Elizabeth J. and {Butler}, Karen and {Calamida}, Annalisa and {Carlberg}, Raymond G. and {Carter}, Paul and {Casas}, Ricard and {Castander}, Francisco J. and {Choi}, Yumi and {Comparat}, Johan and {Cukanovaite}, Elena and {Delubac}, Timoth{e}e and {DeVries}, Kaitlin and {Dey}, Sharmila and {Dhungana}, Govinda and {Dickinson}, Mark and {Ding}, Zhejie and {Donaldson}, John B. and {Duan}, Yutong and {Duckworth}, Christopher J. and {Eftekharzadeh}, Sarah and {Eisenstein}, Daniel J. and {Etourneau}, Thomas and {Fagrelius}, Parker A. and {Farihi}, Jay and {Fitzpatrick}, Mike and {Font-Ribera}, Andreu and {Fulmer}, Leah and {G{"a}nsicke}, Boris T. and {Gaztanaga}, Enrique and {George}, Koshy and {Gerdes}, David W. and {Gontcho}, Satya Gontcho A. and {Gorgoni}, Claudio and {Green}, Gregory and {Guy}, Julien and {Harmer}, Diane and {Hernandez}, M. and {Honscheid}, Klaus and {Huang}, Lijuan Wendy and {James}, David J. and {Jannuzi}, Buell T. and {Jiang}, Linhua and {Joyce}, Richard and {Karcher}, Armin and {Karkar}, Sonia and {Kehoe}, Robert and {Kneib}, Jean-Paul and {Kueter-Young}, Andrea and {Lan}, Ting-Wen and {Lauer}, Tod R. and {Le Guillou}, Laurent and {Le Van Suu}, Auguste and {Lee}, Jae Hyeon and {Lesser}, Michael and {Perreault Levasseur}, Laurence and {Li}, Ting S. and {Mann}, Justin L. and {Marshall}, Robert and {Mart{\i}nez-V{a}zquez}, C.~E. and {Martini}, Paul and {du Mas des Bourboux}, H{e}lion and {McManus}, Sean and {Meier}, Tobias Gabriel and {M{e}nard}, Brice and {Metcalfe}, Nigel and {Mu{~n}oz-Guti{e}rrez}, Andrea and {Najita}, Joan and {Napier}, Kevin and {Narayan}, Gautham and {Newman}, Jeffrey A. and {Nie}, Jundan and {Nord}, Brian and {Norman}, Dara J. and {Olsen}, Knut A.~G. and {Paat}, Anthony and {Palanque-Delabrouille}, Nathalie and {Peng}, Xiyan and {Poppett}, Claire L. and {Poremba}, Megan R. and {Prakash}, Abhishek and {Rabinowitz}, David and {Raichoor}, Anand and {Rezaie}, Mehdi and {Robertson}, A.~N. and {Roe}, Natalie A. and {Ross}, Ashley J. and {Ross}, Nicholas P. and {Rudnick}, Gregory and {Safonova}, Sasha and {Saha}, Abhijit and {S{a}nchez}, F. Javier and {Savary}, Elodie and {Schweiker}, Heidi and {Scott}, Adam and {Seo}, Hee-Jong and {Shan}, Huanyuan and {Silva}, David R. and {Slepian}, Zachary and {Soto}, Christian and {Sprayberry}, David and {Staten}, Ryan and {Stillman}, Coley M. and {Stupak}, Robert J. and {Summers}, David L. and {Sien Tie}, Suk and {Tirado}, H. and {Vargas-Maga{~n}a}, Mariana and {Vivas}, A. Katherina and {Wechsler}, Risa H. and {Williams}, Doug and {Yang}, Jinyi and {Yang}, Qian and {Yapici}, Tolga and {Zaritsky}, Dennis and {Zenteno}, A. and {Zhang}, Kai and {Zhang}, Tianmeng and {Zhou}, Rongpu and {Zhou}, Zhimin}, title = "{Overview of the DESI Legacy Imaging Surveys}", journal = {aj}, keywords = {catalogs, surveys, Astrophysics - Instrumentation and Methods for Astrophysics}, year = 2019, month = may, volume = {157}, number = {5}, eid = {168}, pages = {168}, doi = {10.3847/1538-3881/ab089d}, archivePrefix = {arXiv}, eprint = {1804.08657}, primaryClass = {astro-ph.IM}, adsurl = {https://ui.adsabs.harvard.edu/abs/2019AJ....157..168D}, adsnote = {Provided by the SAO/NASA Astrophysics Data System} }




