five

juliensimon/nasa-exoplanets

收藏
Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/juliensimon/nasa-exoplanets
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 pretty_name: "NASA Exoplanet Archive" language: - en description: "Confirmed exoplanets with orbital, stellar, and discovery parameters from the NASA Exoplanet Archive." task_categories: - tabular-classification - tabular-regression tags: - exoplanet - astronomy - nasa - transit - radial-velocity - kepler - tess - open-data size_categories: - 1K<n<10K --- # NASA Exoplanet Archive ![Update Exoplanets](https://github.com/juliensimon/space-datasets/actions/workflows/update-exoplanets.yml/badge.svg) ![Updated](https://img.shields.io/badge/dynamic/json?url=https://raw.githubusercontent.com/juliensimon/space-datasets/main/status.json&query=$.exoplanets&label=updated&color=brightgreen) All confirmed exoplanets from the [NASA Exoplanet Archive](https://exoplanetarchive.ipac.caltech.edu/), with orbital parameters, stellar properties, and discovery metadata. Currently **6,150** confirmed planets. ## Dataset description The NASA Exoplanet Archive is the authoritative database of confirmed exoplanets, maintained by Caltech/IPAC under contract with NASA. Each entry represents a confirmed planet with its best-available physical and orbital parameters, host star properties, and discovery information. This dataset uses the Planetary Systems (`ps`) table with `default_flag=1` to select one row per planet with the default parameter set. ## Schema | Column | Type | Description | |--------|------|-------------| | `pl_name` | string | Planet name (e.g. "Kepler-22 b", "TRAPPIST-1 e") | | `hostname` | string | Host star name | | `discoverymethod` | string | Discovery method (Transit, Radial Velocity, etc.) | | `disc_year` | Int64 | Year of discovery | | `disc_facility` | string | Discovery facility name | | `pl_orbper` | float | Orbital period in days | | `pl_rade` | float | Planet radius in Earth radii | | `pl_bmasse` | float | Planet mass in Earth masses | | `pl_eqt` | float | Equilibrium temperature in K | | `pl_orbsmax` | float | Semi-major axis in AU | | `pl_orbeccen` | float | Orbital eccentricity | | `st_teff` | float | Stellar effective temperature in K | | `st_rad` | float | Stellar radius in solar radii | | `st_mass` | float | Stellar mass in solar masses | | `sy_dist` | float | Distance in parsecs | | `sy_vmag` | float | V-band magnitude | | `ra` | float | Right ascension in degrees | | `dec` | float | Declination in degrees | | `rowupdate` | string | Date of last row update | ## Quick stats - **6,150** confirmed exoplanets - Most recent discovery: **DMPP-8 b** (2026) ### By discovery method | Method | Count | |--------|-------| | Transit | 4,517 | | Radial Velocity | 1,182 | | Microlensing | 275 | | Imaging | 94 | | Transit Timing Variations | 39 | | Eclipse Timing Variations | 17 | | Orbital Brightness Modulation | 9 | | Pulsar Timing | 8 | ### Recent discoveries by year | Year | Count | |------|-------| | 2026 | 63 | | 2025 | 243 | | 2024 | 259 | | 2023 | 324 | | 2022 | 369 | | 2021 | 564 | | 2020 | 234 | | 2019 | 196 | | 2018 | 315 | | 2017 | 152 | ## Usage ```python from datasets import load_dataset ds = load_dataset("juliensimon/nasa-exoplanets", split="train") df = ds.to_pandas() # Earth-like candidates: rocky, in habitable zone habitable = df[ (df["pl_rade"] < 1.6) & (df["pl_eqt"] > 200) & (df["pl_eqt"] < 310) ] print(f"{len(habitable)} potentially habitable planets") # Transit vs radial velocity discoveries over time transit = df[df["discoverymethod"] == "Transit"] rv = df[df["discoverymethod"] == "Radial Velocity"] # Planets by discovery facility top_facilities = df["disc_facility"].value_counts().head(10) ``` ## Data source All data comes from the [NASA Exoplanet Archive](https://exoplanetarchive.ipac.caltech.edu/), maintained by Caltech/IPAC under contract with NASA. Data is queried via the TAP API using the Planetary Systems table. ## Update schedule Weekly on Monday at 16:00 UTC via [GitHub Actions](https://github.com/juliensimon/space-datasets). ## Related datasets - [space-track-satcat](https://huggingface.co/datasets/juliensimon/space-track-satcat) — NORAD Satellite Catalog - [space-launch-log](https://huggingface.co/datasets/juliensimon/space-launch-log) — Global launch history from GCAT ## Pipeline Source code: [juliensimon/space-datasets](https://github.com/juliensimon/space-datasets) ## Citation ```bibtex @dataset{nasa_exoplanets, author = {Simon, Julien}, title = {NASA Exoplanet Archive}, year = {2026}, publisher = {Hugging Face}, url = {https://huggingface.co/datasets/juliensimon/nasa-exoplanets}, note = {Based on data from the NASA Exoplanet Archive, operated by Caltech/IPAC} } ``` ## License [CC-BY-4.0](https://creativecommons.org/licenses/by/4.0/)

--- license: CC BY 4.0(知识共享署名4.0国际许可协议) pretty_name: "NASA系外行星档案(NASA Exoplanet Archive)" language: - en description: "收录来自NASA系外行星档案(NASA Exoplanet Archive)的已确认系外行星数据,包含轨道参数、恒星属性与发现相关参数。" task_categories: - tabular-classification - tabular-regression tags: - 系外行星(exoplanet) - 天文学(astronomy) - NASA(nasa) - 凌星法(transit) - 视向速度法(radial-velocity) - 开普勒(kepler) - TESS(tess) - 开放数据(open-data) size_categories: - 1K<n<10K --- # NASA系外行星档案(NASA Exoplanet Archive) ![Update Exoplanets](https://github.com/juliensimon/space-datasets/actions/workflows/update-exoplanets.yml/badge.svg) ![Updated](https://img.shields.io/badge/dynamic/json?url=https://raw.githubusercontent.com/juliensimon/space-datasets/main/status.json&query=$.exoplanets&label=updated&color=brightgreen) 本数据集收录来自[NASA系外行星档案](https://exoplanetarchive.ipac.caltech.edu/)的全部已确认系外行星数据,包含轨道参数、恒星属性与发现元数据,目前共收录**6150颗**已确认系外行星。 ## 数据集说明 NASA系外行星档案是由加州理工学院IPAC中心受NASA合同委托维护的权威已确认系外行星数据库。每条数据对应一颗已确认系外行星,包含当前可获取的最优物理与轨道参数、宿主恒星属性以及发现相关信息。本数据集选取行星系统表(Planetary Systems,`ps`)中`default_flag=1`的条目,为每颗行星选取一套默认参数对应的单行数据。 ## 数据结构 | 列名 | 数据类型 | 描述 | |--------|------|-------------| | `pl_name` | string | 行星名称(例如"开普勒(Kepler)-22 b"、"TRAPPIST-1 e") | | `hostname` | string | 宿主恒星名称 | | `discoverymethod` | string | 发现方法(凌星法(transit)、视向速度法(radial velocity)等) | | `disc_year` | Int64 | 发现年份 | | `disc_facility` | string | 发现设备名称 | | `pl_orbper` | float | 轨道周期,单位:天 | | `pl_rade` | float | 行星半径,单位:地球半径 | | `pl_bmasse` | float | 行星质量,单位:地球质量 | | `pl_eqt` | float | 平衡温度,单位:开尔文(K) | | `pl_orbsmax` | float | 半长轴,单位:天文单位(AU) | | `pl_orbeccen` | float | 轨道偏心率 | | `st_teff` | float | 恒星有效温度,单位:开尔文(K) | | `st_rad` | float | 恒星半径,单位:太阳半径 | | `st_mass` | float | 恒星质量,单位:太阳质量 | | `sy_dist` | float | 距离,单位:秒差距(pc) | | `sy_vmag` | float | V波段星等 | | `ra` | float | 赤经,单位:度 | | `dec` | float | 赤纬,单位:度 | | `rowupdate` | string | 最后更新日期 | ## 快速统计 - **6150颗** 已确认系外行星 - 最新发现行星:**DMPP-8 b**(2026年) ### 按发现方法分类 | 发现方法 | 数量 | |--------|-------| | 凌星法(transit) | 4517 | | 视向速度法(radial velocity) | 1182 | | 微引力透镜法(microlensing) | 275 | | 直接成像法(imaging) | 94 | | 凌星计时变分法(transit timing variations) | 39 | | 食计时变分法(eclipse timing variations) | 17 | | 轨道亮度调制法(orbital brightness modulation) | 9 | | 脉冲星计时法(pulsar timing) | 8 | ### 按年份统计的新发现行星 | 年份 | 数量 | |------|-------| | 2026 | 63 | | 2025 | 243 | | 2024 | 259 | | 2023 | 324 | | 2022 | 369 | | 2021 | 564 | | 2020 | 234 | | 2019 | 196 | | 2018 | 315 | | 2017 | 152 | ## 使用示例 python from datasets import load_dataset ds = load_dataset("juliensimon/nasa-exoplanets", split="train") df = ds.to_pandas() # Earth-like candidates: rocky, in habitable zone habitable = df[ (df["pl_rade"] < 1.6) & (df["pl_eqt"] > 200) & (df["pl_eqt"] < 310) ] print(f"{len(habitable)} potentially habitable planets") # Transit vs radial velocity discoveries over time transit = df[df["discoverymethod"] == "Transit"] rv = df[df["discoverymethod"] == "Radial Velocity"] # Planets by discovery facility top_facilities = df["disc_facility"].value_counts().head(10) ## 数据来源 所有数据均来自[NASA系外行星档案](https://exoplanetarchive.ipac.caltech.edu/),由加州理工学院IPAC中心受NASA合同委托维护。本数据集通过TAP API接口查询行星系统表获取数据。 ## 更新计划 每周一16:00 UTC通过[GitHub Actions](https://github.com/juliensimon/space-datasets)自动更新。 ## 相关数据集 - [space-track-satcat](https://huggingface.co/datasets/juliensimon/space-track-satcat) — NORAD卫星目录 - [space-launch-log](https://huggingface.co/datasets/juliensimon/space-launch-log) — 来自GCAT的全球发射历史记录 ## 数据流水线 源代码:[juliensimon/space-datasets](https://github.com/juliensimon/space-datasets) ## 引用格式 bibtex @dataset{nasa_exoplanets, author = {Simon, Julien}, title = {NASA Exoplanet Archive}, year = {2026}, publisher = {Hugging Face}, url = {https://huggingface.co/datasets/juliensimon/nasa-exoplanets}, note = {Based on data from the NASA Exoplanet Archive, operated by Caltech/IPAC} } ## 许可协议 [知识共享署名4.0国际许可协议(CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/)
提供机构:
juliensimon
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作