juliensimon/nasa-exoplanets
收藏Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/juliensimon/nasa-exoplanets
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
pretty_name: "NASA Exoplanet Archive"
language:
- en
description: "Confirmed exoplanets with orbital, stellar, and discovery parameters from the NASA Exoplanet Archive."
task_categories:
- tabular-classification
- tabular-regression
tags:
- exoplanet
- astronomy
- nasa
- transit
- radial-velocity
- kepler
- tess
- open-data
size_categories:
- 1K<n<10K
---
# NASA Exoplanet Archive


All confirmed exoplanets from the [NASA Exoplanet Archive](https://exoplanetarchive.ipac.caltech.edu/),
with orbital parameters, stellar properties, and discovery metadata. Currently **6,150** confirmed planets.
## Dataset description
The NASA Exoplanet Archive is the authoritative database of confirmed exoplanets, maintained by
Caltech/IPAC under contract with NASA. Each entry represents a confirmed planet with its best-available
physical and orbital parameters, host star properties, and discovery information. This dataset uses the
Planetary Systems (`ps`) table with `default_flag=1` to select one row per planet with the default
parameter set.
## Schema
| Column | Type | Description |
|--------|------|-------------|
| `pl_name` | string | Planet name (e.g. "Kepler-22 b", "TRAPPIST-1 e") |
| `hostname` | string | Host star name |
| `discoverymethod` | string | Discovery method (Transit, Radial Velocity, etc.) |
| `disc_year` | Int64 | Year of discovery |
| `disc_facility` | string | Discovery facility name |
| `pl_orbper` | float | Orbital period in days |
| `pl_rade` | float | Planet radius in Earth radii |
| `pl_bmasse` | float | Planet mass in Earth masses |
| `pl_eqt` | float | Equilibrium temperature in K |
| `pl_orbsmax` | float | Semi-major axis in AU |
| `pl_orbeccen` | float | Orbital eccentricity |
| `st_teff` | float | Stellar effective temperature in K |
| `st_rad` | float | Stellar radius in solar radii |
| `st_mass` | float | Stellar mass in solar masses |
| `sy_dist` | float | Distance in parsecs |
| `sy_vmag` | float | V-band magnitude |
| `ra` | float | Right ascension in degrees |
| `dec` | float | Declination in degrees |
| `rowupdate` | string | Date of last row update |
## Quick stats
- **6,150** confirmed exoplanets
- Most recent discovery: **DMPP-8 b** (2026)
### By discovery method
| Method | Count |
|--------|-------|
| Transit | 4,517 |
| Radial Velocity | 1,182 |
| Microlensing | 275 |
| Imaging | 94 |
| Transit Timing Variations | 39 |
| Eclipse Timing Variations | 17 |
| Orbital Brightness Modulation | 9 |
| Pulsar Timing | 8 |
### Recent discoveries by year
| Year | Count |
|------|-------|
| 2026 | 63 |
| 2025 | 243 |
| 2024 | 259 |
| 2023 | 324 |
| 2022 | 369 |
| 2021 | 564 |
| 2020 | 234 |
| 2019 | 196 |
| 2018 | 315 |
| 2017 | 152 |
## Usage
```python
from datasets import load_dataset
ds = load_dataset("juliensimon/nasa-exoplanets", split="train")
df = ds.to_pandas()
# Earth-like candidates: rocky, in habitable zone
habitable = df[
(df["pl_rade"] < 1.6) &
(df["pl_eqt"] > 200) & (df["pl_eqt"] < 310)
]
print(f"{len(habitable)} potentially habitable planets")
# Transit vs radial velocity discoveries over time
transit = df[df["discoverymethod"] == "Transit"]
rv = df[df["discoverymethod"] == "Radial Velocity"]
# Planets by discovery facility
top_facilities = df["disc_facility"].value_counts().head(10)
```
## Data source
All data comes from the [NASA Exoplanet Archive](https://exoplanetarchive.ipac.caltech.edu/),
maintained by Caltech/IPAC under contract with NASA. Data is queried via the TAP API using
the Planetary Systems table.
## Update schedule
Weekly on Monday at 16:00 UTC via [GitHub Actions](https://github.com/juliensimon/space-datasets).
## Related datasets
- [space-track-satcat](https://huggingface.co/datasets/juliensimon/space-track-satcat) — NORAD Satellite Catalog
- [space-launch-log](https://huggingface.co/datasets/juliensimon/space-launch-log) — Global launch history from GCAT
## Pipeline
Source code: [juliensimon/space-datasets](https://github.com/juliensimon/space-datasets)
## Citation
```bibtex
@dataset{nasa_exoplanets,
author = {Simon, Julien},
title = {NASA Exoplanet Archive},
year = {2026},
publisher = {Hugging Face},
url = {https://huggingface.co/datasets/juliensimon/nasa-exoplanets},
note = {Based on data from the NASA Exoplanet Archive, operated by Caltech/IPAC}
}
```
## License
[CC-BY-4.0](https://creativecommons.org/licenses/by/4.0/)
---
license: CC BY 4.0(知识共享署名4.0国际许可协议)
pretty_name: "NASA系外行星档案(NASA Exoplanet Archive)"
language:
- en
description: "收录来自NASA系外行星档案(NASA Exoplanet Archive)的已确认系外行星数据,包含轨道参数、恒星属性与发现相关参数。"
task_categories:
- tabular-classification
- tabular-regression
tags:
- 系外行星(exoplanet)
- 天文学(astronomy)
- NASA(nasa)
- 凌星法(transit)
- 视向速度法(radial-velocity)
- 开普勒(kepler)
- TESS(tess)
- 开放数据(open-data)
size_categories:
- 1K<n<10K
---
# NASA系外行星档案(NASA Exoplanet Archive)


本数据集收录来自[NASA系外行星档案](https://exoplanetarchive.ipac.caltech.edu/)的全部已确认系外行星数据,包含轨道参数、恒星属性与发现元数据,目前共收录**6150颗**已确认系外行星。
## 数据集说明
NASA系外行星档案是由加州理工学院IPAC中心受NASA合同委托维护的权威已确认系外行星数据库。每条数据对应一颗已确认系外行星,包含当前可获取的最优物理与轨道参数、宿主恒星属性以及发现相关信息。本数据集选取行星系统表(Planetary Systems,`ps`)中`default_flag=1`的条目,为每颗行星选取一套默认参数对应的单行数据。
## 数据结构
| 列名 | 数据类型 | 描述 |
|--------|------|-------------|
| `pl_name` | string | 行星名称(例如"开普勒(Kepler)-22 b"、"TRAPPIST-1 e") |
| `hostname` | string | 宿主恒星名称 |
| `discoverymethod` | string | 发现方法(凌星法(transit)、视向速度法(radial velocity)等) |
| `disc_year` | Int64 | 发现年份 |
| `disc_facility` | string | 发现设备名称 |
| `pl_orbper` | float | 轨道周期,单位:天 |
| `pl_rade` | float | 行星半径,单位:地球半径 |
| `pl_bmasse` | float | 行星质量,单位:地球质量 |
| `pl_eqt` | float | 平衡温度,单位:开尔文(K) |
| `pl_orbsmax` | float | 半长轴,单位:天文单位(AU) |
| `pl_orbeccen` | float | 轨道偏心率 |
| `st_teff` | float | 恒星有效温度,单位:开尔文(K) |
| `st_rad` | float | 恒星半径,单位:太阳半径 |
| `st_mass` | float | 恒星质量,单位:太阳质量 |
| `sy_dist` | float | 距离,单位:秒差距(pc) |
| `sy_vmag` | float | V波段星等 |
| `ra` | float | 赤经,单位:度 |
| `dec` | float | 赤纬,单位:度 |
| `rowupdate` | string | 最后更新日期 |
## 快速统计
- **6150颗** 已确认系外行星
- 最新发现行星:**DMPP-8 b**(2026年)
### 按发现方法分类
| 发现方法 | 数量 |
|--------|-------|
| 凌星法(transit) | 4517 |
| 视向速度法(radial velocity) | 1182 |
| 微引力透镜法(microlensing) | 275 |
| 直接成像法(imaging) | 94 |
| 凌星计时变分法(transit timing variations) | 39 |
| 食计时变分法(eclipse timing variations) | 17 |
| 轨道亮度调制法(orbital brightness modulation) | 9 |
| 脉冲星计时法(pulsar timing) | 8 |
### 按年份统计的新发现行星
| 年份 | 数量 |
|------|-------|
| 2026 | 63 |
| 2025 | 243 |
| 2024 | 259 |
| 2023 | 324 |
| 2022 | 369 |
| 2021 | 564 |
| 2020 | 234 |
| 2019 | 196 |
| 2018 | 315 |
| 2017 | 152 |
## 使用示例
python
from datasets import load_dataset
ds = load_dataset("juliensimon/nasa-exoplanets", split="train")
df = ds.to_pandas()
# Earth-like candidates: rocky, in habitable zone
habitable = df[
(df["pl_rade"] < 1.6) &
(df["pl_eqt"] > 200) & (df["pl_eqt"] < 310)
]
print(f"{len(habitable)} potentially habitable planets")
# Transit vs radial velocity discoveries over time
transit = df[df["discoverymethod"] == "Transit"]
rv = df[df["discoverymethod"] == "Radial Velocity"]
# Planets by discovery facility
top_facilities = df["disc_facility"].value_counts().head(10)
## 数据来源
所有数据均来自[NASA系外行星档案](https://exoplanetarchive.ipac.caltech.edu/),由加州理工学院IPAC中心受NASA合同委托维护。本数据集通过TAP API接口查询行星系统表获取数据。
## 更新计划
每周一16:00 UTC通过[GitHub Actions](https://github.com/juliensimon/space-datasets)自动更新。
## 相关数据集
- [space-track-satcat](https://huggingface.co/datasets/juliensimon/space-track-satcat) — NORAD卫星目录
- [space-launch-log](https://huggingface.co/datasets/juliensimon/space-launch-log) — 来自GCAT的全球发射历史记录
## 数据流水线
源代码:[juliensimon/space-datasets](https://github.com/juliensimon/space-datasets)
## 引用格式
bibtex
@dataset{nasa_exoplanets,
author = {Simon, Julien},
title = {NASA Exoplanet Archive},
year = {2026},
publisher = {Hugging Face},
url = {https://huggingface.co/datasets/juliensimon/nasa-exoplanets},
note = {Based on data from the NASA Exoplanet Archive, operated by Caltech/IPAC}
}
## 许可协议
[知识共享署名4.0国际许可协议(CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/)
提供机构:
juliensimon



