five

PeytonT/paper_universe_interactive

收藏
Hugging Face2026-04-26 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/PeytonT/paper_universe_interactive
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Paper Universe Interactive Graph viewer: true tags: - datasets - graph - scientific-papers - arxiv - webgl - wasm - visualization size_categories: - 10M<n<100M configs: - config_name: papers_50k data_files: - split: train path: parquet/interactive/papers_50000.parquet - config_name: papers_200k data_files: - split: train path: parquet/interactive/papers_200000.parquet - config_name: papers_all data_files: - split: train path: parquet/interactive/papers_all.parquet - config_name: categories data_files: - split: train path: parquet/interactive/categories.parquet - config_name: years data_files: - split: train path: parquet/interactive/years.parquet --- # Paper Universe Interactive Graph Small static-viewer payload for the Research Library paper universe. This dataset is intentionally separate from `PeytonT/paper_graph`. It contains browser-friendly interactive assets needed by the static WebGL/WASM app. Parquet is the preferred payload format; JSON is retained as a compatibility fallback: - `parquet/interactive/papers_50000.parquet` - `parquet/interactive/papers_200000.parquet` - `parquet/interactive/papers_all.parquet` - `parquet/interactive/categories.parquet` - `parquet/interactive/years.parquet` - fallback JSON under `interactive/` - `interactive/manifest.json` - generated HTML viewers - PNG overview renders - lightweight build/progress manifests It does **not** include full parquet graph splits, paper embeddings, full-text embeddings, KNN edges, topic nodes, or `papers_all.json`. Selected paper views should compute nearest-neighbor context from the loaded interactive level in the browser. The full paper KNN parquet remains in `PeytonT/paper_graph` and is intentionally excluded from this lightweight static payload. ## Size - parquet payload bytes: about `68.2 MiB` - approximate total with JSON fallback and renders: `138 MiB` - largest preferred payload: `parquet/interactive/papers_all.parquet` ## Intended Use Static app URL pattern: ```text https://huggingface.co/datasets/PeytonT/paper_universe_interactive/resolve/main/interactive/manifest.json ``` The static `research_library` app prefers the Parquet paths in `interactive/manifest.json` through DuckDB-WASM, caches them client-side, and uses WASM to normalize coordinate buffers before WebGL rendering. If Parquet loading is unavailable, it falls back to the JSON files. ## Relationship To Full Dataset Use `PeytonT/paper_graph` for the full research graph and embeddings. Use this dataset for fast static browser visualization.
提供机构:
PeytonT
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作