davanstrien/AmericanStories-parquet
收藏Hugging Face2023-10-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/davanstrien/AmericanStories-parquet
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: '1774'
path: data/1774-*
- split: '1798'
path: data/1798-*
- split: '1799'
path: data/1799-*
- split: '1800'
path: data/1800-*
- split: '1801'
path: data/1801-*
- split: '1802'
path: data/1802-*
- split: '1803'
path: data/1803-*
- split: '1804'
path: data/1804-*
- split: '1805'
path: data/1805-*
- split: '1806'
path: data/1806-*
- split: '1807'
path: data/1807-*
- split: '1808'
path: data/1808-*
- split: '1809'
path: data/1809-*
- split: '1810'
path: data/1810-*
- split: '1811'
path: data/1811-*
- split: '1812'
path: data/1812-*
- split: '1813'
path: data/1813-*
- split: '1814'
path: data/1814-*
- split: '1815'
path: data/1815-*
- split: '1816'
path: data/1816-*
- split: '1817'
path: data/1817-*
- split: '1818'
path: data/1818-*
- split: '1819'
path: data/1819-*
- split: '1820'
path: data/1820-*
- split: '1821'
path: data/1821-*
- split: '1822'
path: data/1822-*
- split: '1823'
path: data/1823-*
- split: '1824'
path: data/1824-*
- split: '1825'
path: data/1825-*
- split: '1826'
path: data/1826-*
- split: '1827'
path: data/1827-*
- split: '1828'
path: data/1828-*
- split: '1829'
path: data/1829-*
- split: '1830'
path: data/1830-*
- split: '1831'
path: data/1831-*
- split: '1832'
path: data/1832-*
- split: '1833'
path: data/1833-*
- split: '1834'
path: data/1834-*
- split: '1835'
path: data/1835-*
- split: '1836'
path: data/1836-*
- split: '1837'
path: data/1837-*
- split: '1838'
path: data/1838-*
- split: '1839'
path: data/1839-*
- split: '1840'
path: data/1840-*
- split: '1841'
path: data/1841-*
- split: '1842'
path: data/1842-*
- split: '1843'
path: data/1843-*
- split: '1844'
path: data/1844-*
- split: '1845'
path: data/1845-*
- split: '1846'
path: data/1846-*
- split: '1847'
path: data/1847-*
- split: '1848'
path: data/1848-*
- split: '1849'
path: data/1849-*
- split: '1850'
path: data/1850-*
- split: '1851'
path: data/1851-*
- split: '1852'
path: data/1852-*
- split: '1853'
path: data/1853-*
- split: '1854'
path: data/1854-*
- split: '1855'
path: data/1855-*
- split: '1856'
path: data/1856-*
- split: '1857'
path: data/1857-*
- split: '1858'
path: data/1858-*
- split: '1859'
path: data/1859-*
- split: '1860'
path: data/1860-*
- split: '1861'
path: data/1861-*
- split: '1862'
path: data/1862-*
- split: '1863'
path: data/1863-*
- split: '1864'
path: data/1864-*
- split: '1865'
path: data/1865-*
- split: '1866'
path: data/1866-*
- split: '1867'
path: data/1867-*
- split: '1868'
path: data/1868-*
- split: '1869'
path: data/1869-*
- split: '1870'
path: data/1870-*
- split: '1871'
path: data/1871-*
- split: '1872'
path: data/1872-*
- split: '1873'
path: data/1873-*
- split: '1874'
path: data/1874-*
- split: '1875'
path: data/1875-*
- split: '1876'
path: data/1876-*
- split: '1877'
path: data/1877-*
- split: '1878'
path: data/1878-*
- split: '1879'
path: data/1879-*
- split: '1880'
path: data/1880-*
- split: '1881'
path: data/1881-*
- split: '1882'
path: data/1882-*
- split: '1883'
path: data/1883-*
- split: '1884'
path: data/1884-*
- split: '1885'
path: data/1885-*
- split: '1886'
path: data/1886-*
- split: '1887'
path: data/1887-*
- split: '1888'
path: data/1888-*
- split: '1889'
path: data/1889-*
- split: '1890'
path: data/1890-*
- split: '1891'
path: data/1891-*
- split: '1892'
path: data/1892-*
- split: '1893'
path: data/1893-*
- split: '1894'
path: data/1894-*
- split: '1895'
path: data/1895-*
- split: '1896'
path: data/1896-*
- split: '1897'
path: data/1897-*
- split: '1898'
path: data/1898-*
- split: '1899'
path: data/1899-*
- split: '1900'
path: data/1900-*
- split: '1901'
path: data/1901-*
- split: '1902'
path: data/1902-*
- split: '1903'
path: data/1903-*
- split: '1904'
path: data/1904-*
- split: '1905'
path: data/1905-*
- split: '1906'
path: data/1906-*
- split: '1907'
path: data/1907-*
- split: '1908'
path: data/1908-*
- split: '1909'
path: data/1909-*
- split: '1910'
path: data/1910-*
- split: '1911'
path: data/1911-*
- split: '1912'
path: data/1912-*
- split: '1913'
path: data/1913-*
- split: '1914'
path: data/1914-*
- split: '1915'
path: data/1915-*
- split: '1916'
path: data/1916-*
- split: '1917'
path: data/1917-*
- split: '1918'
path: data/1918-*
- split: '1919'
path: data/1919-*
- split: '1920'
path: data/1920-*
- split: '1921'
path: data/1921-*
- split: '1922'
path: data/1922-*
- split: '1923'
path: data/1923-*
- split: '1924'
path: data/1924-*
- split: '1925'
path: data/1925-*
- split: '1926'
path: data/1926-*
- split: '1927'
path: data/1927-*
- split: '1928'
path: data/1928-*
- split: '1929'
path: data/1929-*
- split: '1930'
path: data/1930-*
- split: '1931'
path: data/1931-*
- split: '1932'
path: data/1932-*
- split: '1933'
path: data/1933-*
- split: '1934'
path: data/1934-*
- split: '1935'
path: data/1935-*
- split: '1936'
path: data/1936-*
- split: '1937'
path: data/1937-*
- split: '1938'
path: data/1938-*
- split: '1939'
path: data/1939-*
- split: '1940'
path: data/1940-*
- split: '1941'
path: data/1941-*
- split: '1942'
path: data/1942-*
- split: '1943'
path: data/1943-*
- split: '1944'
path: data/1944-*
- split: '1945'
path: data/1945-*
- split: '1946'
path: data/1946-*
- split: '1947'
path: data/1947-*
- split: '1948'
path: data/1948-*
- split: '1949'
path: data/1949-*
- split: '1950'
path: data/1950-*
- split: '1951'
path: data/1951-*
- split: '1952'
path: data/1952-*
- split: '1953'
path: data/1953-*
- split: '1954'
path: data/1954-*
- split: '1955'
path: data/1955-*
- split: '1956'
path: data/1956-*
- split: '1957'
path: data/1957-*
- split: '1958'
path: data/1958-*
- split: '1959'
path: data/1959-*
- split: '1960'
path: data/1960-*
- split: '1961'
path: data/1961-*
- split: '1962'
path: data/1962-*
- split: '1963'
path: data/1963-*
dataset_info:
features:
- name: article_id
dtype: string
- name: newspaper_name
dtype: string
- name: edition
dtype: string
- name: date
dtype: string
- name: page
dtype: string
- name: headline
dtype: string
- name: byline
dtype: string
- name: article
dtype: string
splits:
- name: '1774'
num_bytes: 22245
num_examples: 12
- name: '1798'
num_bytes: 72288
num_examples: 73
- name: '1799'
num_bytes: 946373
num_examples: 623
- name: '1800'
num_bytes: 38139
num_examples: 45
- name: '1801'
num_bytes: 94991
num_examples: 93
- name: '1802'
num_bytes: 1463322
num_examples: 1158
- name: '1803'
num_bytes: 799797
num_examples: 654
- name: '1804'
num_bytes: 120141
num_examples: 103
- name: '1805'
num_bytes: 2475205
num_examples: 2303
- name: '1806'
num_bytes: 2043729
num_examples: 1860
- name: '1807'
num_bytes: 310568
num_examples: 315
- name: '1808'
num_bytes: 75639
num_examples: 45
- name: '1809'
num_bytes: 430706
num_examples: 422
- name: '1810'
num_bytes: 1319755
num_examples: 982
- name: '1811'
num_bytes: 117701
num_examples: 110
- name: '1812'
num_bytes: 75299
num_examples: 67
- name: '1813'
num_bytes: 290966
num_examples: 242
- name: '1814'
num_bytes: 378212
num_examples: 379
- name: '1815'
num_bytes: 185179
num_examples: 160
- name: '1816'
num_bytes: 495706
num_examples: 409
- name: '1817'
num_bytes: 446354
num_examples: 394
- name: '1818'
num_bytes: 1257916
num_examples: 1108
- name: '1819'
num_bytes: 2476297
num_examples: 1997
- name: '1820'
num_bytes: 611884
num_examples: 433
- name: '1821'
num_bytes: 347361
num_examples: 270
- name: '1822'
num_bytes: 286227
num_examples: 264
- name: '1823'
num_bytes: 2030816
num_examples: 1113
- name: '1824'
num_bytes: 5171191
num_examples: 3110
- name: '1825'
num_bytes: 6341915
num_examples: 4005
- name: '1826'
num_bytes: 10462258
num_examples: 7079
- name: '1827'
num_bytes: 11634621
num_examples: 7213
- name: '1828'
num_bytes: 10253681
num_examples: 6350
- name: '1829'
num_bytes: 4021832
num_examples: 2296
- name: '1830'
num_bytes: 8321949
num_examples: 4232
- name: '1831'
num_bytes: 16796125
num_examples: 9699
- name: '1832'
num_bytes: 9982722
num_examples: 6565
- name: '1833'
num_bytes: 6653515
num_examples: 4108
- name: '1834'
num_bytes: 7099875
num_examples: 4632
- name: '1835'
num_bytes: 9066392
num_examples: 6168
- name: '1836'
num_bytes: 10473366
num_examples: 7375
- name: '1837'
num_bytes: 21002773
num_examples: 13609
- name: '1838'
num_bytes: 13735809
num_examples: 8492
- name: '1839'
num_bytes: 12512339
num_examples: 8938
- name: '1840'
num_bytes: 12647911
num_examples: 8052
- name: '1841'
num_bytes: 39146669
num_examples: 30019
- name: '1842'
num_bytes: 26218700
num_examples: 21290
- name: '1843'
num_bytes: 50447372
num_examples: 41657
- name: '1844'
num_bytes: 79351064
num_examples: 61373
- name: '1845'
num_bytes: 131632573
num_examples: 95921
- name: '1846'
num_bytes: 81086068
num_examples: 70331
- name: '1847'
num_bytes: 32733527
num_examples: 24354
- name: '1848'
num_bytes: 44577825
num_examples: 32531
- name: '1849'
num_bytes: 53877014
num_examples: 42711
- name: '1850'
num_bytes: 76697622
num_examples: 49992
- name: '1851'
num_bytes: 128372084
num_examples: 90184
- name: '1852'
num_bytes: 67005975
num_examples: 51172
- name: '1853'
num_bytes: 54210932
num_examples: 48130
- name: '1854'
num_bytes: 150406197
num_examples: 118825
- name: '1855'
num_bytes: 115893679
num_examples: 99390
- name: '1856'
num_bytes: 188859881
num_examples: 157592
- name: '1857'
num_bytes: 152841585
num_examples: 129179
- name: '1858'
num_bytes: 214657030
num_examples: 171877
- name: '1859'
num_bytes: 178711188
num_examples: 160924
- name: '1860'
num_bytes: 163889573
num_examples: 150590
- name: '1861'
num_bytes: 215595661
num_examples: 173990
- name: '1862'
num_bytes: 228323685
num_examples: 171021
- name: '1863'
num_bytes: 197294365
num_examples: 151485
- name: '1864'
num_bytes: 125113713
num_examples: 94415
- name: '1865'
num_bytes: 133515217
num_examples: 99728
- name: '1866'
num_bytes: 180768118
num_examples: 135316
- name: '1867'
num_bytes: 213571876
num_examples: 161180
- name: '1868'
num_bytes: 202156635
num_examples: 140521
- name: '1869'
num_bytes: 236506656
num_examples: 171455
- name: '1870'
num_bytes: 242779857
num_examples: 174061
- name: '1871'
num_bytes: 203189927
num_examples: 151652
- name: '1872'
num_bytes: 242624062
num_examples: 194784
- name: '1873'
num_bytes: 302626176
num_examples: 241902
- name: '1874'
num_bytes: 280814742
num_examples: 213813
- name: '1875'
num_bytes: 319815222
num_examples: 274269
- name: '1876'
num_bytes: 381483980
num_examples: 288199
- name: '1877'
num_bytes: 317703263
num_examples: 254946
- name: '1878'
num_bytes: 381274032
num_examples: 307865
- name: '1879'
num_bytes: 371703798
num_examples: 287784
- name: '1880'
num_bytes: 296465631
num_examples: 272352
- name: '1881'
num_bytes: 294568051
num_examples: 270228
- name: '1882'
num_bytes: 340511400
num_examples: 311920
- name: '1883'
num_bytes: 419078041
num_examples: 387589
- name: '1884'
num_bytes: 329666364
num_examples: 304242
- name: '1885'
num_bytes: 348144660
num_examples: 318732
- name: '1886'
num_bytes: 431746663
num_examples: 423718
- name: '1887'
num_bytes: 493647568
num_examples: 494559
- name: '1888'
num_bytes: 564523528
num_examples: 547165
- name: '1889'
num_bytes: 558168324
num_examples: 536750
- name: '1890'
num_bytes: 566964770
num_examples: 540615
- name: '1891'
num_bytes: 641124243
num_examples: 620461
- name: '1892'
num_bytes: 524812242
num_examples: 527044
- name: '1893'
num_bytes: 645853680
num_examples: 656805
- name: '1894'
num_bytes: 790577208
num_examples: 795408
- name: '1895'
num_bytes: 890097151
num_examples: 897766
- name: '1896'
num_bytes: 1235234882
num_examples: 1175701
- name: '1897'
num_bytes: 1252347746
num_examples: 1275895
- name: '1898'
num_bytes: 1286411001
num_examples: 1323842
- name: '1899'
num_bytes: 1176418162
num_examples: 1218682
- name: '1900'
num_bytes: 1069983237
num_examples: 1118970
- name: '1901'
num_bytes: 1478945214
num_examples: 1468648
- name: '1902'
num_bytes: 1376703767
num_examples: 1417935
- name: '1903'
num_bytes: 1255538379
num_examples: 1319686
- name: '1904'
num_bytes: 1232185827
num_examples: 1340868
- name: '1905'
num_bytes: 1563178627
num_examples: 1635134
- name: '1906'
num_bytes: 1632815247
num_examples: 1683643
- name: '1907'
num_bytes: 1647491794
num_examples: 1714613
- name: '1908'
num_bytes: 1771267430
num_examples: 1842874
- name: '1909'
num_bytes: 1844179657
num_examples: 1926228
- name: '1910'
num_bytes: 1589052587
num_examples: 1684263
- name: '1911'
num_bytes: 1402309564
num_examples: 1510259
- name: '1912'
num_bytes: 1621648367
num_examples: 1774149
- name: '1913'
num_bytes: 1613599136
num_examples: 1822206
- name: '1914'
num_bytes: 1736284455
num_examples: 1931901
- name: '1915'
num_bytes: 1690248452
num_examples: 1878654
- name: '1916'
num_bytes: 1633563499
num_examples: 1838797
- name: '1917'
num_bytes: 1605677226
num_examples: 1810757
- name: '1918'
num_bytes: 1803695589
num_examples: 1920102
- name: '1919'
num_bytes: 1831703767
num_examples: 1981192
- name: '1920'
num_bytes: 1901882705
num_examples: 2041192
- name: '1921'
num_bytes: 2264618667
num_examples: 2334112
- name: '1922'
num_bytes: 2372137567
num_examples: 2405974
- name: '1923'
num_bytes: 812177597
num_examples: 880372
- name: '1924'
num_bytes: 800835690
num_examples: 845520
- name: '1925'
num_bytes: 601426023
num_examples: 662322
- name: '1926'
num_bytes: 565307890
num_examples: 623765
- name: '1927'
num_bytes: 460501197
num_examples: 504835
- name: '1928'
num_bytes: 452526140
num_examples: 487302
- name: '1929'
num_bytes: 366246066
num_examples: 421909
- name: '1930'
num_bytes: 437657836
num_examples: 492695
- name: '1931'
num_bytes: 441972257
num_examples: 493816
- name: '1932'
num_bytes: 640501746
num_examples: 664615
- name: '1933'
num_bytes: 634373318
num_examples: 642380
- name: '1934'
num_bytes: 641841040
num_examples: 654342
- name: '1935'
num_bytes: 612406176
num_examples: 635554
- name: '1936'
num_bytes: 621035178
num_examples: 662015
- name: '1937'
num_bytes: 625107933
num_examples: 676549
- name: '1938'
num_bytes: 616370880
num_examples: 665274
- name: '1939'
num_bytes: 525913265
num_examples: 556283
- name: '1940'
num_bytes: 471830118
num_examples: 496662
- name: '1941'
num_bytes: 599694786
num_examples: 637200
- name: '1942'
num_bytes: 508785410
num_examples: 523923
- name: '1943'
num_bytes: 452079475
num_examples: 467200
- name: '1944'
num_bytes: 442871777
num_examples: 433769
- name: '1945'
num_bytes: 588623743
num_examples: 588477
- name: '1946'
num_bytes: 526027876
num_examples: 470895
- name: '1947'
num_bytes: 461281363
num_examples: 393086
- name: '1948'
num_bytes: 442999943
num_examples: 396660
- name: '1949'
num_bytes: 421752000
num_examples: 419854
- name: '1950'
num_bytes: 403717616
num_examples: 415416
- name: '1951'
num_bytes: 409600217
num_examples: 419622
- name: '1952'
num_bytes: 397051717
num_examples: 396420
- name: '1953'
num_bytes: 366253682
num_examples: 358332
- name: '1954'
num_bytes: 263197428
num_examples: 266338
- name: '1955'
num_bytes: 268993926
num_examples: 273576
- name: '1956'
num_bytes: 85126796
num_examples: 98035
- name: '1957'
num_bytes: 83757036
num_examples: 93543
- name: '1958'
num_bytes: 85807593
num_examples: 98688
- name: '1959'
num_bytes: 112707174
num_examples: 129452
- name: '1960'
num_bytes: 300484826
num_examples: 344550
- name: '1961'
num_bytes: 297225753
num_examples: 339076
- name: '1962'
num_bytes: 231525869
num_examples: 264724
- name: '1963'
num_bytes: 197520960
num_examples: 226859
download_size: 48388744959
dataset_size: 76303058024
---
# Dataset Card for "AmericanStories-parquet"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
davanstrien
原始信息汇总
数据集概述
数据集配置
- 配置名称: default
- 数据文件路径:
- 分割: 1774, 路径: data/1774-*
- 分割: 1798, 路径: data/1798-*
- 分割: 1799, 路径: data/1799-*
- 分割: 1800, 路径: data/1800-*
- 分割: 1801, 路径: data/1801-*
- 分割: 1802, 路径: data/1802-*
- 分割: 1803, 路径: data/1803-*
- 分割: 1804, 路径: data/1804-*
- 分割: 1805, 路径: data/1805-*
- 分割: 1806, 路径: data/1806-*
- 分割: 1807, 路径: data/1807-*
- 分割: 1808, 路径: data/1808-*
- 分割: 1809, 路径: data/1809-*
- 分割: 1810, 路径: data/1810-*
- 分割: 1811, 路径: data/1811-*
- 分割: 1812, 路径: data/1812-*
- 分割: 1813, 路径: data/1813-*
- 分割: 1814, 路径: data/1814-*
- 分割: 1815, 路径: data/1815-*
- 分割: 1816, 路径: data/1816-*
- 分割: 1817, 路径: data/1817-*
- 分割: 1818, 路径: data/1818-*
- 分割: 1819, 路径: data/1819-*
- 分割: 1820, 路径: data/1820-*
- 分割: 1821, 路径: data/1821-*
- 分割: 1822, 路径: data/1822-*
- 分割: 1823, 路径: data/1823-*
- 分割: 1824, 路径: data/1824-*
- 分割: 1825, 路径: data/1825-*
- 分割: 1826, 路径: data/1826-*
- 分割: 1827, 路径: data/1827-*
- 分割: 1828, 路径: data/1828-*
- 分割: 1829, 路径: data/1829-*
- 分割: 1830, 路径: data/1830-*
- 分割: 1831, 路径: data/1831-*
- 分割: 1832, 路径: data/1832-*
- 分割: 1833, 路径: data/1833-*
- 分割: 1834, 路径: data/1834-*
- 分割: 1835, 路径: data/1835-*
- 分割: 1836, 路径: data/1836-*
- 分割: 1837, 路径: data/1837-*
- 分割: 1838, 路径: data/1838-*
- 分割: 1839, 路径: data/1839-*
- 分割: 1840, 路径: data/1840-*
- 分割: 1841, 路径: data/1841-*
- 分割: 1842, 路径: data/1842-*
- 分割: 1843, 路径: data/1843-*
- 分割: 1844, 路径: data/1844-*
- 分割: 1845, 路径: data/1845-*
- 分割: 1846, 路径: data/1846-*
- 分割: 1847, 路径: data/1847-*
- 分割: 1848, 路径: data/1848-*
- 分割: 1849, 路径: data/1849-*
- 分割: 1850, 路径: data/1850-*
- 分割: 1851, 路径: data/1851-*
- 分割: 1852, 路径: data/1852-*
- 分割: 1853, 路径: data/1853-*
- 分割: 1854, 路径: data/1854-*
- 分割: 1855, 路径: data/1855-*
- 分割: 1856, 路径: data/1856-*
- 分割: 1857, 路径: data/1857-*
- 分割: 1858, 路径: data/1858-*
- 分割: 1859, 路径: data/1859-*
- 分割: 1860, 路径: data/1860-*
- 分割: 1861, 路径: data/1861-*
- 分割: 1862, 路径: data/1862-*
- 分割: 1863, 路径: data/1863-*
- 分割: 1864, 路径: data/1864-*
- 分割: 1865, 路径: data/1865-*
- 分割: 1866, 路径: data/1866-*
- 分割: 1867, 路径: data/1867-*
- 分割: 1868, 路径: data/1868-*
- 分割: 1869, 路径: data/1869-*
- 分割: 1870, 路径: data/1870-*
- 分割: 1871, 路径: data/1871-*
- 分割: 1872, 路径: data/1872-*
- 分割: 1873, 路径: data/1873-*
- 分割: 1874, 路径: data/1874-*
- 分割: 1875, 路径: data/1875-*
- 分割: 1876, 路径: data/1876-*
- 分割: 1877, 路径: data/1877-*
- 分割: 1878, 路径: data/1878-*
- 分割: 1879, 路径: data/1879-*
- 分割: 1880, 路径: data/1880-*
- 分割: 1881, 路径: data/1881-*
- 分割: 1882, 路径: data/1882-*
- 分割: 1883, 路径: data/1883-*
- 分割: 1884, 路径: data/1884-*
- 分割: 1885, 路径: data/1885-*
- 分割: 1886, 路径: data/1886-*
- 分割: 1887, 路径: data/1887-*
- 分割: 1888, 路径: data/1888-*
- 分割: 1889, 路径: data/1889-*
- 分割: 1890, 路径: data/1890-*
- 分割: 1891, 路径: data/1891-*
- 分割: 1892, 路径: data/1892-*
- 分割: 1893, 路径: data/1893-*
- 分割: 1894, 路径: data/1894-*
- 分割: 1895, 路径: data/1895-*
- 分割: 1896, 路径: data/1896-*
- 分割: 1897, 路径: data/1897-*
- 分割: 1898, 路径: data/1898-*
- 分割: 1899, 路径: data/1899-*
- 分割: 1900, 路径: data/1900-*
- 分割: 1901, 路径: data/1901-*
- 分割: 1902, 路径: data/1902-*
- 分割: 1903, 路径: data/1903-*
- 分割: 1904, 路径: data/1904-*
- 分割: 1905, 路径: data/1905-*
- 分割: 1906, 路径: data/1906-*
- 分割: 1907, 路径: data/1907-*
- 分割: 1908, 路径: data/1908-*
- 分割: 1909, 路径: data/1909-*
- 分割: 1910, 路径: data/1910-*
- 分割: 1911, 路径: data/1911-*
- 分割: 1912, 路径: data/1912-*
- 分割: 1913, 路径: data/1913-*
- 分割: 1914, 路径: data/1914-*
- 分割: 1915, 路径: data/1915-*
- 分割: 1916, 路径: data/1916-*
- 分割: 1917, 路径: data/1917-*
- 分割: 1918, 路径: data/1918-*
- 分割: 1919, 路径: data/1919-*
- 分割: 1920, 路径: data/1920-*
- 分割: 1921, 路径: data/1921-*
- 分割: 1922, 路径: data/1922-*
- 分割: 1923, 路径: data/1923-*
- 分割: 1924, 路径: data/1924-*
- 分割: 1925, 路径: data/1925-*
- 分割: 1926, 路径: data/1926-*
- 分割: 1927, 路径: data/1927-*
- 分割: 1928, 路径: data/1928-*
- 分割: 1929, 路径: data/1929-*
- 分割: 1930, 路径: data/1930-*
- 分割: 1931, 路径: data/1931-*
- 分割: 1932, 路径: data/1932-*
- 分割: 1933, 路径: data/1933-*
- 分割: 1934, 路径: data/1934-*
- 分割: 1935, 路径: data/1935-*
- 分割: 1936, 路径: data/1936-*
- 分割: 1937, 路径: data/1937-*
- 分割: 1938, 路径: data/1938-*
- 分割: 1939, 路径: data/1939-*
- 分割: 1940, 路径: data/1940-*
- 分割: 1941, 路径: data/1941-*
- 分割: 1942, 路径: data/1942-*
- 分割: 1943, 路径: data/1943-*
- 分割: 1944, 路径: data/1944-*
- 分割: 1945, 路径: data/1945-*
- 分割: 1946, 路径: data/1946-*
- 分割: 1947, 路径: data/1947-*
- 分割: 1948, 路径: data/1948-*
- 分割: 1949, 路径: data/1949-*
- 分割: 1950, 路径: data/1950-*
- 分割: 1951, 路径: data/1951-*
- 分割: 1952, 路径: data/1952-*
- 分割: 1953, 路径: data/1953-*
- 分割: 1954, 路径: data/1954-*
- 分割: 1955, 路径: data/1955-*
- 分割: 1956, 路径: data/1956-*
- 分割: 1957, 路径: data/1957-*
- 分割: 1958, 路径: data/1958-*
- 分割: 1959, 路径: data/1959-*
- 分割: 1960, 路径: data/1960-*
- 分割: 1961, 路径: data/1961-*
- 分割: 1962, 路径: data/1962-*
- 分割: 1963, 路径: data/1963-*
数据集信息
-
特征:
- 名称: article_id, 数据类型: string
- 名称: newspaper_name, 数据类型: string
- 名称: edition, 数据类型: string
- 名称: date, 数据类型: string
- 名称: page, 数据类型: string
- 名称: headline, 数据类型: string
- 名称: byline, 数据类型: string
- 名称: article, 数据类型: string
-
分割信息:
- 名称: 1774, 字节数: 22245, 样本数: 12
- 名称: 1798, 字节数: 72288, 样本数: 73
- 名称: 1799, 字节数: 946373, 样本数: 623
- 名称: 1800, 字节数: 38139, 样本数: 45
- 名称: 1801, 字节数: 94991, 样本数: 93
- 名称: 1802, 字节数: 1463322, 样本数: 1158
- 名称: 1803, 字节数: 799797, 样本数: 654
- 名称: 1804, 字节数: 120141, 样本数: 103



