five

DoSp/DomainSpeech

收藏
Hugging Face2024-03-22 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/DoSp/DomainSpeech
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: Agriculture_Agricultural Biotechnology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143439038.0 num_examples: 300 download_size: 143297680 dataset_size: 143439038.0 - config_name: Agriculture_Agricultural Economics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138126833.0 num_examples: 300 download_size: 138014919 dataset_size: 138126833.0 - config_name: Agriculture_Agricultural Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143180625.0 num_examples: 300 download_size: 143050446 dataset_size: 143180625.0 - config_name: Agriculture_Agricultural Mechanization features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 154916533.0 num_examples: 300 download_size: 154747365 dataset_size: 154916533.0 - config_name: Agriculture_Animal Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 146354369.0 num_examples: 300 download_size: 146220983 dataset_size: 146354369.0 - config_name: Agriculture_Crop Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143046061.0 num_examples: 300 download_size: 142880656 dataset_size: 143046061.0 - config_name: Agriculture_Entomology and Pesticides features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143552360.0 num_examples: 300 download_size: 143407167 dataset_size: 143552360.0 - config_name: Agriculture_Fisheries features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138944065.0 num_examples: 300 download_size: 138788871 dataset_size: 138944065.0 - config_name: Agriculture_Forestry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 140535848.0 num_examples: 300 download_size: 140392528 dataset_size: 140535848.0 - config_name: Agriculture_Horticulture features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 147926282.0 num_examples: 300 download_size: 147791744 dataset_size: 147926282.0 - config_name: Agriculture_Plant Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123700367.0 num_examples: 300 download_size: 123597900 dataset_size: 123700367.0 - config_name: Agriculture_Poultry Production features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 147073759.0 num_examples: 300 download_size: 146906099 dataset_size: 147073759.0 - config_name: Agriculture_Soil Sciences and Plant Nutrition features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127354046.0 num_examples: 300 download_size: 127256326 dataset_size: 127354046.0 - config_name: Agriculture_Soil and Water Engineering and Conservation features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134537041.0 num_examples: 300 download_size: 134387592 dataset_size: 134537041.0 - config_name: Arts Design_Arts features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 119548638.0 num_examples: 300 download_size: 119440736 dataset_size: 119548638.0 - config_name: Arts Design_Design features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135083325.0 num_examples: 300 download_size: 134936083 dataset_size: 135083325.0 - config_name: Arts Design_Interior Architecture features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141126586.0 num_examples: 300 download_size: 140979090 dataset_size: 141126586.0 - config_name: Arts Design_Urban Planning features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 147980852.0 num_examples: 300 download_size: 147794755 dataset_size: 147980852.0 - config_name: Business_Business Administration features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 121104401.0 num_examples: 300 download_size: 120968900 dataset_size: 121104401.0 - config_name: Business_Communications and Media Studies features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123893864.0 num_examples: 300 download_size: 123794867 dataset_size: 123893864.0 - config_name: Business_Decision Science and Operations Management features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 117426723.0 num_examples: 300 download_size: 117317155 dataset_size: 117426723.0 - config_name: Business_Entrepreneurship features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129740439.0 num_examples: 300 download_size: 129590618 dataset_size: 129740439.0 - config_name: Business_Human Resource Management features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134109342.0 num_examples: 300 download_size: 133946610 dataset_size: 134109342.0 - config_name: Business_Marketing features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131082374.0 num_examples: 300 download_size: 130942488 dataset_size: 131082374.0 - config_name: Business_Public Administration features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128436764.0 num_examples: 300 download_size: 128268709 dataset_size: 128436764.0 - config_name: Business_Strategic Management features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129705598.0 num_examples: 300 download_size: 129565676 dataset_size: 129705598.0 - config_name: Economics_Accounting and Finance features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130086798.0 num_examples: 300 download_size: 129970443 dataset_size: 130086798.0 - config_name: Economics_Banking and Insurance features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125576327.0 num_examples: 300 download_size: 125457196 dataset_size: 125576327.0 - config_name: Economics_Environmental Economics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 144396467.0 num_examples: 300 download_size: 144269317 dataset_size: 144396467.0 - config_name: Economics_Financial Economics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126345574.0 num_examples: 300 download_size: 126213407 dataset_size: 126345574.0 - config_name: Economics_International Trade features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129266847.0 num_examples: 300 download_size: 129131077 dataset_size: 129266847.0 - config_name: Education_Early Childhood Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134842546.0 num_examples: 300 download_size: 134669041 dataset_size: 134842546.0 - config_name: Education_Educational Administration features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129139609.0 num_examples: 300 download_size: 129009495 dataset_size: 129139609.0 - config_name: Education_Educational Psychology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132445380.0 num_examples: 300 download_size: 132314227 dataset_size: 132445380.0 - config_name: Education_Educational Technology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136349543.0 num_examples: 300 download_size: 136233919 dataset_size: 136349543.0 - config_name: Education_Elemantary Teacher Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128929721.0 num_examples: 300 download_size: 128832448 dataset_size: 128929721.0 - config_name: Education_Foreign Language Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132729799.0 num_examples: 300 download_size: 132576098 dataset_size: 132729799.0 - config_name: Education_Guidance and Counseling features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137961853.0 num_examples: 300 download_size: 137814518 dataset_size: 137961853.0 - config_name: Education_Mathematics and Science Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134215509.0 num_examples: 300 download_size: 134099723 dataset_size: 134215509.0 - config_name: Education_Physical Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132937777.0 num_examples: 300 download_size: 132805858 dataset_size: 132937777.0 - config_name: Education_Sociology of Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124285485.0 num_examples: 300 download_size: 124176688 dataset_size: 124285485.0 - config_name: Education_Special Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 152289384.0 num_examples: 300 download_size: 152131422 dataset_size: 152289384.0 - config_name: Engineering_Aerospace Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124292138.0 num_examples: 300 download_size: 124191922 dataset_size: 124292138.0 - config_name: Engineering_Automotive Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143846463.0 num_examples: 300 download_size: 143708257 dataset_size: 143846463.0 - config_name: Engineering_Bioengineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143137978.0 num_examples: 300 download_size: 143012457 dataset_size: 143137978.0 - config_name: Engineering_Biomaterials and Tissue Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137146975.0 num_examples: 300 download_size: 137025731 dataset_size: 137146975.0 - config_name: Engineering_Biomedical Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131378195.0 num_examples: 300 download_size: 131261573 dataset_size: 131378195.0 - config_name: Engineering_Chemical Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143133003.0 num_examples: 300 download_size: 143008061 dataset_size: 143133003.0 - config_name: Engineering_Civil Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130465075.0 num_examples: 300 download_size: 130356251 dataset_size: 130465075.0 - config_name: Engineering_Computer Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132679470.0 num_examples: 300 download_size: 132529121 dataset_size: 132679470.0 - config_name: Engineering_Earth Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 119846962.0 num_examples: 300 download_size: 119730185 dataset_size: 119846962.0 - config_name: Engineering_Electrical and Electronic Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126520050.0 num_examples: 300 download_size: 126360752 dataset_size: 126520050.0 - config_name: Engineering_Electrical and Information Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123849397.0 num_examples: 300 download_size: 123716265 dataset_size: 123849397.0 - config_name: Engineering_Energy Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137784439.0 num_examples: 300 download_size: 137683801 dataset_size: 137784439.0 - config_name: Engineering_Environmental Science and Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137198399.0 num_examples: 300 download_size: 137059643 dataset_size: 137198399.0 - config_name: Engineering_Food Science and Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133611502.0 num_examples: 300 download_size: 133484623 dataset_size: 133611502.0 - config_name: Engineering_Geomatics Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129068429.0 num_examples: 300 download_size: 128978145 dataset_size: 129068429.0 - config_name: Engineering_Industrial and Manufacturing Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 122429186.0 num_examples: 300 download_size: 122322658 dataset_size: 122429186.0 - config_name: Engineering_Marine Sciences and Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132973282.0 num_examples: 300 download_size: 132860408 dataset_size: 132973282.0 - config_name: Engineering_Mechanical Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135364923.0 num_examples: 300 download_size: 135221594 dataset_size: 135364923.0 - config_name: Engineering_Mechatronics Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126449973.0 num_examples: 300 download_size: 126341559 dataset_size: 126449973.0 - config_name: Engineering_Metallurgical and Materials Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124292613.0 num_examples: 300 download_size: 124165732 dataset_size: 124292613.0 - config_name: Engineering_Meteorology and Atmospheric Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 120671090.0 num_examples: 300 download_size: 120549799 dataset_size: 120671090.0 - config_name: Engineering_Mining Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133000100.0 num_examples: 300 download_size: 132898319 dataset_size: 133000100.0 - config_name: Engineering_Nanoscience and Nanotechnology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126720028.0 num_examples: 300 download_size: 126601451 dataset_size: 126720028.0 - config_name: Engineering_Nuclear Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138378246.0 num_examples: 300 download_size: 138263608 dataset_size: 138378246.0 - config_name: Engineering_Petroleum Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131247557.0 num_examples: 300 download_size: 131121220 dataset_size: 131247557.0 - config_name: Engineering_Textile Engineering features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138330600.0 num_examples: 300 download_size: 138157500 dataset_size: 138330600.0 - config_name: History_History features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130253621.0 num_examples: 300 download_size: 130146337 dataset_size: 130253621.0 - config_name: Law_Business Corporate Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132833176.0 num_examples: 300 download_size: 132657300 dataset_size: 132833176.0 - config_name: Law_Civil Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 120799613.0 num_examples: 300 download_size: 120705948 dataset_size: 120799613.0 - config_name: Law_Constitutional Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124263458.0 num_examples: 300 download_size: 124147786 dataset_size: 124263458.0 - config_name: Law_Criminal Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125936929.0 num_examples: 300 download_size: 125829464 dataset_size: 125936929.0 - config_name: Law_Employment Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132215591.0 num_examples: 300 download_size: 132097839 dataset_size: 132215591.0 - config_name: Law_Environmental Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141112457.0 num_examples: 300 download_size: 140980187 dataset_size: 141112457.0 - config_name: Law_European Union Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134430087.0 num_examples: 300 download_size: 134291260 dataset_size: 134430087.0 - config_name: Law_International Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132972818.0 num_examples: 300 download_size: 132822729 dataset_size: 132972818.0 - config_name: Law_Law and Legal Studies features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124902845.0 num_examples: 300 download_size: 124767772 dataset_size: 124902845.0 - config_name: Law_Public Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 119886102.0 num_examples: 300 download_size: 119768166 dataset_size: 119886102.0 - config_name: Law_Tax Law features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126528701.0 num_examples: 300 download_size: 126415023 dataset_size: 126528701.0 - config_name: Medical Sciences_Anatomy features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124345096.0 num_examples: 300 download_size: 124253091 dataset_size: 124345096.0 - config_name: Medical Sciences_Anesthesiology and Reanimation features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129149763.0 num_examples: 300 download_size: 129028143 dataset_size: 129149763.0 - config_name: Medical Sciences_Audiology and Speech Pathology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134675137.0 num_examples: 300 download_size: 134564783 dataset_size: 134675137.0 - config_name: Medical Sciences_Bacteriology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129314886.0 num_examples: 300 download_size: 129190011 dataset_size: 129314886.0 - config_name: Medical Sciences_Biochemistry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125011940.0 num_examples: 300 download_size: 124932996 dataset_size: 125011940.0 - config_name: Medical Sciences_Biophysics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126020992.0 num_examples: 300 download_size: 125897336 dataset_size: 126020992.0 - config_name: Medical Sciences_Biostatistics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118651656.0 num_examples: 300 download_size: 118574377 dataset_size: 118651656.0 - config_name: Medical Sciences_Cardiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135302197.0 num_examples: 300 download_size: 135193717 dataset_size: 135302197.0 - config_name: Medical Sciences_Cardiovascular Surgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137987783.0 num_examples: 300 download_size: 137879610 dataset_size: 137987783.0 - config_name: Medical Sciences_Chest Diseases features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131629091.0 num_examples: 300 download_size: 131486615 dataset_size: 131629091.0 - config_name: Medical Sciences_Child and Adolescent Psychiatry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 152654204.0 num_examples: 300 download_size: 152523834 dataset_size: 152654204.0 - config_name: Medical Sciences_Clinical Pathology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133021566.0 num_examples: 300 download_size: 132912535 dataset_size: 133021566.0 - config_name: Medical Sciences_Dentistry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135479829.0 num_examples: 300 download_size: 135352775 dataset_size: 135479829.0 - config_name: Medical Sciences_Dermatology and Venereology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125724038.0 num_examples: 300 download_size: 125637034 dataset_size: 125724038.0 - config_name: Medical Sciences_Emergency Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135705901.0 num_examples: 300 download_size: 135572579 dataset_size: 135705901.0 - config_name: Medical Sciences_Endocrinology and Metabolism features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136547926.0 num_examples: 300 download_size: 136424174 dataset_size: 136547926.0 - config_name: Medical Sciences_Epidemiology and Public Health features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 122443760.0 num_examples: 300 download_size: 122331509 dataset_size: 122443760.0 - config_name: Medical Sciences_Family Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 147162214.0 num_examples: 300 download_size: 147018769 dataset_size: 147162214.0 - config_name: Medical Sciences_Forensic Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135621041.0 num_examples: 300 download_size: 135465069 dataset_size: 135621041.0 - config_name: Medical Sciences_Gastroenterology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137843323.0 num_examples: 300 download_size: 137726037 dataset_size: 137843323.0 - config_name: Medical Sciences_General Surgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124773122.0 num_examples: 300 download_size: 124665167 dataset_size: 124773122.0 - config_name: Medical Sciences_Geriatrics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 149601165.0 num_examples: 300 download_size: 149441668 dataset_size: 149601165.0 - config_name: Medical Sciences_Health Administration features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137277345.0 num_examples: 300 download_size: 137127990 dataset_size: 137277345.0 - config_name: Medical Sciences_Health Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132340082.0 num_examples: 300 download_size: 132191040 dataset_size: 132340082.0 - config_name: Medical Sciences_Hematology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137161132.0 num_examples: 300 download_size: 137001185 dataset_size: 137161132.0 - config_name: Medical Sciences_Histology and Embriology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118029910.0 num_examples: 300 download_size: 117960878 dataset_size: 118029910.0 - config_name: Medical Sciences_Immunology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 147571192.0 num_examples: 300 download_size: 147439785 dataset_size: 147571192.0 - config_name: Medical Sciences_Infectious Diseases features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130628555.0 num_examples: 300 download_size: 130515362 dataset_size: 130628555.0 - config_name: Medical Sciences_Internal Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132341319.0 num_examples: 300 download_size: 132242597 dataset_size: 132341319.0 - config_name: Medical Sciences_Medical Biochemistry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141321514.0 num_examples: 300 download_size: 141192803 dataset_size: 141321514.0 - config_name: Medical Sciences_Medical Biology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123713781.0 num_examples: 300 download_size: 123626323 dataset_size: 123713781.0 - config_name: Medical Sciences_Medical Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130348018.0 num_examples: 300 download_size: 130247442 dataset_size: 130348018.0 - config_name: Medical Sciences_Medical Genetics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132739285.0 num_examples: 300 download_size: 132620709 dataset_size: 132739285.0 - config_name: Medical Sciences_Medical Microbiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131818843.0 num_examples: 300 download_size: 131710880 dataset_size: 131818843.0 - config_name: Medical Sciences_Medical Oncology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132891133.0 num_examples: 300 download_size: 132742137 dataset_size: 132891133.0 - config_name: Medical Sciences_Medical Parasitology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127638224.0 num_examples: 300 download_size: 127533891 dataset_size: 127638224.0 - config_name: Medical Sciences_Medical Physics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128012792.0 num_examples: 300 download_size: 127907099 dataset_size: 128012792.0 - config_name: Medical Sciences_Medical Physiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123009232.0 num_examples: 300 download_size: 122906320 dataset_size: 123009232.0 - config_name: Medical Sciences_Medical Virology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129423629.0 num_examples: 300 download_size: 129321752 dataset_size: 129423629.0 - config_name: Medical Sciences_Microbiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133143959.0 num_examples: 300 download_size: 132988663 dataset_size: 133143959.0 - config_name: Medical Sciences_Molecular Biology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127464967.0 num_examples: 300 download_size: 127337963 dataset_size: 127464967.0 - config_name: Medical Sciences_Mycology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137823673.0 num_examples: 300 download_size: 137708636 dataset_size: 137823673.0 - config_name: Medical Sciences_Neonatology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141049258.0 num_examples: 300 download_size: 140933138 dataset_size: 141049258.0 - config_name: Medical Sciences_Nephrology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133628216.0 num_examples: 300 download_size: 133504498 dataset_size: 133628216.0 - config_name: Medical Sciences_Neurology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136508584.0 num_examples: 300 download_size: 136386376 dataset_size: 136508584.0 - config_name: Medical Sciences_Neuroscience features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126214227.0 num_examples: 300 download_size: 126138247 dataset_size: 126214227.0 - config_name: Medical Sciences_Neurosurgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 139598205.0 num_examples: 300 download_size: 139459556 dataset_size: 139598205.0 - config_name: Medical Sciences_Nuclear Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141475957.0 num_examples: 300 download_size: 141349187 dataset_size: 141475957.0 - config_name: Medical Sciences_Nursing and Midwifery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125067849.0 num_examples: 300 download_size: 124961824 dataset_size: 125067849.0 - config_name: Medical Sciences_Nutrition and Dietetics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137298930.0 num_examples: 300 download_size: 137177542 dataset_size: 137298930.0 - config_name: Medical Sciences_Obstetrics and Gynecology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 139462396.0 num_examples: 300 download_size: 139346196 dataset_size: 139462396.0 - config_name: Medical Sciences_Occupational Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 139789686.0 num_examples: 300 download_size: 139663646 dataset_size: 139789686.0 - config_name: Medical Sciences_Ophthalmology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128256576.0 num_examples: 300 download_size: 128137213 dataset_size: 128256576.0 - config_name: Medical Sciences_Optometry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124158526.0 num_examples: 300 download_size: 124043338 dataset_size: 124158526.0 - config_name: Medical Sciences_Orthopedics and Traumatology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124954258.0 num_examples: 300 download_size: 124839699 dataset_size: 124954258.0 - config_name: Medical Sciences_Otorhinolaryngology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118568192.0 num_examples: 300 download_size: 118469263 dataset_size: 118568192.0 - config_name: Medical Sciences_Parasitology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128606032.0 num_examples: 300 download_size: 128481740 dataset_size: 128606032.0 - config_name: Medical Sciences_Pathology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136361718.0 num_examples: 300 download_size: 136219475 dataset_size: 136361718.0 - config_name: Medical Sciences_Pediatric Cardiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125106812.0 num_examples: 300 download_size: 125019625 dataset_size: 125106812.0 - config_name: Medical Sciences_Pediatric Endocrinology and Metabolism features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133790952.0 num_examples: 300 download_size: 133675104 dataset_size: 133790952.0 - config_name: Medical Sciences_Pediatric Gastroenterology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129939533.0 num_examples: 300 download_size: 129818254 dataset_size: 129939533.0 - config_name: Medical Sciences_Pediatric Hematology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130557879.0 num_examples: 300 download_size: 130455018 dataset_size: 130557879.0 - config_name: Medical Sciences_Pediatric Immunology and Allergy features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124548519.0 num_examples: 300 download_size: 124454909 dataset_size: 124548519.0 - config_name: Medical Sciences_Pediatric Infectious Diseases features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129885463.0 num_examples: 300 download_size: 129772398 dataset_size: 129885463.0 - config_name: Medical Sciences_Pediatric Intensive Care features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136008333.0 num_examples: 300 download_size: 135876113 dataset_size: 136008333.0 - config_name: Medical Sciences_Pediatric Nephrology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133539276.0 num_examples: 300 download_size: 133420904 dataset_size: 133539276.0 - config_name: Medical Sciences_Pediatric Neurology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130006445.0 num_examples: 300 download_size: 129883565 dataset_size: 130006445.0 - config_name: Medical Sciences_Pediatric Pulmonology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131918311.0 num_examples: 300 download_size: 131790321 dataset_size: 131918311.0 - config_name: Medical Sciences_Pediatric Rheumatology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141173770.0 num_examples: 300 download_size: 141048082 dataset_size: 141173770.0 - config_name: Medical Sciences_Pediatric Surgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129573172.0 num_examples: 300 download_size: 129467025 dataset_size: 129573172.0 - config_name: Medical Sciences_Pediatrics and Child Health features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 142513323.0 num_examples: 300 download_size: 142398544 dataset_size: 142513323.0 - config_name: Medical Sciences_Perinatology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143238723.0 num_examples: 300 download_size: 143075573 dataset_size: 143238723.0 - config_name: Medical Sciences_Pharmacology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131266646.0 num_examples: 300 download_size: 131140692 dataset_size: 131266646.0 - config_name: Medical Sciences_Pharmacy & Pharmaceutical Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123536721.0 num_examples: 300 download_size: 123432708 dataset_size: 123536721.0 - config_name: Medical Sciences_Physical Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138883982.0 num_examples: 300 download_size: 138766735 dataset_size: 138883982.0 - config_name: Medical Sciences_Physiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129536853.0 num_examples: 300 download_size: 129405940 dataset_size: 129536853.0 - config_name: Medical Sciences_Physiotherapy features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 142691474.0 num_examples: 300 download_size: 142563292 dataset_size: 142691474.0 - config_name: Medical Sciences_Plastic Surgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 131666178.0 num_examples: 300 download_size: 131555009 dataset_size: 131666178.0 - config_name: Medical Sciences_Podiatry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 130451437.0 num_examples: 300 download_size: 130325455 dataset_size: 130451437.0 - config_name: Medical Sciences_Psychiatry features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 137513120.0 num_examples: 300 download_size: 137383527 dataset_size: 137513120.0 - config_name: Medical Sciences_Radiation Oncology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 146934885.0 num_examples: 300 download_size: 146815433 dataset_size: 146934885.0 - config_name: Medical Sciences_Radiology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 148168300.0 num_examples: 300 download_size: 148016600 dataset_size: 148168300.0 - config_name: Medical Sciences_Rheumatology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134954977.0 num_examples: 300 download_size: 134841511 dataset_size: 134954977.0 - config_name: Medical Sciences_Sport Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127576853.0 num_examples: 300 download_size: 127455316 dataset_size: 127576853.0 - config_name: Medical Sciences_Sports Medicine features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135083531.0 num_examples: 300 download_size: 134931348 dataset_size: 135083531.0 - config_name: Medical Sciences_Thoracic Surgery features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135906719.0 num_examples: 300 download_size: 135778944 dataset_size: 135906719.0 - config_name: Medical Sciences_Urology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135596805.0 num_examples: 300 download_size: 135473770 dataset_size: 135596805.0 - config_name: Medical Sciences_Veterinary Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 135858075.0 num_examples: 300 download_size: 135730165 dataset_size: 135858075.0 - config_name: Medical Sciences_Virology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127937723.0 num_examples: 300 download_size: 127838000 dataset_size: 127937723.0 - config_name: Natural Sciences_Applied physics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126350419.0 num_examples: 300 download_size: 126248052 dataset_size: 126350419.0 - config_name: Natural Sciences_Astrophysics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 129300703.0 num_examples: 300 download_size: 129158168 dataset_size: 129300703.0 - config_name: Natural Sciences_Atomic, Molecular and Optical physics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 111687416.0 num_examples: 300 download_size: 111582196 dataset_size: 111687416.0 - config_name: Natural Sciences_Biological Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126050128.0 num_examples: 300 download_size: 125945290 dataset_size: 126050128.0 - config_name: Natural Sciences_Chemical Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 125925185.0 num_examples: 300 download_size: 125809833 dataset_size: 125925185.0 - config_name: Natural Sciences_Condensed matter physics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 119880781.0 num_examples: 300 download_size: 119762462 dataset_size: 119880781.0 - config_name: Natural Sciences_Geography features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 127678573.0 num_examples: 300 download_size: 127551992 dataset_size: 127678573.0 - config_name: Natural Sciences_Mathematical Sciences features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118078153.0 num_examples: 300 download_size: 117964811 dataset_size: 118078153.0 - config_name: Natural Sciences_Molecular Biology and Genetics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 112294561.0 num_examples: 300 download_size: 112198712 dataset_size: 112294561.0 - config_name: Natural Sciences_Nuclear and Particle Physics features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 121217938.0 num_examples: 300 download_size: 121108176 dataset_size: 121217938.0 - config_name: Philosophy_Philosophy features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118345587.0 num_examples: 300 download_size: 118229918 dataset_size: 118345587.0 - config_name: Social Sciences_Anthropology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 128840376.0 num_examples: 300 download_size: 128696216 dataset_size: 128840376.0 - config_name: Social Sciences_Archeology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 118321559.0 num_examples: 300 download_size: 118206487 dataset_size: 118321559.0 - config_name: Social Sciences_Child Development features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 126576147.0 num_examples: 300 download_size: 126464165 dataset_size: 126576147.0 - config_name: Social Sciences_Demography features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 132052357.0 num_examples: 300 download_size: 131901043 dataset_size: 132052357.0 - config_name: Social Sciences_Higher Education Studies features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 141786814.0 num_examples: 300 download_size: 141661233 dataset_size: 141786814.0 - config_name: Social Sciences_Housing features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 146169123.0 num_examples: 300 download_size: 146033728 dataset_size: 146169123.0 - config_name: Social Sciences_International Relations features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133839740.0 num_examples: 300 download_size: 133676984 dataset_size: 133839740.0 - config_name: Social Sciences_Library and Information Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 123726092.0 num_examples: 300 download_size: 123594991 dataset_size: 123726092.0 - config_name: Social Sciences_Linguistics and Literature features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 114704654.0 num_examples: 300 download_size: 114595695 dataset_size: 114704654.0 - config_name: Social Sciences_Open and Distance Education features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143105156.0 num_examples: 300 download_size: 142956652 dataset_size: 143105156.0 - config_name: Social Sciences_Political Science features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 121094624.0 num_examples: 300 download_size: 120963345 dataset_size: 121094624.0 - config_name: Social Sciences_Psychology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 136275316.0 num_examples: 300 download_size: 136139111 dataset_size: 136275316.0 - config_name: Social Sciences_Regional Studies features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 124353132.0 num_examples: 300 download_size: 124243486 dataset_size: 124353132.0 - config_name: Social Sciences_Social Policy features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134904666.0 num_examples: 300 download_size: 134753980 dataset_size: 134904666.0 - config_name: Social Sciences_Social Work features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 134077053.0 num_examples: 300 download_size: 133967130 dataset_size: 134077053.0 - config_name: Social Sciences_Sociology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 133329557.0 num_examples: 300 download_size: 133180184 dataset_size: 133329557.0 - config_name: Social Sciences_Tourism and Hospitality features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 142262231.0 num_examples: 300 download_size: 142100591 dataset_size: 142262231.0 - config_name: Social Sciences_Transportation Science and Technology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 140265612.0 num_examples: 300 download_size: 140124964 dataset_size: 140265612.0 - config_name: Theology_Theology features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 115449408.0 num_examples: 300 download_size: 115356333 dataset_size: 115449408.0 - config_name: testing features: - name: audio dtype: audio: sampling_rate: 16000 - name: sentence dtype: string splits: - name: test num_bytes: 115449370.0 num_examples: 300 download_size: 115356390 dataset_size: 115449370.0 configs: - config_name: Agriculture_Agricultural Biotechnology data_files: - split: test path: content/Agriculture/Agricultural Biotechnology/test-* - config_name: Agriculture_Agricultural Economics data_files: - split: test path: content/Agriculture/Agricultural Economics/test-* - config_name: Agriculture_Agricultural Engineering data_files: - split: test path: content/Agriculture/Agricultural Engineering/test-* - config_name: Agriculture_Agricultural Mechanization data_files: - split: test path: content/Agriculture/Agricultural Mechanization/test-* - config_name: Agriculture_Animal Science data_files: - split: test path: content/Agriculture/Animal Science/test-* - config_name: Agriculture_Crop Science data_files: - split: test path: content/Agriculture/Crop Science/test-* - config_name: Agriculture_Entomology and Pesticides data_files: - split: test path: content/Agriculture/Entomology and Pesticides/test-* - config_name: Agriculture_Fisheries data_files: - split: test path: content/Agriculture/Fisheries/test-* - config_name: Agriculture_Forestry data_files: - split: test path: content/Agriculture/Forestry/test-* - config_name: Agriculture_Horticulture data_files: - split: test path: content/Agriculture/Horticulture/test-* - config_name: Agriculture_Plant Science data_files: - split: test path: content/Agriculture/Plant Science/test-* - config_name: Agriculture_Poultry Production data_files: - split: test path: content/Agriculture/Poultry Production/test-* - config_name: Agriculture_Soil Sciences and Plant Nutrition data_files: - split: test path: content/Agriculture/Soil Sciences and Plant Nutrition/test-* - config_name: Agriculture_Soil and Water Engineering and Conservation data_files: - split: test path: content/Agriculture/Soil and Water Engineering and Conservation/test-* - config_name: Arts Design_Arts data_files: - split: test path: content/Arts Design/Arts/test-* - config_name: Arts Design_Design data_files: - split: test path: content/Arts Design/Design/test-* - config_name: Arts Design_Interior Architecture data_files: - split: test path: content/Arts Design/Interior Architecture/test-* - config_name: Arts Design_Urban Planning data_files: - split: test path: content/Arts Design/Urban Planning/test-* - config_name: Business_Business Administration data_files: - split: test path: content/Business/Business Administration/test-* - config_name: Business_Communications and Media Studies data_files: - split: test path: content/Business/Communications and Media Studies/test-* - config_name: Business_Decision Science and Operations Management data_files: - split: test path: content/Business/Decision Science and Operations Management/test-* - config_name: Business_Entrepreneurship data_files: - split: test path: content/Business/Entrepreneurship/test-* - config_name: Business_Human Resource Management data_files: - split: test path: content/Business/Human Resource Management/test-* - config_name: Business_Marketing data_files: - split: test path: content/Business/Marketing/test-* - config_name: Business_Public Administration data_files: - split: test path: content/Business/Public Administration/test-* - config_name: Business_Strategic Management data_files: - split: test path: content/Business/Strategic Management/test-* - config_name: Economics_Accounting and Finance data_files: - split: test path: content/Economics/Accounting and Finance/test-* - config_name: Economics_Banking and Insurance data_files: - split: test path: content/Economics/Banking and Insurance/test-* - config_name: Economics_Environmental Economics data_files: - split: test path: content/Economics/Environmental Economics/test-* - config_name: Economics_Financial Economics data_files: - split: test path: content/Economics/Financial Economics/test-* - config_name: Economics_International Trade data_files: - split: test path: content/Economics/International Trade/test-* - config_name: Education_Early Childhood Education data_files: - split: test path: content/Education/Early Childhood Education/test-* - config_name: Education_Educational Administration data_files: - split: test path: content/Education/Educational Administration/test-* - config_name: Education_Educational Psychology data_files: - split: test path: content/Education/Educational Psychology/test-* - config_name: Education_Educational Technology data_files: - split: test path: content/Education/Educational Technology/test-* - config_name: Education_Elemantary Teacher Education data_files: - split: test path: content/Education/Elemantary Teacher Education/test-* - config_name: Education_Foreign Language Education data_files: - split: test path: content/Education/Foreign Language Education/test-* - config_name: Education_Guidance and Counseling data_files: - split: test path: content/Education/Guidance and Counseling/test-* - config_name: Education_Mathematics and Science Education data_files: - split: test path: content/Education/Mathematics and Science Education/test-* - config_name: Education_Physical Education data_files: - split: test path: content/Education/Physical Education/test-* - config_name: Education_Sociology of Education data_files: - split: test path: content/Education/Sociology of Education/test-* - config_name: Education_Special Education data_files: - split: test path: content/Education/Special Education/test-* - config_name: Engineering_Aerospace Engineering data_files: - split: test path: content/Engineering/Aerospace Engineering/test-* - config_name: Engineering_Automotive Engineering data_files: - split: test path: content/Engineering/Automotive Engineering/test-* - config_name: Engineering_Bioengineering data_files: - split: test path: content/Engineering/Bioengineering/test-* - config_name: Engineering_Biomaterials and Tissue Engineering data_files: - split: test path: content/Engineering/Biomaterials and Tissue Engineering/test-* - config_name: Engineering_Biomedical Engineering data_files: - split: test path: content/Engineering/Biomedical Engineering/test-* - config_name: Engineering_Chemical Engineering data_files: - split: test path: content/Engineering/Chemical Engineering/test-* - config_name: Engineering_Civil Engineering data_files: - split: test path: content/Engineering/Civil Engineering/test-* - config_name: Engineering_Computer Science data_files: - split: test path: content/Engineering/Computer Science/test-* - config_name: Engineering_Earth Sciences data_files: - split: test path: content/Engineering/Earth Sciences/test-* - config_name: Engineering_Electrical and Electronic Engineering data_files: - split: test path: content/Engineering/Electrical and Electronic Engineering/test-* - config_name: Engineering_Electrical and Information Engineering data_files: - split: test path: content/Engineering/Electrical and Information Engineering/test-* - config_name: Engineering_Energy Engineering data_files: - split: test path: content/Engineering/Energy Engineering/test-* - config_name: Engineering_Environmental Science and Engineering data_files: - split: test path: content/Engineering/Environmental Science and Engineering/test-* - config_name: Engineering_Food Science and Engineering data_files: - split: test path: content/Engineering/Food Science and Engineering/test-* - config_name: Engineering_Geomatics Engineering data_files: - split: test path: content/Engineering/Geomatics Engineering/test-* - config_name: Engineering_Industrial and Manufacturing Engineering data_files: - split: test path: content/Engineering/Industrial and Manufacturing Engineering/test-* - config_name: Engineering_Marine Sciences and Engineering data_files: - split: test path: content/Engineering/Marine Sciences and Engineering/test-* - config_name: Engineering_Mechanical Engineering data_files: - split: test path: content/Engineering/Mechanical Engineering/test-* - config_name: Engineering_Mechatronics Engineering data_files: - split: test path: content/Engineering/Mechatronics Engineering/test-* - config_name: Engineering_Metallurgical and Materials Engineering data_files: - split: test path: content/Engineering/Metallurgical and Materials Engineering/test-* - config_name: Engineering_Meteorology and Atmospheric Sciences data_files: - split: test path: content/Engineering/Meteorology and Atmospheric Sciences/test-* - config_name: Engineering_Mining Engineering data_files: - split: test path: content/Engineering/Mining Engineering/test-* - config_name: Engineering_Nanoscience and Nanotechnology data_files: - split: test path: content/Engineering/Nanoscience and Nanotechnology/test-* - config_name: Engineering_Nuclear Engineering data_files: - split: test path: content/Engineering/Nuclear Engineering/test-* - config_name: Engineering_Petroleum Engineering data_files: - split: test path: content/Engineering/Petroleum Engineering/test-* - config_name: Engineering_Textile Engineering data_files: - split: test path: content/Engineering/Textile Engineering/test-* - config_name: History_History data_files: - split: test path: content/History/History/test-* - config_name: Law_Business Corporate Law data_files: - split: test path: content/Law/Business Corporate Law/test-* - config_name: Law_Civil Law data_files: - split: test path: content/Law/Civil Law/test-* - config_name: Law_Constitutional Law data_files: - split: test path: content/Law/Constitutional Law/test-* - config_name: Law_Criminal Law data_files: - split: test path: content/Law/Criminal Law/test-* - config_name: Law_Employment Law data_files: - split: test path: content/Law/Employment Law/test-* - config_name: Law_Environmental Law data_files: - split: test path: content/Law/Environmental Law/test-* - config_name: Law_European Union Law data_files: - split: test path: content/Law/European Union Law/test-* - config_name: Law_International Law data_files: - split: test path: content/Law/International Law/test-* - config_name: Law_Law and Legal Studies data_files: - split: test path: content/Law/Law and Legal Studies/test-* - config_name: Law_Public Law data_files: - split: test path: content/Law/Public Law/test-* - config_name: Law_Tax Law data_files: - split: test path: content/Law/Tax Law/test-* - config_name: Medical Sciences_Anatomy data_files: - split: test path: content/Medical Sciences/Anatomy/test-* - config_name: Medical Sciences_Anesthesiology and Reanimation data_files: - split: test path: content/Medical Sciences/Anesthesiology and Reanimation/test-* - config_name: Medical Sciences_Audiology and Speech Pathology data_files: - split: test path: content/Medical Sciences/Audiology and Speech Pathology/test-* - config_name: Medical Sciences_Bacteriology data_files: - split: test path: content/Medical Sciences/Bacteriology/test-* - config_name: Medical Sciences_Biochemistry data_files: - split: test path: content/Medical Sciences/Biochemistry/test-* - config_name: Medical Sciences_Biophysics data_files: - split: test path: content/Medical Sciences/Biophysics/test-* - config_name: Medical Sciences_Biostatistics data_files: - split: test path: content/Medical Sciences/Biostatistics/test-* - config_name: Medical Sciences_Cardiology data_files: - split: test path: content/Medical Sciences/Cardiology/test-* - config_name: Medical Sciences_Cardiovascular Surgery data_files: - split: test path: content/Medical Sciences/Cardiovascular Surgery/test-* - config_name: Medical Sciences_Chest Diseases data_files: - split: test path: content/Medical Sciences/Chest Diseases/test-* - config_name: Medical Sciences_Child and Adolescent Psychiatry data_files: - split: test path: content/Medical Sciences/Child and Adolescent Psychiatry/test-* - config_name: Medical Sciences_Clinical Pathology data_files: - split: test path: content/Medical Sciences/Clinical Pathology/test-* - config_name: Medical Sciences_Dentistry data_files: - split: test path: content/Medical Sciences/Dentistry/test-* - config_name: Medical Sciences_Dermatology and Venereology data_files: - split: test path: content/Medical Sciences/Dermatology and Venereology/test-* - config_name: Medical Sciences_Emergency Medicine data_files: - split: test path: content/Medical Sciences/Emergency Medicine/test-* - config_name: Medical Sciences_Endocrinology and Metabolism data_files: - split: test path: content/Medical Sciences/Endocrinology and Metabolism/test-* - config_name: Medical Sciences_Epidemiology and Public Health data_files: - split: test path: content/Medical Sciences/Epidemiology and Public Health/test-* - config_name: Medical Sciences_Family Medicine data_files: - split: test path: content/Medical Sciences/Family Medicine/test-* - config_name: Medical Sciences_Forensic Medicine data_files: - split: test path: content/Medical Sciences/Forensic Medicine/test-* - config_name: Medical Sciences_Gastroenterology data_files: - split: test path: content/Medical Sciences/Gastroenterology/test-* - config_name: Medical Sciences_General Surgery data_files: - split: test path: content/Medical Sciences/General Surgery/test-* - config_name: Medical Sciences_Geriatrics data_files: - split: test path: content/Medical Sciences/Geriatrics/test-* - config_name: Medical Sciences_Health Administration data_files: - split: test path: content/Medical Sciences/Health Administration/test-* - config_name: Medical Sciences_Health Sciences data_files: - split: test path: content/Medical Sciences/Health Sciences/test-* - config_name: Medical Sciences_Hematology data_files: - split: test path: content/Medical Sciences/Hematology/test-* - config_name: Medical Sciences_Histology and Embriology data_files: - split: test path: content/Medical Sciences/Histology and Embriology/test-* - config_name: Medical Sciences_Immunology data_files: - split: test path: content/Medical Sciences/Immunology/test-* - config_name: Medical Sciences_Infectious Diseases data_files: - split: test path: content/Medical Sciences/Infectious Diseases/test-* - config_name: Medical Sciences_Internal Medicine data_files: - split: test path: content/Medical Sciences/Internal Medicine/test-* - config_name: Medical Sciences_Medical Biochemistry data_files: - split: test path: content/Medical Sciences/Medical Biochemistry/test-* - config_name: Medical Sciences_Medical Biology data_files: - split: test path: content/Medical Sciences/Medical Biology/test-* - config_name: Medical Sciences_Medical Education data_files: - split: test path: content/Medical Sciences/Medical Education/test-* - config_name: Medical Sciences_Medical Genetics data_files: - split: test path: content/Medical Sciences/Medical Genetics/test-* - config_name: Medical Sciences_Medical Microbiology data_files: - split: test path: content/Medical Sciences/Medical Microbiology/test-* - config_name: Medical Sciences_Medical Oncology data_files: - split: test path: content/Medical Sciences/Medical Oncology/test-* - config_name: Medical Sciences_Medical Parasitology data_files: - split: test path: content/Medical Sciences/Medical Parasitology/test-* - config_name: Medical Sciences_Medical Physics data_files: - split: test path: content/Medical Sciences/Medical Physics/test-* - config_name: Medical Sciences_Medical Physiology data_files: - split: test path: content/Medical Sciences/Medical Physiology/test-* - config_name: Medical Sciences_Medical Virology data_files: - split: test path: content/Medical Sciences/Medical Virology/test-* - config_name: Medical Sciences_Microbiology data_files: - split: test path: content/Medical Sciences/Microbiology/test-* - config_name: Medical Sciences_Molecular Biology data_files: - split: test path: content/Medical Sciences/Molecular Biology/test-* - config_name: Medical Sciences_Mycology data_files: - split: test path: content/Medical Sciences/Mycology/test-* - config_name: Medical Sciences_Neonatology data_files: - split: test path: content/Medical Sciences/Neonatology/test-* - config_name: Medical Sciences_Nephrology data_files: - split: test path: content/Medical Sciences/Nephrology/test-* - config_name: Medical Sciences_Neurology data_files: - split: test path: content/Medical Sciences/Neurology/test-* - config_name: Medical Sciences_Neuroscience data_files: - split: test path: content/Medical Sciences/Neuroscience/test-* - config_name: Medical Sciences_Neurosurgery data_files: - split: test path: content/Medical Sciences/Neurosurgery/test-* - config_name: Medical Sciences_Nuclear Medicine data_files: - split: test path: content/Medical Sciences/Nuclear Medicine/test-* - config_name: Medical Sciences_Nursing and Midwifery data_files: - split: test path: content/Medical Sciences/Nursing and Midwifery/test-* - config_name: Medical Sciences_Nutrition and Dietetics data_files: - split: test path: content/Medical Sciences/Nutrition and Dietetics/test-* - config_name: Medical Sciences_Obstetrics and Gynecology data_files: - split: test path: content/Medical Sciences/Obstetrics and Gynecology/test-* - config_name: Medical Sciences_Occupational Medicine data_files: - split: test path: content/Medical Sciences/Occupational Medicine/test-* - config_name: Medical Sciences_Ophthalmology data_files: - split: test path: content/Medical Sciences/Ophthalmology/test-* - config_name: Medical Sciences_Optometry data_files: - split: test path: content/Medical Sciences/Optometry/test-* - config_name: Medical Sciences_Orthopedics and Traumatology data_files: - split: test path: content/Medical Sciences/Orthopedics and Traumatology/test-* - config_name: Medical Sciences_Otorhinolaryngology data_files: - split: test path: content/Medical Sciences/Otorhinolaryngology/test-* - config_name: Medical Sciences_Parasitology data_files: - split: test path: content/Medical Sciences/Parasitology/test-* - config_name: Medical Sciences_Pathology data_files: - split: test path: content/Medical Sciences/Pathology/test-* - config_name: Medical Sciences_Pediatric Cardiology data_files: - split: test path: content/Medical Sciences/Pediatric Cardiology/test-* - config_name: Medical Sciences_Pediatric Endocrinology and Metabolism data_files: - split: test path: content/Medical Sciences/Pediatric Endocrinology and Metabolism/test-* - config_name: Medical Sciences_Pediatric Gastroenterology data_files: - split: test path: content/Medical Sciences/Pediatric Gastroenterology/test-* - config_name: Medical Sciences_Pediatric Hematology data_files: - split: test path: content/Medical Sciences/Pediatric Hematology/test-* - config_name: Medical Sciences_Pediatric Immunology and Allergy data_files: - split: test path: content/Medical Sciences/Pediatric Immunology and Allergy/test-* - config_name: Medical Sciences_Pediatric Infectious Diseases data_files: - split: test path: content/Medical Sciences/Pediatric Infectious Diseases/test-* - config_name: Medical Sciences_Pediatric Intensive Care data_files: - split: test path: content/Medical Sciences/Pediatric Intensive Care/test-* - config_name: Medical Sciences_Pediatric Nephrology data_files: - split: test path: content/Medical Sciences/Pediatric Nephrology/test-* - config_name: Medical Sciences_Pediatric Neurology data_files: - split: test path: content/Medical Sciences/Pediatric Neurology/test-* - config_name: Medical Sciences_Pediatric Pulmonology data_files: - split: test path: content/Medical Sciences/Pediatric Pulmonology/test-* - config_name: Medical Sciences_Pediatric Rheumatology data_files: - split: test path: content/Medical Sciences/Pediatric Rheumatology/test-* - config_name: Medical Sciences_Pediatric Surgery data_files: - split: test path: content/Medical Sciences/Pediatric Surgery/test-* - config_name: Medical Sciences_Pediatrics and Child Health data_files: - split: test path: content/Medical Sciences/Pediatrics and Child Health/test-* - config_name: Medical Sciences_Perinatology data_files: - split: test path: content/Medical Sciences/Perinatology/test-* - config_name: Medical Sciences_Pharmacology data_files: - split: test path: content/Medical Sciences/Pharmacology/test-* - config_name: Medical Sciences_Pharmacy & Pharmaceutical Sciences data_files: - split: test path: content/Medical Sciences/Pharmacy & Pharmaceutical Sciences/test-* - config_name: Medical Sciences_Physical Medicine data_files: - split: test path: content/Medical Sciences/Physical Medicine/test-* - config_name: Medical Sciences_Physiology data_files: - split: test path: content/Medical Sciences/Physiology/test-* - config_name: Medical Sciences_Physiotherapy data_files: - split: test path: content/Medical Sciences/Physiotherapy/test-* - config_name: Medical Sciences_Plastic Surgery data_files: - split: test path: content/Medical Sciences/Plastic Surgery/test-* - config_name: Medical Sciences_Podiatry data_files: - split: test path: content/Medical Sciences/Podiatry/test-* - config_name: Medical Sciences_Psychiatry data_files: - split: test path: content/Medical Sciences/Psychiatry/test-* - config_name: Medical Sciences_Radiation Oncology data_files: - split: test path: content/Medical Sciences/Radiation Oncology/test-* - config_name: Medical Sciences_Radiology data_files: - split: test path: content/Medical Sciences/Radiology/test-* - config_name: Medical Sciences_Rheumatology data_files: - split: test path: content/Medical Sciences/Rheumatology/test-* - config_name: Medical Sciences_Sport Science data_files: - split: test path: content/Medical Sciences/Sport Science/test-* - config_name: Medical Sciences_Sports Medicine data_files: - split: test path: content/Medical Sciences/Sports Medicine/test-* - config_name: Medical Sciences_Thoracic Surgery data_files: - split: test path: content/Medical Sciences/Thoracic Surgery/test-* - config_name: Medical Sciences_Urology data_files: - split: test path: content/Medical Sciences/Urology/test-* - config_name: Medical Sciences_Veterinary Sciences data_files: - split: test path: content/Medical Sciences/Veterinary Sciences/test-* - config_name: Medical Sciences_Virology data_files: - split: test path: content/Medical Sciences/Virology/test-* - config_name: Natural Sciences_Applied physics data_files: - split: test path: content/Natural Sciences/Applied physics/test-* - config_name: Natural Sciences_Astrophysics data_files: - split: test path: content/Natural Sciences/Astrophysics/test-* - config_name: Natural Sciences_Atomic, Molecular and Optical physics data_files: - split: test path: content/Natural Sciences/Atomic, Molecular and Optical physics/test-* - config_name: Natural Sciences_Biological Science data_files: - split: test path: content/Natural Sciences/Biological Science/test-* - config_name: Natural Sciences_Chemical Sciences data_files: - split: test path: content/Natural Sciences/Chemical Sciences/test-* - config_name: Natural Sciences_Condensed matter physics data_files: - split: test path: content/Natural Sciences/Condensed matter physics/test-* - config_name: Natural Sciences_Geography data_files: - split: test path: content/Natural Sciences/Geography/test-* - config_name: Natural Sciences_Mathematical Sciences data_files: - split: test path: content/Natural Sciences/Mathematical Sciences/test-* - config_name: Natural Sciences_Molecular Biology and Genetics data_files: - split: test path: content/Natural Sciences/Molecular Biology and Genetics/test-* - config_name: Natural Sciences_Nuclear and Particle Physics data_files: - split: test path: content/Natural Sciences/Nuclear and Particle Physics/test-* - config_name: Philosophy_Philosophy data_files: - split: test path: content/Philosophy/Philosophy/test-* - config_name: Social Sciences_Anthropology data_files: - split: test path: content/Social Sciences/Anthropology/test-* - config_name: Social Sciences_Archeology data_files: - split: test path: content/Social Sciences/Archeology/test-* - config_name: Social Sciences_Child Development data_files: - split: test path: content/Social Sciences/Child Development/test-* - config_name: Social Sciences_Demography data_files: - split: test path: content/Social Sciences/Demography/test-* - config_name: Social Sciences_Higher Education Studies data_files: - split: test path: content/Social Sciences/Higher Education Studies/test-* - config_name: Social Sciences_Housing data_files: - split: test path: content/Social Sciences/Housing/test-* - config_name: Social Sciences_International Relations data_files: - split: test path: content/Social Sciences/International Relations/test-* - config_name: Social Sciences_Library and Information Science data_files: - split: test path: content/Social Sciences/Library and Information Science/test-* - config_name: Social Sciences_Linguistics and Literature data_files: - split: test path: content/Social Sciences/Linguistics and Literature/test-* - config_name: Social Sciences_Open and Distance Education data_files: - split: test path: content/Social Sciences/Open and Distance Education/test-* - config_name: Social Sciences_Political Science data_files: - split: test path: content/Social Sciences/Political Science/test-* - config_name: Social Sciences_Psychology data_files: - split: test path: content/Social Sciences/Psychology/test-* - config_name: Social Sciences_Regional Studies data_files: - split: test path: content/Social Sciences/Regional Studies/test-* - config_name: Social Sciences_Social Policy data_files: - split: test path: content/Social Sciences/Social Policy/test-* - config_name: Social Sciences_Social Work data_files: - split: test path: content/Social Sciences/Social Work/test-* - config_name: Social Sciences_Sociology data_files: - split: test path: content/Social Sciences/Sociology/test-* - config_name: Social Sciences_Tourism and Hospitality data_files: - split: test path: content/Social Sciences/Tourism and Hospitality/test-* - config_name: Social Sciences_Transportation Science and Technology data_files: - split: test path: content/Social Sciences/Transportation Science and Technology/test-* - config_name: Theology_Theology data_files: - split: test path: content/Theology/Theology/test-* - config_name: testing data_files: - split: test path: /content/testing/test-* --- # Multi-domain academic audio data for evaluating ASR model ## Dataset Summary This dataset, named "DomainSpeech," is meticulously curated to serve as a robust evaluation tool for Automatic Speech Recognition (ASR) models. Encompassing a broad spectrum of academic domains including Agriculture, Sciences, Engineering, and Business. A distinctive feature of this dataset is its deliberate design to present a more challenging benchmark by maintaining a technical terminology density of 20% across the texts. This parameter was set to elevate the complexity above the norm found in existing ASR model evaluation datasets, thereby rendering "DomainSpeech" an ideal candidate for validating the performance of ASR systems in recognizing domain-specific contents. The dataset's unique composition makes it a valuable asset for researchers and developers aiming to enhance the accuracy and reliability of ASR systems in academic and professional settings. ## Dataset Description DomainSpeech is composed of 199 subsets, each contributing 300 rows of domain-specific English text data and corresponding 22050 Hz speech data. Each subset name takes a form as {domain}_{subdomain}. Although DomainSpeech mainly focuses on evaluation of ASR models, it also have extra 1500 rows for fine-tuning with some subdomains (Anatomy, Anthropology, Cardiology, Dentistry, Pathology). ## How to Use To utilize the "DomainSpeech" dataset, especially focusing on a subset such as 'Medical Sciences_Anatomy,' you can follow the simple steps outlined below. This example demonstrates how to load the 'Medical Sciences_Anatomy' subset from the dataset for further analysis or model evaluation. ```python from datasets import load_dataset # Load the 'Medical Sciences_Anatomy' subset from the 'DomainSpeech' dataset dataset = load_dataset("DoSp/DomainSpeech", "Medical Sciences_Anatomy") ``` ## Evaluation Example Can be found on our Paper "DomainSpeech: Domain Specific Corpus to Evaluate and Enhance ASR System" | | Anatomy | Anthropology | Cardiology | Dentistry | Pathology | | ----------------- | ----- | ----- | ----- | ----- | ----- | | **Whisper-small** | - | - | - | - | - | | **Baseline** | 9.19 | 9.19 | 13.25 | 9.76 | 11.92 | | **T5-base** |8.49 | 7.15 | 9.7 | 8.60 | 11.16 | | **Whisper-large-v2** | - | - | - | - | - | | **Baseline** | 3.98 | 3.19 | 6.17 | 4.33 | 6.85 | | **T5-base** | 3.84 | 4.31 | 4.34 | 4.00 | 7.83 |

dataset_info: - config_name: 农业_农业生物技术(Agriculture_Agricultural Biotechnology) features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 143439038.0 num_examples: 300 download_size: 143297680 dataset_size: 143439038.0 - config_name: 农业_农业经济学(Agriculture_Agricultural Economics) features: - name: audio dtype: audio - name: sentence dtype: string splits: - name: test num_bytes: 138126833.0 num_examples: 300 download_size: 138014919 dataset_size: 138126833.0 ...(其余197个配置项格式与上述一致) - config_name: 测试(testing) features: - name: audio dtype: audio: sampling_rate: 16000 - name: sentence dtype: string splits: - name: test num_bytes: 115449370.0 num_examples: 300 download_size: 115356390 dataset_size: 115449370.0 configs: - config_name: 农业_农业生物技术(Agriculture_Agricultural Biotechnology) data_files: - split: test path: content/农业/农业生物技术/test-* - config_name: 农业_农业经济学(Agriculture_Agricultural Economics) data_files: - split: test path: content/农业/农业经济学/test-* ...(其余197个配置项格式与上述一致) - config_name: 测试(testing) data_files: - split: test path: /content/testing/test-* # 用于评估自动语音识别(Automatic Speech Recognition, ASR)模型的多领域学术音频数据 ## 数据集概览 本数据集名为"DomainSpeech",经精心编纂,旨在作为一款可靠的自动语音识别(ASR)模型评估工具。其涵盖农业、理学、工程学、商学等广泛的学术领域。该数据集的显著特色在于,其刻意设计为更具挑战性的基准测试集:在所有文本中维持20%的专业术语密度。该参数设置将文本复杂度提升至现有ASR模型评估数据集的常规水平之上,从而使DomainSpeech成为验证ASR系统识别领域特定内容性能的理想候选方案。本数据集的独特构成,为致力于提升学术与专业场景下ASR系统准确性与可靠性的研究人员与开发者提供了宝贵的资源。 ## 数据集说明 DomainSpeech共包含199个子集,每个子集均提供300条领域专属的英文文本数据与对应的22050 Hz语音数据。每个子集的命名格式为「{主领域}_{子领域}」。尽管DomainSpeech主要用于ASR模型的评估,但其还为部分子领域(解剖学、人类学、心脏病学、牙科学、病理学)提供了额外的1500条数据用于模型微调。 ## 使用方法 若要使用"DomainSpeech"数据集——例如以「医学科学_解剖学」子集为例——可遵循以下简单步骤。本示例展示了如何从数据集中加载「医学科学_解剖学」子集,以开展后续分析或模型评估。 python from datasets import load_dataset # 从DomainSpeech数据集中加载「医学科学_解剖学」子集 dataset = load_dataset("DoSp/DomainSpeech", "Medical Sciences_Anatomy") ## 评估示例 评估示例可参见我们的论文《DomainSpeech:用于评估与增强ASR系统的领域专属语料库》。 | | 解剖学 | 人类学 | 心脏病学 | 牙科学 | 病理学 | | ----------------- | ----- | ----- | ----- | ----- | ----- | | **Whisper-small** | - | - | - | - | - | | **基线模型** | 9.19 | 9.19 | 13.25 | 9.76 | 11.92 | | **T5-base** |8.49 | 7.15 | 9.7 | 8.60 | 11.16 | | **Whisper-large-v2** | - | - | - | - | - | | **基线模型** | 3.98 | 3.19 | 6.17 | 4.33 | 6.85 | | **T5-base** | 3.84 | 4.31 | 4.34 | 4.00 | 7.83 |
提供机构:
DoSp
原始信息汇总

数据集概述

数据集配置

农业领域

  • Agriculture_Agricultural Biotechnology

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 143439038.0字节
    • 下载大小: 143297680字节
    • 数据集大小: 143439038.0字节
  • Agriculture_Agricultural Economics

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 138126833.0字节
    • 下载大小: 138014919字节
    • 数据集大小: 138126833.0字节
  • Agriculture_Agricultural Engineering

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 143180625.0字节
    • 下载大小: 143050446字节
    • 数据集大小: 143180625.0字节
  • Agriculture_Agricultural Mechanization

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 154916533.0字节
    • 下载大小: 154747365字节
    • 数据集大小: 154916533.0字节
  • Agriculture_Animal Science

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 146354369.0字节
    • 下载大小: 146220983字节
    • 数据集大小: 146354369.0字节
  • Agriculture_Crop Science

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 143046061.0字节
    • 下载大小: 142880656字节
    • 数据集大小: 143046061.0字节
  • Agriculture_Entomology and Pesticides

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 143552360.0字节
    • 下载大小: 143407167字节
    • 数据集大小: 143552360.0字节
  • Agriculture_Fisheries

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 138944065.0字节
    • 下载大小: 138788871字节
    • 数据集大小: 138944065.0字节
  • Agriculture_Forestry

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 140535848.0字节
    • 下载大小: 140392528字节
    • 数据集大小: 140535848.0字节
  • Agriculture_Horticulture

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 147926282.0字节
    • 下载大小: 147791744字节
    • 数据集大小: 147926282.0字节
  • Agriculture_Plant Science

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 123700367.0字节
    • 下载大小: 123597900字节
    • 数据集大小: 123700367.0字节
  • Agriculture_Poultry Production

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 147073759.0字节
    • 下载大小: 146906099字节
    • 数据集大小: 147073759.0字节
  • Agriculture_Soil Sciences and Plant Nutrition

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 127354046.0字节
    • 下载大小: 127256326字节
    • 数据集大小: 127354046.0字节
  • Agriculture_Soil and Water Engineering and Conservation

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 134537041.0字节
    • 下载大小: 134387592字节
    • 数据集大小: 134537041.0字节

艺术设计领域

  • Arts Design_Arts

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 119548638.0字节
    • 下载大小: 119440736字节
    • 数据集大小: 119548638.0字节
  • Arts Design_Design

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 135083325.0字节
    • 下载大小: 134936083字节
    • 数据集大小: 135083325.0字节
  • Arts Design_Interior Architecture

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 141126586.0字节
    • 下载大小: 140979090字节
    • 数据集大小: 141126586.0字节
  • Arts Design_Urban Planning

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 147980852.0字节
    • 下载大小: 147794755字节
    • 数据集大小: 147980852.0字节

商业领域

  • Business_Business Administration

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 121104401.0字节
    • 下载大小: 120968900字节
    • 数据集大小: 121104401.0字节
  • Business_Communications and Media Studies

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 123893864.0字节
    • 下载大小: 123794867字节
    • 数据集大小: 123893864.0字节
  • Business_Decision Science and Operations Management

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 117426723.0字节
    • 下载大小: 117317155字节
    • 数据集大小: 117426723.0字节
  • Business_Entrepreneurship

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 129740439.0字节
    • 下载大小: 129590618字节
    • 数据集大小: 129740439.0字节
  • Business_Human Resource Management

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 134109342.0字节
    • 下载大小: 133946610字节
    • 数据集大小: 134109342.0字节
  • Business_Marketing

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 131082374.0字节
    • 下载大小: 130942488字节
    • 数据集大小: 131082374.0字节
  • Business_Public Administration

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 128436764.0字节
    • 下载大小: 128268709字节
    • 数据集大小: 128436764.0字节
  • Business_Strategic Management

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 129705598.0字节
    • 下载大小: 129565676字节
    • 数据集大小: 129705598.0字节

经济学领域

  • Economics_Accounting and Finance

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 130086798.0字节
    • 下载大小: 129970443字节
    • 数据集大小: 130086798.0字节
  • Economics_Banking and Insurance

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 125576327.0字节
    • 下载大小: 125457196字节
    • 数据集大小: 125576327.0字节
  • Economics_Environmental Economics

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 144396467.0字节
    • 下载大小: 144269317字节
    • 数据集大小: 144396467.0字节
  • Economics_Financial Economics

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 126345574.0字节
    • 下载大小: 126213407字节
    • 数据集大小: 126345574.0字节
  • Economics_International Trade

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 129266847.0字节
    • 下载大小: 129131077字节
    • 数据集大小: 129266847.0字节

教育领域

  • Education_Early Childhood Education

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 134842546.0字节
    • 下载大小: 134669041字节
    • 数据集大小: 134842546.0字节
  • Education_Educational Administration

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 129139609.0字节
    • 下载大小: 129009495字节
    • 数据集大小: 129139609.0字节
  • Education_Educational Psychology

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 132445380.0字节
    • 下载大小: 132314227字节
    • 数据集大小: 132445380.0字节
  • Education_Educational Technology

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 136349543.0字节
    • 下载大小: 136233919字节
    • 数据集大小: 136349543.0字节
  • Education_Elemantary Teacher Education

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 128929721.0字节
    • 下载大小: 128832448字节
    • 数据集大小: 128929721.0字节
  • Education_Foreign Language Education

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 132729799.0字节
    • 下载大小: 132576098字节
    • 数据集大小: 132729799.0字节
  • Education_Guidance and Counseling

    • 特征:
      • audio: 音频
      • sentence: 字符串
    • 分割:
      • test: 300个样本, 137961853.0字节
    • 下载大小: 137814518字节
    • 数据集大小: 137961853.0字节
  • Education_Mathematics and Science Education

    • 特征:
      • audio: 音频
      • sentence: 字符串
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作