Parsed DMOZ data
Source: DataONE. Updated: 2021-02-18. Indexed: 2024-06-08.
Download link:
https://search.dataone.org/view/sha256:07947364fcee8079381f5d49557c801b9fcc7e1c4f4f96a4b713a75a84e24fef
Description:
DMOZ is a large, communally maintained open directory that categorizes web content. The data are posted in a complex XML format. The Python scripts posted here were used to parse the data posted at http://rdf.dmoz.org/ on June 12, 2016, and to produce the CSV file posted here. Each row of the file has the structure "URL","Category 1","Category 2",... Because the categories are separated by commas, calling read_csv without the right options can be problematic. Code to read in the file is available at: https://gist.github.com/soodoku/a97e6cf2800429d1c541ac2fb65e4c98
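As a rough illustration of the row structure described above, the following sketch reads such a file with Python's standard csv module instead of read_csv. It is not the code from the linked gist. It assumes that rows may carry different numbers of category fields and that there is no header row; the function name read_dmoz_rows, the sample URLs, and the sample categories are all hypothetical.

```python
import csv
import io


def read_dmoz_rows(handle):
    """Yield (url, categories) pairs from an open CSV file handle.

    Assumes the first field of each row is the URL and every remaining
    field is one category level; rows may differ in length.
    """
    for row in csv.reader(handle):
        if not row:  # skip blank lines
            continue
        url, *categories = row
        yield url, categories


# Hypothetical sample data in the "URL","Category 1","Category 2",... layout.
sample = io.StringIO(
    '"http://example.com/","Arts","Music"\n'
    '"http://example.org/","Science","Biology","Genetics"\n'
)

rows = list(read_dmoz_rows(sample))
for url, categories in rows:
    print(url, "/".join(categories))
```

For the real file, the same function could be given a handle from open(path, newline="", encoding="utf-8"), where path points to the downloaded CSV; the encoding is an assumption and may need adjusting.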
Created:
2023-11-21



