neuml-linkedin-202501
Source: ModelScope community. Updated 2025-11-07; indexed 2025-01-18.
Download link:
https://modelscope.cn/datasets/NeuML/neuml-linkedin-202501
Description:
# NeuML LinkedIn Company Posts
This dataset contains 12 months of [NeuML's LinkedIn Company Posts](https://www.linkedin.com/company/neuml), ending in January 2025. Each record holds the post text along with its engagement metrics.
It was created as follows:
1. Export the company posts from the analytics page, [see this link for instructions](https://www.linkedin.com/help/linkedin/answer/a551206).
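2. Run the following code to create the dataset.
```python
import os
import pandas as pd

# sheet_name=1 selects the second sheet; header=1 takes column names from its second row
df = pd.read_excel("export_data.xls", sheet_name=1, header=1)
# Drop every column that contains a missing value
df = df.dropna(axis="columns")
# Create the output directory if needed, then write the CSV
os.makedirs("data", exist_ok=True)
df.to_csv("data/posts.csv", index=False)
```
The same process can be used to create your own dataset. Once written, the dataset can be loaded locally as follows.
```python
from datasets import load_dataset
ds = load_dataset("data")
```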
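The `dropna(axis="columns")` call in the script above removes any column that has at least one missing value, which may discard metrics that are only partly filled in. The following is a minimal sketch of that behavior on a small in-memory frame. The column names and values are hypothetical stand-ins, not the actual fields of the LinkedIn export.

```python
import pandas as pd

# Hypothetical stand-in for the exported sheet; names and values are illustrative only.
raw = pd.DataFrame(
    {
        "Post title": ["Post A", "Post B", "Post C"],
        "Impressions": [120, 340, 95],
        "Video views": [None, 12.0, None],  # partly empty column
    }
)

# Same call as in the script: drop columns with at least one missing value.
clean = raw.dropna(axis="columns")

# Alternative: drop only columns that are entirely empty.
lenient = raw.dropna(axis="columns", how="all")

print(list(clean.columns))
print(list(lenient.columns))
```

If partly filled columns matter for your own export, `how="all"` is one option to consider; the original script uses the default, stricter behavior.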
Provider:
maas
Created:
2025-01-13



