Dataset with four years of condition monitoring technical language annotations from paper machine industries in northern Sweden
收藏DataCite Commons2025-09-22 更新2025-04-16 收录
下载链接:
https://researchdata.se/catalogue/dataset/2023-257
下载链接
链接失效反馈官方服务:
资源简介:
This dataset consists of four years of technical language annotations from two paper machines in northern Sweden, structured as a Pandas dataframe. The same data is also available as a semicolon-separated .csv file. The data consists of two columns, where the first column corresponds to annotation note contents, and the second column corresponds to annotation titles. The annotations are in Swedish, and processed so that all mentions of personal information are replaced with the string ‘egennamn’, meaning “personal name” in Swedish. Each row corresponds to one annotation with the corresponding title.
Data can be accessed in Python with:
import pandas as pd
annotations_df = pd.read_pickle("Technical_Language_Annotations.pkl")
annotation_contents = annotations_df['noteComment']
annotation_titles = annotations_df['title']
提供机构:
Luleå University of Technology
创建时间:
2023-12-21



