Collocation and Contextual Analysis of 'Anglais' in Algerian and French Press Corpora
收藏DataCite Commons2025-09-30 更新2026-04-25 收录
下载链接:
https://figshare.com/articles/dataset/Collocate_Analysis_and_KWIC_of_Anglais/30251800/2
下载链接
链接失效反馈官方服务:
资源简介:
This file contains a collocate analysis dataset for the term "anglais" (English) across three corpora: AlgPress, FrenPress, and OppPress. Each row represents a collocate (a word frequently appearing near "anglais") with associated metrics. Columns include:- **Index**: Unique identifier for each collocate entry.- **Corpus**: Source corpus (AlgPress, FrenPress, or OppPress).- **Position**: Position of the collocate relative to the node word ("L" for left, "R" for right, "M" for mixed).- **Collocate**: The word co-occurring with "anglais."- **Stat**: Statistical measure of association strength (likely log-likelihood or similar).- **LogDice**: LogDice score, a measure of collocation strength.- **Freq (coll)**: Frequency of the collocate in the context of "anglais."- **Freq (corpus)**: Total frequency of the collocate in the corpus.<br>The dataset is useful for linguistic research, particularly in analyzing the contextual use of "anglais" in Algerian and French press corpora, focusing on educational and linguistic policy discussions. It supports studies in corpus linguistics, sociolinguistics, and language policy in multilingual contexts.Usage Notes: The file is in CSV format, suitable for analysis with tools like R, Python (pandas), or spreadsheet software. Researchers should note the specific statistical measures (Stat and LogDice) for interpreting collocation significance. The data can be used to compare linguistic patterns across the three corpora, reflecting different perspectives on English language adoption in Algeria.
提供机构:
figshare
创建时间:
2025-09-30



