five

SWF Financials: Virtual Company Data (1996-2025)

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/xz26m9ypz9
下载链接
链接失效反馈
官方服务:
资源简介:
Overview This dataset provides a synthetic "Income Statement" (Profit & Loss) for a virtual mega-corporation, covering 30 years of business operations from 1996 to 2025. The data is designed to simulate a realistic corporate financial lifecycle, useful for financial time-series analysis, forecasting, and educational purposes. Dataset Composition The dataset contains 120 quarters of financial data divided into two distinct periods: 1996–2011: Projected and modeled data (64 quarters) based on historical financial trends. 2012–2025: Aggregated financial data derived from real SEC EDGAR filings (56 quarters), representing a consolidated view of large-scale corporate performance. Key Metrics Included Each record includes standard accounting metrics: Revenue and Cost of Revenue Gross Profit and Operating Expenses Operating Income and Net Income Calculated margins (Gross, Operating, and Net Margins) Data Quality Flags indicating the source (Projected vs. Complete) and validity. Files Included swf_financials.csv: The primary dataset. SWF_FINANCIALS_SCHEMA.md: Detailed column definitions and data types. SWF_ASSUMPTIONS.md: Documentation of the methodology used for data aggregation and projection. financial_preprocessing_pipeline.py: The Python script used to process and clean the raw data. examples.ipynb: A Jupyter Notebook demonstrating basic analysis and visualization of the dataset. Potential Use Cases Training machine learning models for financial forecasting. Backtesting algorithmic trading strategies. Academic teaching of accounting principles and margin analysis. Benchmarking corporate performance against synthetic "average" mega-corporations. Important Research Note Please note that while the 2012–2025 data is based on real SEC filings, this is a synthetic aggregation and does not represent a single specific real-world company. It is intended for research and educational applications only.
创建时间:
2026-01-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作