five

linking2202/steam-games-dataset

收藏
Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/linking2202/steam-games-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 language: - en tags: - games - steam - video games - gamedev - game development task_categories: - text-generation - tabular-classification - tabular-regression - feature-extraction - zero-shot-classification - text-retrieval - text-ranking - summarization configs: - config_name: default data_files: - split: train path: data/train-* pretty_name: Steam Games Dataset size_categories: - 10K<n<100K dataset_info: features: - name: appID dtype: string - name: name dtype: string - name: release_date dtype: string - name: estimated_owners dtype: string - name: peak_ccu dtype: int64 - name: required_age dtype: int64 - name: price dtype: float64 - name: dlc_count dtype: int64 - name: detailed_description dtype: string - name: short_description dtype: string - name: supported_languages list: string - name: full_audio_languages list: string - name: reviews dtype: string - name: header_image dtype: string - name: website dtype: string - name: support_url dtype: string - name: support_email dtype: string - name: windows dtype: bool - name: mac dtype: bool - name: linux dtype: bool - name: metacritic_score dtype: int64 - name: metacritic_url dtype: string - name: user_score dtype: int64 - name: positive dtype: int64 - name: negative dtype: int64 - name: score_rank dtype: string - name: achievements dtype: int64 - name: recommendations dtype: int64 - name: notes dtype: string - name: average_playtime_forever dtype: int64 - name: average_playtime_2weeks dtype: int64 - name: median_playtime_forever dtype: int64 - name: median_playtime_2weeks dtype: int64 - name: developers list: string - name: publishers list: string - name: categories list: string - name: genres list: string - name: tags list: 'null' - name: screenshots list: string - name: movies list: 'null' - name: packages dtype: string splits: - name: train num_bytes: 442552363 num_examples: 124146 download_size: 191983414 dataset_size: 442552363 --- <p align="center"><img src="images/banner.png"/></p> # Overview Information of **more than 120,000 games** published on Steam. Maintained by **[Fronkon Games](https://github.com/FronkonGames)**. This dataset has been created with **[this code (MIT)](https://github.com/FronkonGames/Steam-Games-Scraper)** and use the API provided by _Steam_, the largest gaming platform on PC. Data is also collected from _Steam Spy_. Only published games, _no DLCs, episodes, music, videos, etc_. Here is a simple example of how to parse json information: ``` # Simple parse of the 'games.json' file. import os import json dataset = {} if os.path.exists('games.json'): with open('games.json', 'r', encoding='utf-8') as fin: text = fin.read() if len(text) > 0: dataset = json.loads(text) for app in dataset: appID = app # AppID, unique identifier for each app (string). game = dataset[app] name = game['name'] # Game name (string). releaseDate = game['release_date'] # Release date (string). estimatedOwners = game['estimated_owners'] # Estimated owners (string, e.g.: "0 - 20000"). peakCCU = game['peak_ccu'] # Number of concurrent users, yesterday (int). required_age = game['required_age'] # Age required to play, 0 if it is for all audiences (int). price = game['price'] # Price in USD, 0.0 if its free (float). dlcCount = game['dlc_count'] # Number of DLCs, 0 if you have none (int). longDesc = game['detailed_description'] # Detailed description of the game (string). shortDesc = game['short_description'] # Brief description of the game, # does not contain HTML tags (string). languages = game['supported_languages'] # Comma-separated enumeration of supporting languages. fullAudioLanguages = game['full_audio_languages'] # Comma-separated enumeration of languages with audio support. reviews = game['reviews'] # headerImage = game['header_image'] # Header image URL in the store (string). website = game['website'] # Game website (string). supportWeb = game['support_url'] # Game support URL (string). supportEmail = game['support_email'] # Game support email (string). supportWindows = game['windows'] # Does it support Windows? (bool). supportMac = game['mac'] # Does it support Mac? (bool). supportLinux = game['linux'] # Does it support Linux? (bool). metacriticScore = game['metacritic_score'] # Metacritic score, 0 if it has none (int). metacriticURL = game['metacritic_url'] # Metacritic review URL (string). userScore = game['user_score'] # Users score, 0 if it has none (int). positive = game['positive'] # Positive votes (int). negative = game['negative'] # Negative votes (int). scoreRank = game['score_rank'] # Score rank of the game based on user reviews (string). achievements = game['achievements'] # Number of achievements, 0 if it has none (int). recommens = game['recommendations'] # User recommendations, 0 if it has none (int). notes = game['notes'] # Extra information about the game content (string). averagePlaytime = game['average_playtime_forever'] # Average playtime since March 2009, in minutes (int). averageplaytime2W = game['average_playtime_2weeks'] # Average playtime in the last two weeks, in minutes (int). medianPlaytime = game['median_playtime_forever'] # Median playtime since March 2009, in minutes (int). medianPlaytime2W = game['median_playtime_2weeks'] # Median playtime in the last two weeks, in minutes (int). packages = game['packages'] # Available packages. for pack in packages: title = pack['title'] # Package title (string). packDesc = pack['description'] # Package description (string). subs = pack['subs'] # Subpackages. for sub in subs: text = sub['text'] # Subpackage title (string). subDesc = sub['description'] # Subpackage description (string). subPrice = sub['price'] # Subpackage price in USD (float). developers = game['developers'] # Game developers. for developer in developers: developerName = developer # Developer name (string). publishers = game['publishers'] # Game publishers. for publisher in publishers: publisherName = publisher # Publisher name (string). categories = game['categories'] # Game categories. for category in categories: categoryName = category # Category name (string). genres = game['genres'] # Game genres. for gender in genres: genderName = gender # Gender name (string). screenshots = game['scrennshots'] # Game screenshots. for screenshot in screenshots: scrennshotsURL = screenshot # Game screenshot URL (string). movies = game['movies'] # Game movies. for movie in movies: movieURL = movie # Game movie URL (string). tags = game['tags'] # Tags. for tag in tags: tagKey = tag # Tag key (string, int). ```
提供机构:
linking2202
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作