Geolytica POIData.xyz Points of Interest (POI) Geo Data - India
收藏Datarade2024-04-19 收录
下载链接:
https://datarade.ai/data-products/geolytica-poidata-xyz-points-of-interest-poi-geo-data-india-geolytica
下载链接
链接失效反馈官方服务:
资源简介:
https://store.poidata.xyz/in Point-of-interest (POI) is defined as a physical entity (such as a business) in a geo location (point) which may be (of interest). We strive to provide the most accurate, complete and up to date point of interest datasets for all countries of the world. The India POI Dataset is one of our worldwide POI datasets. This is our process flow: Our machine learning systems continuously crawl for new POI data Our geoparsing and geocoding calculates their geo locations Our categorization systems cleanup and standardize the datasets Our data pipeline API publishes the datasets on our data store POI Data is in a constant flux - especially so during times of drastic change such as the Covid-19 pandemic. Every minute worldwide on an average day over 200 businesses will move, over 600 new businesses will open their doors and over 400 businesses will cease to exist. In today's interconnected world, of the approximately 200 million POIs worldwide, over 94% have a public online presence. As a new POI comes into existence its information will appear very quickly in location based social networks (LBSNs), other social media, pictures, websites, blogs, press releases. Soon after that, our state-of-the-art POI Information retrieval system will pick it up. We offer our customers perpetual data licenses for any dataset representing this ever changing information, downloaded at any given point in time. This makes our company's licensing model unique in the current Data as a Service - DaaS Industry. Our customers don't have to delete our data after the expiration of a certain "Term", regardless of whether the data was purchased as a one time snapshot, or via a recurring payment plan on our data update pipeline. The main differentiators between us vs the competition are our flexible licensing terms and our data freshness. The core attribute coverage for India is as follows: Poi Field Data Coverage (%) poi_name 100 brand 3 poi_tel 17 formatted_address 100 main_category 100 latitude 100 longitude 100 neighborhood 7 source_url 24 email 2 opening_hours 26 The dataset may be viewed online at https://store.poidata.xyz/in and a data sample may be downloaded at https://store.poidata.xyz/datafiles/in_sample.csv
https://store.poidata.xyz/in
兴趣点(Point-of-interest, POI)被定义为地理点位上的实体(如商业机构),具备被关注的价值。本团队致力于为全球所有国家提供最精准、完整且实时更新的兴趣点数据集。印度兴趣点数据集便是我们全球POI数据集之一。
我们的处理流程如下:
1. 我们的机器学习系统持续爬取新增POI数据;
2. 通过地理解析与地理编码技术计算其地理位置;
3. 依托分类系统对数据集进行清洗与标准化;
4. 最终通过数据流水线API将数据集发布至我们的数据商店。
POI数据始终处于动态变化之中,在新冠疫情(COVID-19 pandemic)这类剧变时期尤为明显。全球范围内,平日平均每分钟有超过200家商业机构迁址、600余家新机构开业,另有超400家机构停业。在如今互联互通的时代,全球约2亿个POI中,超过94%拥有公开的线上信息。当一个新POI诞生时,其相关信息会迅速出现在基于位置的社交网络(Location-based social networks, LBSNs)、其他社交媒体、图片、网站、博客与新闻稿中。紧随其后,我们的顶尖POI信息检索系统便会抓取这些信息。
我们为客户提供针对该动态变化数据集的永久数据许可,许可基于任意时刻下载的数据集版本。这使得本公司的授权模式在当前数据即服务(Data as a Service, DaaS)行业中独树一帜。无论客户购买的是一次性快照数据集,还是通过我们的数据更新流水线订购的定期付费订阅数据,均无需在特定"期限"届满后删除本数据集。
我们相较于竞品的核心差异化优势在于灵活的授权条款与数据新鲜度。印度数据集的核心属性覆盖情况如下:
| POI字段 | 数据覆盖率(%) |
| ---- | ---- |
| poi_name | 100 |
| brand | 3 |
| poi_tel | 17 |
| formatted_address | 100 |
| main_category | 100 |
| latitude | 100 |
| longitude | 100 |
| neighborhood | 7 |
| source_url | 24 |
| email | 2 |
| opening_hours | 26 |
用户可在线浏览该数据集:https://store.poidata.xyz/in,数据集样本可从https://store.poidata.xyz/datafiles/in_sample.csv 下载。
提供机构:
Geolytica
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集提供印度全境兴趣点(POI)地理信息,包含名称、地址、坐标等核心字段(覆盖率100%),通过机器学习系统实时更新动态数据。其特色在于提供永久数据许可,并保持行业领先的数据新鲜度,但部分辅助字段如品牌、电话等覆盖率较低。
以上内容由遇见数据集搜集并总结生成



