five

Fast Generalized Linear Models by Database Sampling and One-Step Polishing

收藏
Taylor & Francis Group2021-09-29 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Fast_generalised_linear_models_by_database_sampling_and_one-step_polishing/8063768/3
下载链接
链接失效反馈
官方服务:
资源简介:
In this article, I show how to fit a generalized linear model to <i>N</i> observations on <i>p</i> variables stored in a relational database, using one sampling query and one aggregation query, as long as N12+δ observations can be stored in memory, for some δ&gt;0. The resulting estimator is fully efficient and asymptotically equivalent to the maximum likelihood estimator, and so its variance can be estimated from the Fisher information in the usual way. A proof-of-concept implementation uses R with MonetDB and with SQLite, and could easily be adapted to other popular databases. I illustrate the approach with examples of taxi-trip data in New York City and factors related to car color in New Zealand. Supplementary materials for this article are available online.
提供机构:
Lumley, Thomas
创建时间:
2021-09-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作