five

JesusIC/vota-con-la-chola-data

收藏
Hugging Face2026-02-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/JesusIC/vota-con-la-chola-data
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - es license: other task_categories: - tabular-classification pretty_name: Vota Con La Chola snapshots configs: - config_name: "admin_levels" data_files: - split: train path: - "snapshots/2026-03-05/parquet/admin_levels/*.parquet" - config_name: "causal_estimates" data_files: - split: train path: - "snapshots/2026-03-05/parquet/causal_estimates/*.parquet" - config_name: "document_fetches" data_files: - split: train path: - "snapshots/2026-03-05/parquet/document_fetches/*.parquet" - config_name: "domains" data_files: - split: train path: - "snapshots/2026-03-05/parquet/domains/*.parquet" - config_name: "genders" data_files: - split: train path: - "snapshots/2026-03-05/parquet/genders/*.parquet" - config_name: "indicator_observation_records" data_files: - split: train path: - "snapshots/2026-03-05/parquet/indicator_observation_records/*.parquet" - config_name: "indicator_points" data_files: - split: train path: - "snapshots/2026-03-05/parquet/indicator_points/*.parquet" - config_name: "indicator_series" data_files: - split: train path: - "snapshots/2026-03-05/parquet/indicator_series/*.parquet" - config_name: "infoelectoral_archivos_extraccion" data_files: - split: train path: - "snapshots/2026-03-05/parquet/infoelectoral_archivos_extraccion/*.parquet" - config_name: "infoelectoral_convocatoria_tipos" data_files: - split: train path: - "snapshots/2026-03-05/parquet/infoelectoral_convocatoria_tipos/*.parquet" - config_name: "infoelectoral_convocatorias" data_files: - split: train path: - "snapshots/2026-03-05/parquet/infoelectoral_convocatorias/*.parquet" - config_name: "infoelectoral_proceso_resultados" data_files: - split: train path: - "snapshots/2026-03-05/parquet/infoelectoral_proceso_resultados/*.parquet" - config_name: "infoelectoral_procesos" data_files: - split: train path: - "snapshots/2026-03-05/parquet/infoelectoral_procesos/*.parquet" - config_name: "ingestion_runs" data_files: - split: train path: - "snapshots/2026-03-05/parquet/ingestion_runs/*.parquet" - config_name: "institutions" data_files: - split: train path: - "snapshots/2026-03-05/parquet/institutions/*.parquet" - config_name: "intervention_events" data_files: - split: train path: - "snapshots/2026-03-05/parquet/intervention_events/*.parquet" - config_name: "interventions" data_files: - split: train path: - "snapshots/2026-03-05/parquet/interventions/*.parquet" - config_name: "legal_fragment_responsibilities" data_files: - split: train path: - "snapshots/2026-03-05/parquet/legal_fragment_responsibilities/*.parquet" - config_name: "legal_fragment_responsibility_evidence" data_files: - split: train path: - "snapshots/2026-03-05/parquet/legal_fragment_responsibility_evidence/*.parquet" - config_name: "legal_norm_fragments" data_files: - split: train path: - "snapshots/2026-03-05/parquet/legal_norm_fragments/*.parquet" - config_name: "legal_norm_lineage_edges" data_files: - split: train path: - "snapshots/2026-03-05/parquet/legal_norm_lineage_edges/*.parquet" - config_name: "legal_norms" data_files: - split: train path: - "snapshots/2026-03-05/parquet/legal_norms/*.parquet" - config_name: "liberty_delegated_enforcement_links" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_delegated_enforcement_links/*.parquet" - config_name: "liberty_delegated_enforcement_methodologies" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_delegated_enforcement_methodologies/*.parquet" - config_name: "liberty_enforcement_methodologies" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_enforcement_methodologies/*.parquet" - config_name: "liberty_enforcement_observations" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_enforcement_observations/*.parquet" - config_name: "liberty_indirect_methodologies" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_indirect_methodologies/*.parquet" - config_name: "liberty_indirect_responsibility_edges" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_indirect_responsibility_edges/*.parquet" - config_name: "liberty_irlc_methodologies" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_irlc_methodologies/*.parquet" - config_name: "liberty_proportionality_methodologies" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_proportionality_methodologies/*.parquet" - config_name: "liberty_proportionality_reviews" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_proportionality_reviews/*.parquet" - config_name: "liberty_restriction_assessments" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_restriction_assessments/*.parquet" - config_name: "liberty_right_categories" data_files: - split: train path: - "snapshots/2026-03-05/parquet/liberty_right_categories/*.parquet" - config_name: "mandates" data_files: - split: train path: - "snapshots/2026-03-05/parquet/mandates/*.parquet" - config_name: "money_contract_records" data_files: - split: train path: - "snapshots/2026-03-05/parquet/money_contract_records/*.parquet" - config_name: "money_subsidy_records" data_files: - split: train path: - "snapshots/2026-03-05/parquet/money_subsidy_records/*.parquet" - config_name: "parl_initiative_doc_extractions" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_initiative_doc_extractions/*.parquet" - config_name: "parl_initiative_documents" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_initiative_documents/*.parquet" - config_name: "parl_initiatives" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_initiatives/*.parquet" - config_name: "parl_vote_event_initiatives" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_vote_event_initiatives/*.parquet" - config_name: "parl_vote_events" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_vote_events/*.parquet" - config_name: "parl_vote_member_votes" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parl_vote_member_votes/*.parquet" - config_name: "parties" data_files: - split: train path: - "snapshots/2026-03-05/parquet/parties/*.parquet" - config_name: "party_aliases" data_files: - split: train path: - "snapshots/2026-03-05/parquet/party_aliases/*.parquet" - config_name: "person_identifiers" data_files: - split: train path: - "snapshots/2026-03-05/parquet/person_identifiers/*.parquet" - config_name: "person_name_aliases" data_files: - split: train path: - "snapshots/2026-03-05/parquet/person_name_aliases/*.parquet" - config_name: "person_public_data_queue" data_files: - split: train path: - "snapshots/2026-03-05/parquet/person_public_data_queue/*.parquet" - config_name: "persons" data_files: - split: train path: - "snapshots/2026-03-05/parquet/persons/*.parquet" - config_name: "placsp_contract_detail_documents" data_files: - split: train path: - "snapshots/2026-03-05/parquet/placsp_contract_detail_documents/*.parquet" - config_name: "placsp_contract_detail_records" data_files: - split: train path: - "snapshots/2026-03-05/parquet/placsp_contract_detail_records/*.parquet" - config_name: "policy_axes" data_files: - split: train path: - "snapshots/2026-03-05/parquet/policy_axes/*.parquet" - config_name: "policy_event_axis_scores" data_files: - split: train path: - "snapshots/2026-03-05/parquet/policy_event_axis_scores/*.parquet" - config_name: "policy_events" data_files: - split: train path: - "snapshots/2026-03-05/parquet/policy_events/*.parquet" - config_name: "policy_instruments" data_files: - split: train path: - "snapshots/2026-03-05/parquet/policy_instruments/*.parquet" - config_name: "roles" data_files: - split: train path: - "snapshots/2026-03-05/parquet/roles/*.parquet" - config_name: "sanction_infraction_type_mappings" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_infraction_type_mappings/*.parquet" - config_name: "sanction_infraction_types" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_infraction_types/*.parquet" - config_name: "sanction_municipal_ordinance_fragments" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_municipal_ordinance_fragments/*.parquet" - config_name: "sanction_municipal_ordinances" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_municipal_ordinances/*.parquet" - config_name: "sanction_norm_catalog" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_norm_catalog/*.parquet" - config_name: "sanction_norm_fragment_links" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_norm_fragment_links/*.parquet" - config_name: "sanction_procedural_kpi_definitions" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_procedural_kpi_definitions/*.parquet" - config_name: "sanction_procedural_metrics" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_procedural_metrics/*.parquet" - config_name: "sanction_volume_observations" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_volume_observations/*.parquet" - config_name: "sanction_volume_sources" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sanction_volume_sources/*.parquet" - config_name: "sources" data_files: - split: train path: - "snapshots/2026-03-05/parquet/sources/*.parquet" - config_name: "territories" data_files: - split: train path: - "snapshots/2026-03-05/parquet/territories/*.parquet" - config_name: "text_documents" data_files: - split: train path: - "snapshots/2026-03-05/parquet/text_documents/*.parquet" - config_name: "topic_evidence" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topic_evidence/*.parquet" - config_name: "topic_evidence_reviews" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topic_evidence_reviews/*.parquet" - config_name: "topic_positions" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topic_positions/*.parquet" - config_name: "topic_set_topics" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topic_set_topics/*.parquet" - config_name: "topic_sets" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topic_sets/*.parquet" - config_name: "topics" data_files: - split: train path: - "snapshots/2026-03-05/parquet/topics/*.parquet" --- # Vota Con La Chola - Snapshots ETL Dataset de snapshots públicos del proyecto `JesusIC/vota-con-la-chola-data`. Repositorio fuente: [https://github.com/gsusI/vota-con-la-chola](https://github.com/gsusI/vota-con-la-chola) Contenido por snapshot (capa raw + capa procesada): - `snapshots/2026-03-05/published/*`: capa raw reproducible (artefactos canónicos JSON/JSON.GZ). - `snapshots/2026-03-05/parquet/<tabla>/part-*.parquet`: tablas navegables en el visor Data Studio. - `snapshots/2026-03-05/sources/<source_id>.json`: procedencia legal por fuente (licencia/aviso, obligaciones, terms_url, estado de verificación). - Tablas excluidas por privacidad (default público): `lost_and_found`, `raw_fetches`, `run_fetches`, `source_records`. Usa `--allow-sensitive-parquet` solo en repos privados. - `ingestion_runs.csv`: historial de corridas de ingesta. - `source_records_by_source.csv`: conteos por fuente para la fecha del snapshot. - `explorer_schema.json`: contrato de esquema (tablas/PK/FK) para exploración en navegador. - `manifest.json` y `checksums.sha256`: trazabilidad e integridad. Licencia del repo Hugging Face: - `license: other` porque el snapshot mezcla múltiples licencias/avisos por fuente. - La licencia/condiciones aplicables están detalladas por `source_id` en `sources/*.json`. Resumen legal por fuente (snapshot actual): | source_id | registros | verificación | base legal/licencia | terms_url | |---|---:|---|---|---| | `placsp_autonomico` | 2 | verificado | PLACSP: reproducción autorizada con cita de origen; datasets vinculados a datos abiertos de Hacienda | [link](https://datos.gob.es/es/aviso-legal) | | `placsp_sindicacion` | 3 | verificado | PLACSP: reproducción autorizada con cita de origen; datasets vinculados a datos abiertos de Hacienda | [link](https://datos.gob.es/es/aviso-legal) | Cautelas de cumplimiento: - Este dataset no implica respaldo institucional de las fuentes. - Cuando una fuente exige integridad/no alteración para mirror, mantener `published/*` como capa raw y declarar transformaciones en derivados. - Si hay datos personales, aplicar minimización, evitar reidentificación y revisar compatibilidad de finalidad (GDPR). - Fuentes con estado `parcial`, `pendiente` o `no verificado` requieren revisión legal adicional antes de reutilización comercial sensible. Ruta del último snapshot publicado en este commit: - `snapshots/2026-03-05` (snapshot_date=2026-03-05) Actualización: - `just etl-publish-hf-dry-run` para validar empaquetado. - `just etl-publish-hf` para publicar actualización.
提供机构:
JesusIC
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作