JesusIC/vota-con-la-chola-data
收藏Hugging Face2026-02-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/JesusIC/vota-con-la-chola-data
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- es
license: other
task_categories:
- tabular-classification
pretty_name: Vota Con La Chola snapshots
configs:
- config_name: "admin_levels"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/admin_levels/*.parquet"
- config_name: "causal_estimates"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/causal_estimates/*.parquet"
- config_name: "document_fetches"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/document_fetches/*.parquet"
- config_name: "domains"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/domains/*.parquet"
- config_name: "genders"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/genders/*.parquet"
- config_name: "indicator_observation_records"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/indicator_observation_records/*.parquet"
- config_name: "indicator_points"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/indicator_points/*.parquet"
- config_name: "indicator_series"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/indicator_series/*.parquet"
- config_name: "infoelectoral_archivos_extraccion"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/infoelectoral_archivos_extraccion/*.parquet"
- config_name: "infoelectoral_convocatoria_tipos"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/infoelectoral_convocatoria_tipos/*.parquet"
- config_name: "infoelectoral_convocatorias"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/infoelectoral_convocatorias/*.parquet"
- config_name: "infoelectoral_proceso_resultados"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/infoelectoral_proceso_resultados/*.parquet"
- config_name: "infoelectoral_procesos"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/infoelectoral_procesos/*.parquet"
- config_name: "ingestion_runs"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/ingestion_runs/*.parquet"
- config_name: "institutions"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/institutions/*.parquet"
- config_name: "intervention_events"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/intervention_events/*.parquet"
- config_name: "interventions"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/interventions/*.parquet"
- config_name: "legal_fragment_responsibilities"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/legal_fragment_responsibilities/*.parquet"
- config_name: "legal_fragment_responsibility_evidence"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/legal_fragment_responsibility_evidence/*.parquet"
- config_name: "legal_norm_fragments"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/legal_norm_fragments/*.parquet"
- config_name: "legal_norm_lineage_edges"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/legal_norm_lineage_edges/*.parquet"
- config_name: "legal_norms"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/legal_norms/*.parquet"
- config_name: "liberty_delegated_enforcement_links"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_delegated_enforcement_links/*.parquet"
- config_name: "liberty_delegated_enforcement_methodologies"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_delegated_enforcement_methodologies/*.parquet"
- config_name: "liberty_enforcement_methodologies"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_enforcement_methodologies/*.parquet"
- config_name: "liberty_enforcement_observations"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_enforcement_observations/*.parquet"
- config_name: "liberty_indirect_methodologies"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_indirect_methodologies/*.parquet"
- config_name: "liberty_indirect_responsibility_edges"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_indirect_responsibility_edges/*.parquet"
- config_name: "liberty_irlc_methodologies"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_irlc_methodologies/*.parquet"
- config_name: "liberty_proportionality_methodologies"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_proportionality_methodologies/*.parquet"
- config_name: "liberty_proportionality_reviews"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_proportionality_reviews/*.parquet"
- config_name: "liberty_restriction_assessments"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_restriction_assessments/*.parquet"
- config_name: "liberty_right_categories"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/liberty_right_categories/*.parquet"
- config_name: "mandates"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/mandates/*.parquet"
- config_name: "money_contract_records"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/money_contract_records/*.parquet"
- config_name: "money_subsidy_records"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/money_subsidy_records/*.parquet"
- config_name: "parl_initiative_doc_extractions"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_initiative_doc_extractions/*.parquet"
- config_name: "parl_initiative_documents"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_initiative_documents/*.parquet"
- config_name: "parl_initiatives"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_initiatives/*.parquet"
- config_name: "parl_vote_event_initiatives"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_vote_event_initiatives/*.parquet"
- config_name: "parl_vote_events"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_vote_events/*.parquet"
- config_name: "parl_vote_member_votes"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parl_vote_member_votes/*.parquet"
- config_name: "parties"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/parties/*.parquet"
- config_name: "party_aliases"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/party_aliases/*.parquet"
- config_name: "person_identifiers"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/person_identifiers/*.parquet"
- config_name: "person_name_aliases"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/person_name_aliases/*.parquet"
- config_name: "person_public_data_queue"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/person_public_data_queue/*.parquet"
- config_name: "persons"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/persons/*.parquet"
- config_name: "placsp_contract_detail_documents"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/placsp_contract_detail_documents/*.parquet"
- config_name: "placsp_contract_detail_records"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/placsp_contract_detail_records/*.parquet"
- config_name: "policy_axes"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/policy_axes/*.parquet"
- config_name: "policy_event_axis_scores"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/policy_event_axis_scores/*.parquet"
- config_name: "policy_events"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/policy_events/*.parquet"
- config_name: "policy_instruments"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/policy_instruments/*.parquet"
- config_name: "roles"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/roles/*.parquet"
- config_name: "sanction_infraction_type_mappings"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_infraction_type_mappings/*.parquet"
- config_name: "sanction_infraction_types"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_infraction_types/*.parquet"
- config_name: "sanction_municipal_ordinance_fragments"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_municipal_ordinance_fragments/*.parquet"
- config_name: "sanction_municipal_ordinances"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_municipal_ordinances/*.parquet"
- config_name: "sanction_norm_catalog"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_norm_catalog/*.parquet"
- config_name: "sanction_norm_fragment_links"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_norm_fragment_links/*.parquet"
- config_name: "sanction_procedural_kpi_definitions"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_procedural_kpi_definitions/*.parquet"
- config_name: "sanction_procedural_metrics"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_procedural_metrics/*.parquet"
- config_name: "sanction_volume_observations"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_volume_observations/*.parquet"
- config_name: "sanction_volume_sources"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sanction_volume_sources/*.parquet"
- config_name: "sources"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/sources/*.parquet"
- config_name: "territories"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/territories/*.parquet"
- config_name: "text_documents"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/text_documents/*.parquet"
- config_name: "topic_evidence"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topic_evidence/*.parquet"
- config_name: "topic_evidence_reviews"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topic_evidence_reviews/*.parquet"
- config_name: "topic_positions"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topic_positions/*.parquet"
- config_name: "topic_set_topics"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topic_set_topics/*.parquet"
- config_name: "topic_sets"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topic_sets/*.parquet"
- config_name: "topics"
data_files:
- split: train
path:
- "snapshots/2026-03-05/parquet/topics/*.parquet"
---
# Vota Con La Chola - Snapshots ETL
Dataset de snapshots públicos del proyecto `JesusIC/vota-con-la-chola-data`.
Repositorio fuente: [https://github.com/gsusI/vota-con-la-chola](https://github.com/gsusI/vota-con-la-chola)
Contenido por snapshot (capa raw + capa procesada):
- `snapshots/2026-03-05/published/*`: capa raw reproducible (artefactos canónicos JSON/JSON.GZ).
- `snapshots/2026-03-05/parquet/<tabla>/part-*.parquet`: tablas navegables en el visor Data Studio.
- `snapshots/2026-03-05/sources/<source_id>.json`: procedencia legal por fuente (licencia/aviso, obligaciones, terms_url, estado de verificación).
- Tablas excluidas por privacidad (default público): `lost_and_found`, `raw_fetches`, `run_fetches`, `source_records`. Usa `--allow-sensitive-parquet` solo en repos privados.
- `ingestion_runs.csv`: historial de corridas de ingesta.
- `source_records_by_source.csv`: conteos por fuente para la fecha del snapshot.
- `explorer_schema.json`: contrato de esquema (tablas/PK/FK) para exploración en navegador.
- `manifest.json` y `checksums.sha256`: trazabilidad e integridad.
Licencia del repo Hugging Face:
- `license: other` porque el snapshot mezcla múltiples licencias/avisos por fuente.
- La licencia/condiciones aplicables están detalladas por `source_id` en `sources/*.json`.
Resumen legal por fuente (snapshot actual):
| source_id | registros | verificación | base legal/licencia | terms_url |
|---|---:|---|---|---|
| `placsp_autonomico` | 2 | verificado | PLACSP: reproducción autorizada con cita de origen; datasets vinculados a datos abiertos de Hacienda | [link](https://datos.gob.es/es/aviso-legal) |
| `placsp_sindicacion` | 3 | verificado | PLACSP: reproducción autorizada con cita de origen; datasets vinculados a datos abiertos de Hacienda | [link](https://datos.gob.es/es/aviso-legal) |
Cautelas de cumplimiento:
- Este dataset no implica respaldo institucional de las fuentes.
- Cuando una fuente exige integridad/no alteración para mirror, mantener `published/*` como capa raw y declarar transformaciones en derivados.
- Si hay datos personales, aplicar minimización, evitar reidentificación y revisar compatibilidad de finalidad (GDPR).
- Fuentes con estado `parcial`, `pendiente` o `no verificado` requieren revisión legal adicional antes de reutilización comercial sensible.
Ruta del último snapshot publicado en este commit:
- `snapshots/2026-03-05` (snapshot_date=2026-03-05)
Actualización:
- `just etl-publish-hf-dry-run` para validar empaquetado.
- `just etl-publish-hf` para publicar actualización.
提供机构:
JesusIC



