five

6DammK9/25110801-229M

收藏
Hugging Face2025-11-22 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/6DammK9/25110801-229M
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit --- # 25110801-229M # - [Apache Parquet](https://en.wikipedia.org/wiki/Apache_Parquet) *Dataset* from a MySQL *Database* dump. - [Author's article.](https://webbhk.substack.com/about) - [The Repository.](https://drive.google.com/drive/folders/13mzAvvufXC3QmH8OvMVx2juUi7qdnVLL) - [One of the ongoing site.](https://webbsite.0xmd.com/dbpub/) and [video](https://youtu.be/xV2RV3z4URc?si=u48eWYjcPbEgmxpV). - Data cutoff: **251101** (it keeps updating as on 251108) - *I will update more contents when I think it is more appropriate. Sorry for no further elaboration.* - *Carpe diem!* ## Contents inside ## - `parquet`: Raw data exported from MySQL. Data tables only. Empty tables are absent. - `parquet_uuid`: `parquet` with UUID for each row. - `parquet_fk`: Foreign Keys linking 2 UUIDs under `parquet_uuid`. - `dump`: "The riddle". *This folder triggered HF and blocked by upload, so I used rename.* ## How to use ## - Bring a Machine with ~~64GB~~ 128GB of RAM. 64GB for `parquet_*`, 112GB for `dump`. However the original MySQL only takes 6GB for *single user*. - *Good luck guessing this riddle. However I think it is quite obvious if you know the theories well. It is just a bit abstract, general, and simple.* ## Future Plan ## - Source code... I'll rename this repo when I really want to promote it. ~~The (dataset) codes will be in Github. Server code is likely "never exist".~~ See `static`. I made it pure frontend. - [Diagram of the dataset conversion](https://github.com/6DammK9/nai-anime-pure-negative-prompt/blob/main/ch97/parquet_neo4j.md) are located in Github whatsever. ## Hints on "the riddle" ## - [News article but not in English (0293).](https://www.am730.com.hk/column/%E8%B2%A1%E7%B6%93/%E4%BA%A4%E5%8F%89%E6%8C%81%E8%82%A1%E6%98%AF%E9%82%AA%E6%83%A1%E9%82%84%E6%98%AF%E5%96%84%E8%89%AF%E4%B9%8B%E6%A7%8D-/234671) - [Another news article (0273).](https://www.hkcd.com.hk/content/2017-12/22/content_1074882.html) ```cypher //Need to flip all the "sholders_holder" first MATCH (a:sholdings)-[rel:sholders_holder]->(b:persons) WITH rel CALL apoc.refactor.invert(rel) YIELD input, output RETURN output //Enjoy MATCH (v00:stocklistings{StockCode:'0273',StockExID:'1'}) WITH v00 CALL apoc.path.spanningTree(v00, { relationshipFilter: 'stocklistings_issue>|FK_issue_orgs>|FK_organisations_persons>|sholders_holder>|sholders_issue>' }) YIELD path WITH nodes(path) as list CALL apoc.nodes.cycles(list, { relTypes: [ "stocklistings_issue", "FK_issue_orgs", "FK_organisations_persons", "sholders_holder", "sholders_issue" ] }) YIELD path AS p00 WITH p00, nodes(p00) AS v01 MATCH p01=(:stocklistings)-[stocklistings_issue]->()-[FK_issue_orgs]->(v02:organisations) WHERE v02 IN v01 RETURN p00, p01 LIMIT 1000 ``` ## Visualising / quick start for "the riddle" ## - Host the entire `static` as a mini web server. - Read codes and change the auth content (e.g. password). - Use `?stockcode=0273&limit=1000` to change the query parameters. - *You will find that the code is suprisingly low. It is (really) abstract, general, and simple.*
提供机构:
6DammK9
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作