five

Predict HIV Progression预测艾滋病毒进展

收藏
阿里云天池2026-06-08 更新2024-03-07 收录
下载链接:
https://tianchi.aliyun.com/dataset/89335
下载链接
链接失效反馈
官方服务:
资源简介:
根据世界卫生组织的资料,自1981年首次被确认以来,艾滋病毒已在世界范围内导致2500万人死亡。近年来,已通过一系列疗法来控制感染。但是,病毒可能会围绕这些药物进化,因此对我们更好地了解病毒本身至关重要。 理解病毒的重要步骤是掌握其遗传蓝图。这项竞赛旨在通过让参赛者在HIV序列中找到可以预测感染严重程度变化的标记(通过病毒载量和CD4计数来衡量)来实现这一目标。 可以使用1,000名患者的记录来训练模型。为了预测患者病毒载量的改善,在治疗开始时,将向竞争对手提供其逆转录酶(RT),蛋白酶(PR)的核苷酸序列以及病毒载量和CD4计数的数据。在“背景技术”部分中对这些变量的科学进行了简短的讨论,但无需生物学知识即可在这场比赛中取得成功。竞争对手的预测将在包含692名患者的数据集上进行测试。 可获得500美元的奖金,获奖者还将有机会与竞赛主持人共同撰写论文。获奖者必须在提供任何奖金之前提供他们的方法。

According to the World Health Organization (WHO), since the human immunodeficiency virus (HIV) was first identified in 1981, it has caused 25 million deaths globally. In recent years, a range of therapies have been developed to control HIV infection. However, the virus can evolve in response to these medications, making it critical to gain a more comprehensive understanding of the virus itself. A key step in comprehending the virus is grasping its genetic blueprint. This competition aims to achieve this goal by challenging competitors to identify markers in HIV sequences that can predict changes in the severity of infection, as measured by viral load and CD4 count. Models can be trained using records from 1,000 patients. To predict improvements in patients' viral load, competitors will be provided with the nucleotide sequences of their reverse transcriptase (RT) and protease (PR), as well as data on viral load and CD4 count at the start of treatment. The science behind these variables is briefly discussed in the "Technical Background" section, but prior biological knowledge is not required to achieve success in this competition. Competitors' predictions will be tested on a dataset containing 692 patients. A $500 cash prize is available, and winners will also have the opportunity to co-author a paper with the competition organizers. Winners must disclose their methodology prior to receiving any prize funds.
提供机构:
阿里云天池
创建时间:
2021-01-27
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集来自天池竞赛,旨在利用HIV患者的逆转录酶和蛋白酶核苷酸序列,以及病毒载量和CD4计数等临床数据,预测感染严重程度的改善情况。训练集包含1000名患者记录,测试集包含692名患者记录,用于评估模型预测患者治疗16周后病毒载量是否显著降低的能力。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务