Online bipartite matching methodology for resources with major epidemics: adaptive time window based on reinforcement learning
收藏DataCite Commons2025-06-12 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=d0170cadc71b4b6e8155ff70b08a93a3
下载链接
链接失效反馈官方服务:
资源简介:
This paper studies online matching problem of anti-epidemic resources between demanders and suppliers on the Internet of healthcare systems in a major outbreak, considering the heterogeneity of suppliers and demanders respectively. A time-window-based multi-stage online dynamic bipartite matching model is formulated, which can be transformed into a Markov decision process. The paper proposes adaptive time window batch bipartite matching algorithm based on reinforcement learning with nearest neighbor first heuristic strategy to allocate anti-epidemic resources. The findings indicate that while the matching rate continues to rise, the average waiting time exhibits a tendency of first reducing and then increasing as the matching time window gets longer. This implies that a manager should adjust the matching time window in accordance with the epidemic's development tendency and the availability of resources. It demonstrates that keep exploring the matching time window at a high level with taking the acceptable waiting time into account.
提供机构:
Science Data Bank
创建时间:
2025-06-12



