
Online bipartite matching methodology for resources with major epidemics: adaptive time window based on reinforcement learning

DataCite Commons | Updated: 2025-06-12 | Indexed: 2026-05-05
Download link:
https://www.scidb.cn/detail?dataSetId=d0170cadc71b4b6e8155ff70b08a93a3
Description:
This paper studies the online matching problem of anti-epidemic resources between demanders and suppliers in Internet of Healthcare systems during a major outbreak, accounting for the heterogeneity of both suppliers and demanders. A time-window-based, multi-stage online dynamic bipartite matching model is formulated, which can be transformed into a Markov decision process. The paper proposes an adaptive time window batch bipartite matching algorithm, based on reinforcement learning with a nearest-neighbor-first heuristic strategy, to allocate anti-epidemic resources. The findings indicate that, as the matching time window gets longer, the matching rate continues to rise while the average waiting time first decreases and then increases. This implies that a manager should adjust the matching time window in accordance with the epidemic's development trend and the availability of resources. The results also suggest continuing to explore the matching time window at a high level while taking the acceptable waiting time into account.
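The batch matching idea in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes a fixed time window instead of the reinforcement-learning-driven adaptive window, treats "nearest neighbor first" as a greedy pairing by Euclidean distance, and matches each demander to at most one supplier. The names `Agent`, `nearest_neighbor_first`, and `batch_match` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class Agent:
    ident: str
    arrival: int  # time step at which the agent appears (assumed discrete time)
    x: float
    y: float


def nearest_neighbor_first(demanders, suppliers):
    """Greedy heuristic: repeatedly pair the closest remaining demander and supplier."""
    pairs = sorted(
        (hypot(d.x - s.x, d.y - s.y), d.ident, s.ident)
        for d in demanders
        for s in suppliers
    )
    used_d, used_s, matches = set(), set(), []
    for _, d_id, s_id in pairs:
        if d_id not in used_d and s_id not in used_s:
            used_d.add(d_id)
            used_s.add(s_id)
            matches.append((d_id, s_id))
    return matches


def batch_match(demanders, suppliers, window, horizon):
    """Match waiting agents at the end of every window of `window` time steps.

    Returns (matches, matching_rate, average_wait), where each match is
    (demander id, supplier id, match time) and the average wait covers
    matched demanders only.
    """
    waiting_d, waiting_s = {}, {}
    matched, waits = [], []
    for t in range(horizon + 1):
        # Admit agents arriving at time t into the waiting pools.
        for d in demanders:
            if d.arrival == t:
                waiting_d[d.ident] = d
        for s in suppliers:
            if s.arrival == t:
                waiting_s[s.ident] = s
        # Run one batch matching whenever a window closes.
        if t > 0 and t % window == 0:
            batch = nearest_neighbor_first(
                list(waiting_d.values()), list(waiting_s.values())
            )
            for d_id, s_id in batch:
                waits.append(t - waiting_d[d_id].arrival)
                matched.append((d_id, s_id, t))
                del waiting_d[d_id]
                del waiting_s[s_id]
    rate = len(matched) / len(demanders) if demanders else 0.0
    avg_wait = sum(waits) / len(waits) if waits else 0.0
    return matched, rate, avg_wait
```

Calling `batch_match` with the same arrivals and several values of `window` gives a rough way to compare matching rate against average waiting time. An adaptive scheme such as the one described would choose `window` at each stage rather than fix it in advance.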
Provider:
Science Data Bank
Created:
2025-06-12