
Triple-BERT: Do We Really Need Multi-Agent Reinforcement Learning for Order Dispatching on Ride-Sharing Platforms?

Posted on 2026-1-4 23:18:42
Abstract: On-demand ride-sharing platforms, such as Uber and Lyft, face the complex real-time challenge of bundling and matching passengers with different origins and destinations to available vehicles, while dealing with significant system uncertainties. Due to the large number of drivers and orders, order dispatching is often tackled using Multi-Agent Reinforcement Learning (MARL). However, traditional MARL methods struggle to capture global information and lack cooperation among workers, while Centralized Training Decentralized Execution (CTDE) MARL methods suffer from dimensionality issues. To address these challenges, we propose Triple-BERT, a centralized Single Agent Reinforcement Learning method tailored for large-scale order dispatching on ride-sharing platforms. Based on a variant of TD3, our approach breaks down the joint action probability into individual driver action probabilities to handle the vast action space. To deal with the extensive observation space, we introduce a novel BERT-based network that uses parameter reuse to manage parameter growth as the number of drivers and orders increases, and an attention mechanism to capture the complex relationships among drivers and orders. Our method is validated using a real-world ride-hailing dataset from Manhattan, showing an approximately 11.95% improvement over current state-of-the-art methods, with a 4.26% increase in served orders and a 22.25% reduction in pickup times. Our code, trained model parameters, and processed data are publicly available at the repository https://github.com/RS2002/Triple-BERT.
Updated: 2025-12-31 05:05:23
Categories: cs.LG, cs.AI, cs.MA
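
The abstract mentions breaking the joint action probability down into individual driver action probabilities. As a rough illustration of why that helps, here is a minimal sketch, not the authors' implementation. It assumes each driver picks one option (an order, or "stay idle") independently given a row of scores, so the joint log-probability is a sum of per-driver log-probabilities. The names `softmax`, `joint_log_prob`, and the score matrix are hypothetical, and the sketch ignores the constraint that one order cannot go to two drivers.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def joint_log_prob(score_matrix, actions):
    """Log-probability of a joint action under a per-driver factorization.

    score_matrix[i][j] is a hypothetical score for driver i choosing option j
    (an order index, with the last column meaning "stay idle").
    actions[i] is the option chosen by driver i.
    Assumption: drivers' choices are independent given the scores.
    """
    total = 0.0
    for scores, action in zip(score_matrix, actions):
        probs = softmax(scores)
        total += math.log(probs[action])
    return total


# Two drivers, two orders plus an idle option (three columns per driver).
scores = [[2.0, 0.5, 0.0],
          [0.1, 1.5, 0.0]]
actions = [0, 1]  # driver 0 takes order 0, driver 1 takes order 1

lp = joint_log_prob(scores, actions)

# Size comparison: the factorized form needs one score per (driver, option),
# while an explicit joint distribution enumerates every combination.
num_drivers, num_options = len(scores), len(scores[0])
factorized_size = num_drivers * num_options
joint_size = num_options ** num_drivers
```

With D drivers and K options each, the factorized form scores D × K entries, while an explicit joint distribution would have K^D entries, which is the action-space growth the abstract refers to.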
