【ＡＩ】美團LongCat團隊發布並開源VitaBench大模型評測基準

　　美團LongCat團隊20日正式發布當前高度貼近真實生活場景、面向複雜問題的大模型智能體評測基準--VitaBench(Versatile Interactive Tasks Benchmark)，並已全面開源。

　　據官方介紹，VitaBench以外賣點餐、餐廳就餐、旅遊出行三大高頻真實生活場景為典型載體，構建了包含66個工具的交互式評測環境，並進行了跨場景的綜合任務設計。例如，在旅遊規劃任務中，要求智能體通過思考、調用工具和用戶交互，完整執行從買好票到訂好餐廳的終端狀態。
《經濟通通訊社21日專訊》

【香港好去處】2025去邊最好玩？etnet為你提供全港最齊盛事活動，所有資訊盡在掌握！► 即睇

上一篇新聞︰21/10/2025 09:24 恒指高開301點報26160，科指升108點報6042，阿里、蘋概股造好

下一篇新聞︰21/10/2025 09:21 《異動股》舜宇光學科技等３家公司開市競價異動

其他

21/10/2025 09:31 《異動股》舜宇高開4%領漲蘋概股，iPhone 17熱賣隔晚蘋果股價破頂

21/10/2025 09:31 《異動股》紫金（０２８９９）曾升逾３％，現報３２﹒９４元

21/10/2025 09:25 《異動股》阿里高開３﹒５％，天貓雙１１開賣首小時１８９１９…

21/10/2025 09:19 《本港樓市》錦上路「柏瓏ＩＩＩ」九日累收逾４７００票超購３…

21/10/2025 09:15 肖飛與仇廣宇加入美團決策層Ｓ－ｔｅａｍ，分管軟硬件服務和Ｋ…

備註：	即時報價更新時間為21/10/2025 17:59
	港股即時基本市場行情由香港交易所提供; 香港交易所指定免費發放即時基本市場行情的網站

經濟通
強化版MQ
強化版TQ
財曆
Mobile
Web

客務熱線︰(852) 2880 7004 客務郵箱︰cs@etnet.com.hk
關於我們 | 產品服務 | 廣告查詢 | 聯絡我們 | 私隱政策 | 使用條款 | 網站導航 | 有用連結 | RSS新聞

Copyright 2025 ET Net Limited. http://www.etnet.com.hk ET Net Limited, HKEx Information Services Limited, its Holding Companies and/or any Subsidiaries of such holding companies, and Third Party Information Providers endeavour to ensure the availability, completeness, timeliness, accuracy and reliability of the information provided but do not guarantee its availability, completeness, timeliness, accuracy or reliability and accept no liability (whether in tort or contract or otherwise) any loss or damage arising directly or indirectly from any inaccuracies, interruption, incompleteness, delay, omissions, or any decision made or action taken by you or any third party in reliance upon the information provided. The quotes, charts, commentaries and buy/sell ratings on this website should be used as references only with your own discretion. ET Net Limited is not soliciting any subscriber or site visitor to execute any trade. Any trades executed following the commentaries and buy/sell ratings on this website are taken at your own risk for your own account.

《經濟通》所刊的署名及／或不署名文章，相關內容屬作者個人意見，並不代表《經濟通》立場，《經濟通》所扮演的角色是提供一個自由言論平台。