• 中国科学学与科技政策研究会
  • 中国科学院科技战略咨询研究院
  • 清华大学科学技术与社会研究中心
ISSN 1003-2053 CN 11-1805/G3

科学学研究 ›› 2026, Vol. 44 ›› Issue (1): 41-50.

• 专稿 • 上一篇    下一篇

AI 和人类顺序性协同的管理决策过程———来自期刊论文初审筛选的启示

史会斌1,李成汉2   

  1. 1. 西安电子科技大学
    2. 同济大学
  • 收稿日期:2025-08-16 修回日期:2025-11-23 出版日期:2026-01-15 发布日期:2026-01-19
  • 通讯作者: 李成汉
  • 基金资助:
    商业生态视角下先动企业与跟随企业数字化转型战略协同机制的研究

The Process of Sequential AI-Human Collaborative Decision-Making: Insights from Initial Manuscript Screening in Journal Peer Review

  • Received:2025-08-16 Revised:2025-11-23 Online:2026-01-15 Published:2026-01-19

摘要: 本文深入期刊论文初审筛选决策的具体过程,使用大语言模型,针对“先AI后人类”的顺序性管理决策,提出了提高“人类-AI”系统协同性的决策方法。本文指出“先AI后人类”的顺序性管理决策的关键是在对AI决策方案筛选效能评估基础上,设计AI的决策方案筛选阈值。在引入和设计了spearman相关系数、优序比较系数、灵敏性、特异性、准确性及AI最小保留率指标后,使用聚合多个人类主体的评价基准,完成了确立决策方案筛选阈值的过程。本文对一个期刊三组论文的分析表明,利用现有大语言模型的论文初审筛选决策中,通过保留75%的论文,就能够覆盖相关评估基准中的高质量论文,从而使编辑的初审工作量降低约25%,编辑总体工作效率提高约15%,编审全流程效率提高约8%。本文提出的决策方法能够适应不同业务场景以及大语言模型能力的动态演进,具有较强的适用性。

Abstract: The widespread application of the "AI-human" system in management decision-making has sparked discussions on enhancing the synergy of the "AI-human" system to improve management decision-making performance. Although existing research has proposed some methods to improve the synergy of the "AI-human" system in management decision-making, there is still a lack of validation in specific business scenarios. This article chooses the initial screening of journal articles as a scenario and delve into the specific process of sequential AI-Human collaboration to exploring the path to improving the synergy of the "AI-human" system in management decision-making. This article proposes that we can use the scoring results of AI to eliminate some low-quality submitted papers before the editor starts the initial screening work. Then, the editor screens the remaining papers with higher scores. In this process, AI’s filtering effect will reduce the number of papers reviewed by editors. Therefore, we can use the sequential AI-Human collaborative mode to improve the overall efficiency of paper review work. That is to say, the key to this type decision-making lies in designing AI screening thresholds based on an evaluation of AI's screening effectiveness. Firstly, this article introduces Spearman correlation and innovatively designs priority comparison coefficient to evaluate the overall decision-making efficiency of AI. Secondly, by defining the negative and positive states of the screening results, sensitivity, specificity, and accuracy indicators were introduced to evaluate the AI’s decision screening effectiveness; Then, the design of evaluating benchmarks is carried out by aggregating human collective judgments, and it is proposed that the judgments of multiple human experts can be combined, enhanced, and jointly determined to form different evaluation benchmarks. These benchmarks can avoid the limitation of using a single evaluation benchmark that cannot accurately reflect AI’s decision-making ability; Finally, by designing the minimum retention rate as an indicator, the threshold setting method for AI in screening decision was clarified. This indicator not only reflects AI’s screening effectiveness, but also enables the rapid implementation of AI applications in the specific scenario. The analysis of three groups of papers in a journal demonstrates that by retaining 75% of the papers during the initial screening phase using existing large language models, high-quality papers identified in relevant evaluation benchmarks can be effectively covered. This reduces editorial workload by approximately 25%, increases overall editorial efficiency by about 15%, and improves the efficiency of the entire editorial process by roughly 8%. The research is not limited to the level of AI (or LLM) capabilities, but focuses on how to effectively introduce AI into management decision-making on the basis of existing AI capabilities, so as to accelerate the pace of AI implementation and application. The relevant evaluation indicators of AI decision-making capability and their usage logic proposed in this article can not only be applied to decision screening in other scenarios, but also adapt to the dynamic evolution of LLM capability. After significant changes in AI capability, the threshold for AI decision screening can be reanalyzed and calculated, achieving continuous improvement in the synergy level of the "AI-human" systems.

中图分类号: