[READNOTE]

Predicting scientific breakthroughs based on knowledge structure variations

💡 MetaData

Title	Predicting scientific breakthroughs based on knowledge structure variations
Journal	Technological Forecasting and Social Change
Authors	Chao Min; Yi Bu; Jianjun Sun
Pub. date	2021-03-01
DOI	10.1016/j.techfore.2020.120502
JINFO	中科院分区升级版: 管理学1区影响因子: 10.88 5年影响因子: 10.403 EI: 是 SSCI: Q1 AJG: 3 FMS: B JCI: 2.41

**Abstract

**Breakthrough research plays an essential role in the advancement of the scientific system. The identification and recognition of scientific breakthroughs is thus of extreme importance. We propose a citing-structure perspective for observing the unfolding of breakthrough research from variations in knowledge structure. The hypothesis is empirically validated that scientific breakthroughs show distinctive knowledge structure characteristics, which are further utilized to predict breakthroughs in their early stage of formation. These characteristics include average clustering coefficient, average degree, maximum closeness centrality, and maximum eigenvector centrality in the direct citing networks of a breakthrough publication. Several explanations are provided for the effectiveness of the predictive models. We also show that: (1) the number of direct citation counts is of low predictive power, with even a negative impact on prediction performance; (2) disciplinary differences exist in knowledge structure, and this should be taken into account; (3) breakthrough characteristics are most prominent in the first layer of citing networks; (4) timing is critical, and 2- to 3-year-old citing networks have greater predictive power.

📜 研究概况

问题：

使用引文网络结构特征预测突破论文

现状：

突破无统一定义
突破研究一般都是定量的研究
基于引用量的研究不足够，国王研究探究了其他视角，或使用引文网络代替引用数本身

路径：

诺贝尔奖论文作为突破论文样本，其余部分作为对照；
分析网络指标差异，随后预测

贡献：

科学突破产生对网络结构的影响

📊 研究细节

网络指标、节点数、边数、平均度、网络密度、平均聚类系数、最大中介中心度、最大接近中心度、最大特征向量中心度、组元数
116诺奖论文，同年同期刊引用数近似的论文，构建引文网络
按照诺贝尔奖项类型分学科（医学、化学、物理、经济）分析结构指标，呈现出不同；
诺奖组和对照组对比，认为边数（NoE）、平均聚类系数（ACC）、最大接近中心度（MCC）、平均度（AD）能有效区别两组
基于以上四个结构特征与是否诺奖构建回归；
- 经济&自科（三组合并）：经济学不显著；自科ACC/AD显著(0.01)、MEC显著(0.1)
- 第一年观测期和更长观测期的显著程度不同，第二年起显著的关系较多
构建预测模型：二分类；五折；AUC评价
- 自科-2年 max auc 0.675（特征：
  
  “ACC, AD, MCC, MEC”
  
  ）
- 自科-3年 max auc 0.652（特征：
  
  “ACC, AD, MCC, MEC”
  
  ）
- 经济-2年 0.695/-3年 0.630

🚩 主要结论

[comment]

更激进的突破将导致科学知识结构的更大变化
[comment]

利用结构特征，我们能够预测一个科学作品在未来成为突破的潜力
[comment]

突破性引用网络的结构特征是“范式转换”的体现；突破论文引起“淘金热”

📌 创新启示

🔬 展望思考

📜 原文摘录

[comment]

突破通常会动摇其相关研究领域的现状，甚至确立新的科学范式
[comment]

尽管生理学家和医学科学家比经济学家更频繁地创建和引用知识，但他们构建的知识网络不如经济学家所建立的那样密集
[comment]

就论文自我中心引用网络，自然科学知识引用和流动更为频繁，而经济科学的知识流动相对较慢。
[comment]

在突破形成的早期阶段，直接引用计数无法有效地区分诺贝尔奖得主和控制组