主管:中国科学院
主办:中国优选法统筹法与经济数学研究会
   中国科学院科技战略咨询研究院

中国管理科学 ›› 2021, Vol. 29 ›› Issue (3): 90-99.doi: 10.16381/j.cnki.issn1003-207x.2019.1841

• 论文 • 上一篇    下一篇

重要数据跨境流动背景下风险路径的识别与分级

李金1,2, 申苏浩1, 孙晓蕾2, 邢潇3   

  1. 1. 西安电子科技大学经济与管理学院, 陕西 西安 710071;
    2. 中国科学院科技战略咨询研究院, 北京 100190;
    3. 国家计算机网络应急技术处理协调中心, 北京 100029
  • 收稿日期:2019-11-14 修回日期:2020-03-04 发布日期:2021-04-02
  • 通讯作者: 邢潇(1989-),男(蒙古族),河北承德人,国家计算机网络应急技术处理协调中心,工程师,博士,研究方向:数据安全,E-mail:xingxiao@cert.org.cn. E-mail:xingxiao@cert.org.cn
  • 作者简介:邢潇(1989-),男(蒙古族),河北承德人,国家计算机网络应急技术处理协调中心,工程师,博士,研究方向:数据安全,E-mail:xingxiao@cert.org.cn.
  • 基金资助:
    国家自然科学基金资助项目(71901169);国家重点研发计划资助项目(2017YFC1201202);中国博士后科学基金资助项目(2019M650035)

Identification and Classification for Risk Paths in the Context of Cross-Border Important Data Flow

LI Jin1,2, SHEN Su-hao1, SUN Xiao-lei2, XING Xiao3   

  1. 1. School of Economics and Management, Xidian University, Xi'an 710071, China;
    2. Institute of Science and Development, Chinese Academy of Sciences, Beijing 100190, China;
    3. National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China
  • Received:2019-11-14 Revised:2020-03-04 Published:2021-04-02

摘要: 重要数据的跨境流动引发了数据安全、国家安全等风险挑战。风险路径的识别和分级是对重要数据跨境流动进行预警管理的重要内容。本文基于复杂网络中的二分网络模型,对重要数据的跨境流动进行研究。首先,通过重要数据跨境流动的二分网络和关联网络识别风险路径;其次,构建基于网络结构和接收节点属性的目标风险路径方法以计算其风险值;最后,对我国某重要行业跨境流动的数据开展实证分析,验证算法的有效性和精准度。本文旨在为重要数据跨境流动的预警管理提供量化方法,有效预防重要数据跨境流动带来的风险,提升我国数据治理能力。

关键词: 重要数据, 跨境流动, 路径识别, 风险管理, 预警管理

Abstract: With the development of information technologies, such as artificial intelligence, big data, and cloud computing, massive data areexplosively produced and collected. The global economy and collaboration have also initiated a large scale of cross-border data flow.The data transmission potentially raises risks and challenges for data security and national security. The identification and classification of risk paths work as an important component in early warning management for the cross-border flow of important data. However, previous researches focus more on the regulation and policy suggestions. There are few researches on the risk management from a quantitative perspective, and relatively few on the early warning management for illegal transmission of important data.
Based on the complex network theory, the cross-border flow of important data are studied. First, the binary network model, including two types of nodes for data senders and receivers, is employed to simulate the cross-border data flow network. Second, the associated network is established by the common neighbor structure in the binary network. Meanwhile, the associated network can also reflect the data flow mechanism based on its transmission tendency. Third,the risk paths for important data flows across borders can be identified through the constructed binary network and its associated network. The destination risk path(DRP) algorithm, incorporating the network structure, node attribute, and data transmission frequency, is also designed to calculate the risk value for each risk path.
By collecting the cross-border data flow from an important industry in China, empirical analyses are conducted to detect the performance of proposed methods. Risk paths are empirically identified and risk values are obtained through the above methods. Using AUC as the criterion, the comparison results indicate that our proposed DRP algorithm performs better in link prediction than those algorithms in previous literatures, such as common neighbors, Jaccard, Sørensen, and potential link prediction, etc. Furthermore, the risk classification is also provided towards an efficient data flow monitoring and management. Considering the effects of network size and node attribute, a series of robustness checks are also conducted to support the main findings.
This paper focuses on the risk management issues emerging in the cross-border flow of important data. The methodology framework proposed in this paper can be widely used by different important industries,and benefits regulatory authorities to accurately identify and classify the potential risks existing in the cross-border data flow. A quantitative method is provided for the early warning management, effectively reducing the related risk, and furtherly improving the data governance capacity.

Key words: important data, cross-border flow, risk identification, risk management, early warning management

中图分类号: