A Data-Centric Architecture for Insider Threat Indicators: From Binary Risk Instances to Governed Security Knowledge Bases

Hanwen Zhao; Mei Lin; Qiang Xu

doi:10.63646/datamind.2025.030402

Open Access PDF

Published 2025-12-30

Hanwen Zhao

School of Information Management, Yunnan Normal University, Kunming 650500, China

Mei Lin*

School of Computer Science and Engineering, Guangxi University of Science and Technology, Liuzhou 545006, China
mei.lin@gxust.edu.cn

Qiang Xu

School of Management Science and Engineering, Hunan University of Technology and Business, Changsha 410205, China

DOI: https://doi.org/10.63646/datamind.2025.030402

Abstract

Insider threat prevention requires more than event detection after a violation has already occurred. Organizations need a governed data architecture that converts scattered behavioral, organizational, and technical observations into interpretable early-warning indicators before misuse becomes an incident. This article develops a data-centric architecture for insider threat indicators that transforms binary risk instances into a governed security knowledge base. The proposed framework includes six layers: indicator vocabulary design, binary evidence capture, provenance-aware storage, data quality validation, entropy-informed scoring, and governance-oriented knowledge services. A synthetic enterprise dataset is used to demonstrate how the architecture supports indicator traceability, quality control, risk score computation, and decision documentation. Results show that governance controls improve average indicator quality from 0.66 to 0.89, reduce missing high-criticality evidence by 58%, and identify risk concentration in work-context and data-protection indicators before a simulated breach pathway becomes operationally visible. The study contributes to data-driven AI and computational discovery by reframing insider threat measurement as a knowledge-base construction problem rather than a single predictive modeling task. The framework offers practical guidance for security teams, data stewards, and enterprise risk managers seeking transparent, auditable, and reusable insider-risk analytics.

Keywords: insider threat indicators; security knowledge base; binary risk instances; data governance; information entropy; risk analytics; data-centric architecture

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Zhao, H., Lin, M., & Xu, Q. (2025). A Data-Centric Architecture for Insider Threat Indicators: From Binary Risk Instances to Governed Security Knowledge Bases. DATAMIND, 3(4), 5-27. https://doi.org/10.63646/datamind.2025.030402

Download Citation

Article sidebar

Main article

Abstract

Article details

How to Cite