Precision-Aware Workload Analytics for AI Systems: A Unified Monitoring Framework Across PyTorch and Scikit-learn Pipelines
Abstract
The computational cost of modern machine-learning (ML) and deep-learning (DL) workloads has become a first-class concern in applied AI research, particularly as workloads grow in size and as hardware diversity widens. Conventional efficiency indicators such as wall-clock time, joules consumed, or CO2-equivalent emissions depend strongly on the physical machine on which an experiment is executed, which makes reproducible comparison across laboratories and across hardware generations difficult. This paper develops a unified monitoring framework that measures computational workload at two complementary levels: the algorithmic level, captured by floating-point-operation (FLOP) counts, and the hardware level, captured by bit-operation (BOP) counts that incorporate operand precision. The framework is implemented as a hardware-agnostic, backend-pluggable Python pipeline that intercepts operations dynamically in PyTorch through the dispatcher layer and wraps estimator methods analytically in Scikit-learn. Using a structured evaluation across three canonical workloads (a fully connected classifier on tabular data, a convolutional model on image data, and a small transformer on text), we show that FLOP counts alone systematically misrepresent the efficiency benefits of quantization, whereas BOP counts provide a more faithful view of hardware-level effort. Aggregation over training and inference phases, combined with precision-aware scaling, yields a reproducible efficiency fingerprint that is stable across CPU and GPU backends to within a narrow interval. The framework preserves the structure of existing experimental pipelines and adds only a thin supervisory layer. The contribution is not a single tool but an analytics pattern that connects algorithmic complexity, numerical precision, and practitioner workflow into one coherent monitoring surface.
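
To make the dispatcher-level interception concrete, the sketch below shows one minimal way such a counter can be realized with PyTorch's public TorchDispatchMode hook. The OpCounter class, its restriction to plain 2-D matrix multiplies, and the BOP weighting (each multiply-accumulate scaled by the product of the two operand bit-widths, a convention common in the quantization literature) are illustrative assumptions for this sketch, not the framework's actual implementation.

```python
import torch
from torch.utils._python_dispatch import TorchDispatchMode

class OpCounter(TorchDispatchMode):
    """Counts FLOPs and precision-scaled BOPs for 2-D matrix multiplies.

    BOP convention used here (an illustrative assumption, not necessarily
    the paper's exact definition): each multiply-accumulate (MAC) is
    weighted by the product of the two operand bit-widths.
    """

    def __init__(self):
        super().__init__()
        self.flops = 0
        self.bops = 0

    def __torch_dispatch__(self, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        out = func(*args, **kwargs)  # execute the op beneath this mode
        if func is torch.ops.aten.mm.default:  # (m, k) @ (k, n)
            a, b = args[0], args[1]
            m, k = a.shape
            n = b.shape[1]
            macs = m * k * n
            self.flops += 2 * macs  # one multiply + one add per MAC
            bits_a = a.element_size() * 8  # operand bit-width from dtype
            bits_b = b.element_size() * 8
            self.bops += macs * bits_a * bits_b
        return out

if __name__ == "__main__":
    with OpCounter() as counter:
        x = torch.randn(8, 16)
        w = torch.randn(16, 4)
        _ = x @ w  # a 2-D matmul dispatches to aten.mm
    print(f"FLOPs = {counter.flops}, BOPs = {counter.bops}")
```

Because the mode observes every aten-level operation, the same pattern extends to convolutions, attention, and other kernels by adding per-operator shape rules; lowering the operand dtype (e.g. float32 to int8) leaves the FLOP count unchanged while shrinking the BOP count, which is the distinction the abstract draws.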
