Preprocessing 1

ChuXun
2026-01-31 13:43:21 +08:00
parent 84fb77d060
commit 8f767b45b6
3 changed files with 202 additions and 0 deletions

171
A题/参考/数据集.txt Normal file

@@ -0,0 +1,171 @@
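1. Phone hardware performance and energy consumption
Sample ID
Red channel count (unitless)
Green channel count (unitless)
Blue channel count (unitless)
Screen brightness (unitless)
L21S power domain average energy consumption μW・s
Disk average energy consumption μW・s
S12S average energy consumption μW・s
Camera average energy consumption μW・s
GPU3D average energy consumption μW・s
Sensor average energy consumption μW・s
Memory 1 average energy consumption μW・s
Memory 2 average energy consumption μW・s
Display average energy consumption μW・s
GPS average energy consumption μW・s
GPU average energy consumption μW・s
WLAN average energy consumption μW・s
L22M display average energy consumption μW・s
S6M average energy consumption μW・s
S8M average energy consumption μW・s
S9M average energy consumption μW・s
CPU big-core average energy consumption μW・s
CPU little-core average energy consumption μW・s
CPU mid-core average energy consumption μW・s
Infrastructure average energy consumption μW・s
4G network average energy consumption μW・s
5G network average energy consumption μW・s
Tensor processing unit (TPU) average energy consumption μW・s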
CPU_LITTLE_FREQ_KHz KHz
CPU_MID_FREQ_KHz KHz
CPU_BIG_FREQ_KHz KHz
GPU0_FREQ (unitless)
GPU_1FREQ
GPU_MEM_AVG
BATTERY_DISCHARGE_TOTAL_UA μA
BATTERY_DISCHARGE_RATE_UAS μA・s
TOTAL_DATA_WIFI_BYTES Bytes
AVG_SOC_TEMP (℃)
DIFF_SOC_TEMP (℃)
BATTERY__PERCENT %
L21S_VDD2L_MEM_ENERGY_UW μW
UFS(Disk)_ENERGY_UW μW
S12S_VDD_AUR_ENERGY_UW μW
Camera_ENERGY_UW μW
GPU3D_ENERGY_UW μW
Sensor_ENERGY_UW μW
Memory_ENERGY_UW μW
Memory_ENERGY_UW.1 μW
Display_ENERGY_UW μW
GPS_ENERGY_UW μW
GPU_ENERGY_UW μW
WLANBT_ENERGY_UW μW
L22M_DISP_ENERGY_UW μW
S6M_LLDO1_ENERGY_UW μW
S8M_LLDO2_ENERGY_UW μW
S9M_VDD_CPUCL0_M_ENERGY_UW μW
CPU_BIG_ENERGY_UW μW
CPU_LITTLE_ENERGY_UW μW
CPU_MID_ENERGY_UW μW
INFRASTRUCTURE_ENERGY_UW μW
CELLULAR_ENERGY_UW μW
CELLULAR_ENERGY_UW.1 μW
INFRASTRUCTURE_ENERGY_UW.1 μW
TPU_ENERGY_UW μW
C_ID
C_PL
M_ID
M_PL
2. Lithium battery aging under use (this is a compressed archive)
id
Cumulative charging capacity corresponding to the voltage grid (Ah)
3. Battery charge, discharge, and impedance behavior at different temperatures (compressed archive)
a. Sub-file names:
1. BatteryAgingARC-FY08Q4
Files:
B0005.mat Data for Battery #5
B0006.mat Data for Battery #6
B0007.mat Data for Battery #7
B0018.mat Data for Battery #18
2. BatteryAgingARC_25_26_27_28_P1
Files:
B0025.mat Data for Battery #25
B0026.mat Data for Battery #26
B0027.mat Data for Battery #27
B0028.mat Data for Battery #28
3. BatteryAgingARC_25-44
Files:
B0025.mat Data for Battery #25
B0026.mat Data for Battery #26
B0027.mat Data for Battery #27
B0028.mat Data for Battery #28
Files:
B0029.mat Data for Battery #29
B0030.mat Data for Battery #30
B0031.mat Data for Battery #31
B0032.mat Data for Battery #32
Files:
B0033.mat Data for Battery #33
B0034.mat Data for Battery #34
B0036.mat Data for Battery #36
Files:
B0038.mat Data for Battery #38
B0039.mat Data for Battery #39
B0040.mat Data for Battery #40
Files:
B0041.mat Data for Battery #41
B0042.mat Data for Battery #42
B0043.mat Data for Battery #43
B0044.mat Data for Battery #44
4. BatteryAgingARC_45_46_47_48
Files:
B0045.mat Data for Battery #45
B0046.mat Data for Battery #46
B0047.mat Data for Battery #47
B0048.mat Data for Battery #48
5. BatteryAgingARC_49_50_51_52
Files:
B0049.mat Data for Battery #49
B0050.mat Data for Battery #50
B0051.mat Data for Battery #51
B0052.mat Data for Battery #52
6. BatteryAgingARC_53_54_55_56
Files:
B0053.mat Data for Battery #53
B0054.mat Data for Battery #54
B0055.mat Data for Battery #55
B0056.mat Data for Battery #56
b.Data Structure:
cycle: top level structure array containing the charge, discharge and impedance operations
type: operation type, can be charge, discharge or impedance
ambient_temperature: ambient temperature (degree C)
time: the date and time of the start of the cycle, in MATLAB date vector format
data: data structure containing the measurements
for charge the fields are:
Voltage_measured: Battery terminal voltage (Volts)
Current_measured: Battery output current (Amps)
Temperature_measured: Battery temperature (degree C)
Current_charge: Current measured at charger (Amps)
Voltage_charge: Voltage measured at charger (Volts)
Time: Time vector for the cycle (secs)
for discharge the fields are:
Voltage_measured: Battery terminal voltage (Volts)
Current_measured: Battery output current (Amps)
Temperature_measured: Battery temperature (degree C)
Current_charge: Current measured at load (Amps)
Voltage_charge: Voltage measured at load (Volts)
Time: Time vector for the cycle (secs)
Capacity: Battery capacity (Ahr) for discharge till 2.7V
for impedance the fields are:
Sense_current: Current in sense branch (Amps)
Battery_current: Current in battery branch (Amps)
Current_ratio: Ratio of the above currents
Battery_impedance: Battery impedance (Ohms) computed from raw data
Rectified_impedance: Calibrated and smoothed battery impedance (Ohms)
Re: Estimated electrolyte resistance (Ohms)
Rct: Estimated charge transfer resistance (Ohms)

31
A题/参考/预处理1.md Normal file

@@ -0,0 +1,31 @@
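### II. Six Core Preprocessing Steps
#### 1. Encoding Detection and Data Loading
- The original CSV file is encoded in GB2312 rather than the default UTF-8. Reading it with that encoding keeps field names such as “屏幕亮度” (screen brightness) and “CPU大核平均能量消耗” (CPU big-core average energy consumption) free of garbled characters; 1000 records with 68 fields load successfully.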
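
A minimal sketch of the decoding step, using only the Python standard library. The function name `read_gb2312_csv` and the two-column sample are illustrative assumptions, not part of the original pipeline, which may well have used a dataframe library instead.

```python
import csv
import io


def read_gb2312_csv(raw_bytes):
    """Decode GB2312 bytes and parse them as CSV rows keyed by the header."""
    text = raw_bytes.decode("gb2312")
    return list(csv.DictReader(io.StringIO(text)))


# Tiny in-memory stand-in for the real file (hypothetical values).
# For a file on disk, open(path, encoding="gb2312", newline="") does the same job.
sample_bytes = "样本id,屏幕亮度\n0,50\n1,80\n".encode("gb2312")
rows = read_gb2312_csv(sample_bytes)
print(list(rows[0].keys()))
```

Decoding the same bytes as UTF-8 would fail or produce unreadable header names, which is why the encoding is fixed before anything else touches the data.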
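#### 2. Sample Sorting and Index Normalization
- Sort by the “样本id” (sample ID) field in ascending order (the original IDs are unordered, e.g. [0, 1, 10, 100, ...]) and reset the index so that samples run consecutively in natural order, which keeps later analysis logically coherent.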
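
The order [0, 1, 10, 100, ...] is what a text sort of the IDs produces, so the sketch below assumes the IDs arrive as strings and sorts them numerically. The helper `sort_by_sample_id` and the added `index` key are hypothetical names.

```python
def sort_by_sample_id(rows, id_key="样本id"):
    """Sort rows by numeric sample ID and attach a fresh 0-based index."""
    ordered = sorted(rows, key=lambda row: int(row[id_key]))
    return [dict(row, index=i) for i, row in enumerate(ordered)]


# IDs in the lexicographic order seen in the raw file (illustrative subset).
rows = [{"样本id": s} for s in ["0", "1", "10", "100", "2"]]
ordered = sort_by_sample_id(rows)
print([row["样本id"] for row in ordered])
```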
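#### 3. Selecting the Valid Analysis Window
- Using the sample number as a substitute for the time step, a stable window of 300 samples (samples 234-534) is selected based on how the battery percentage (BATTERY__PERCENT) changes.
- Within this window the battery level falls from 91% to 74% (a drop of 17 percentage points) with no extreme fluctuations, giving the best data quality.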
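
The note does not say how the window boundaries were found, so the sketch below only shows slicing a window and checking its stability. It assumes a half-open range [234, 534), which yields exactly 300 samples. The data is synthetic and the names `select_window`, `max_step_change`, `sample_id`, and `t` are hypothetical.

```python
def select_window(rows, start, stop):
    """Slice rows[start:stop] and attach a fresh 0-based time-step index `t`."""
    return [dict(row, t=i) for i, row in enumerate(rows[start:stop])]


def max_step_change(window, key="BATTERY__PERCENT"):
    """Largest absolute change between consecutive samples in the window."""
    values = [float(row[key]) for row in window]
    return max(abs(b - a) for a, b in zip(values, values[1:]))


# Synthetic stand-in for the sorted table: 1000 rows, slowly draining battery.
rows = [{"sample_id": i, "BATTERY__PERCENT": 100 - 0.05 * i} for i in range(1000)]
window = select_window(rows, 234, 534)
print(len(window), max_step_change(window))
```

A small `max_step_change` is one simple way to express "no extreme fluctuations"; the threshold that counts as small is a judgment call for the real data.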
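#### 4. Cleaning Anomalous Data
- Three kinds of key anomalies are handled:
  1. **String-to-numeric conversion**: the anomalous value “err” in “CPU_MID_FREQ_KHz” is converted to a null and filled by forward fill followed by backward fill;
  2. **Sign correction**: the battery discharge current (BATTERY_DISCHARGE_TOTAL_UA) is negative, which indicates the discharge direction; it is negated to give a positive value (unit: A);
  3. **Unit unification**: the temperature (AVG_SOC_TEMP) is stored in millidegrees Celsius (e.g. 44533 → 44.5 ℃) and is divided by 1000 to give ordinary degrees Celsius.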
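
The three cleaning rules can be sketched as follows. The helper names and sample values are illustrative assumptions; the current is only negated here, with no unit conversion applied.

```python
def to_float_or_none(value):
    """Parse a number; non-numeric markers such as "err" become None."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def fill_forward_backward(values):
    """Fill None with the previous valid value, then any leading None with the next."""
    filled = list(values)
    last = None
    for i, v in enumerate(filled):          # forward pass
        if v is None:
            filled[i] = last
        else:
            last = v
    nxt = None
    for i in range(len(filled) - 1, -1, -1):  # backward pass for leading gaps
        if filled[i] is None:
            filled[i] = nxt
        else:
            nxt = filled[i]
    return filled


# Hypothetical raw values for the three columns named in the text.
mid_freq = fill_forward_backward(
    [to_float_or_none(v) for v in ["err", "1200000", "err", "1500000"]]
)
discharge = [-v for v in [-350000.0, -420000.0]]   # flip sign so discharge is positive
soc_temp_c = [v / 1000 for v in [44533, 45012]]     # millidegrees C -> degrees C
print(mid_freq, discharge, soc_temp_c)
```

The backward pass matters only when the column starts with an invalid value, as the first sample does here.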
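#### 5. Feature Normalization
- Core hardware metrics are normalized to the 0-1 range to make later comparisons easier:
  - Screen brightness: raw value (0-100) ÷ 100;
  - CPU performance: the big-, mid-, and little-core frequencies are each normalized, then combined with weights 0.4 (big), 0.3 (mid), and 0.3 (little) into a “CPU proxy performance value”;
  - WiFi: the difference in transferred data volume is log-transformed and then normalized; a “WiFi active flag” is also generated (1 = data transferred, 0 = none).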
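
A sketch of the three normalizations above, under several assumptions that the note does not confirm: min-max scaling is used for "normalization", TOTAL_DATA_WIFI_BYTES is treated as a cumulative counter, and `log1p` stands in for the logarithm so that a zero difference stays defined. All function names are hypothetical.

```python
import math


def min_max(values):
    """Scale values to [0, 1]; a constant column maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def cpu_proxy(big, mid, little):
    """Weighted sum of per-cluster normalized frequencies (0.4 / 0.3 / 0.3)."""
    nb, nm, nl = min_max(big), min_max(mid), min_max(little)
    return [0.4 * b + 0.3 * m + 0.3 * l for b, m, l in zip(nb, nm, nl)]


def wifi_features(total_bytes):
    """Per-sample byte deltas -> (log-scaled proxy in [0, 1], active flag)."""
    # Assumption: the first sample has no predecessor, so its delta is 0.
    deltas = [0.0] + [max(b - a, 0.0) for a, b in zip(total_bytes, total_bytes[1:])]
    proxy = min_max([math.log1p(d) for d in deltas])
    active = [1 if d > 0 else 0 for d in deltas]
    return proxy, active


brightness = [v / 100 for v in [0, 55, 100]]
cpu = cpu_proxy([1000, 2000, 3000], [500, 500, 1500], [300, 600, 900])
wifi_proxy, wifi_active = wifi_features([100, 100, 1100, 1109])
print(brightness, cpu, wifi_proxy, wifi_active)
```

The log transform compresses the heavy tail of transfer sizes, so a few large bursts do not push every other sample toward zero after scaling.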
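#### 6. Core Field Selection
- From the 68 original fields, 9 key fields are selected to form the final dataset, covering the core dimensions of identification, hardware performance, battery state, and temperature:
  sample ID, time-step index, normalized screen brightness, CPU proxy performance, WiFi proxy metric, WiFi active flag, normalized battery level, discharge current, battery temperature.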
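
Field selection amounts to projecting each row onto a fixed column list. The English column names below are hypothetical stand-ins for the nine fields listed above; the original output names are not given in the note.

```python
# Hypothetical names for the nine retained fields, in the order listed above.
FINAL_COLUMNS = [
    "sample_id", "t", "brightness_norm", "cpu_proxy", "wifi_proxy",
    "wifi_active", "battery_norm", "discharge_current", "battery_temp",
]


def project(rows, columns=FINAL_COLUMNS):
    """Keep only the requested columns; raise if any row lacks one of them."""
    for row in rows:
        missing = [c for c in columns if c not in row]
        if missing:
            raise KeyError(f"missing columns: {missing}")
    return [{c: row[c] for c in columns} for row in rows]


# One illustrative row carrying an extra raw field that should be dropped.
wide_row = {c: 0.0 for c in FINAL_COLUMNS}
wide_row["GPU0_FREQ"] = 123.0
final_rows = project([wide_row])
print(list(final_rows[0].keys()))
```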
---
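### III. Preprocessing Results
The result is a clean dataset of **300 rows × 9 columns** with no missing values, no anomalous values, and normalized core metrics, ready for use in hardware performance and energy consumption correlation analysis, modeling, and similar tasks.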
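
The shape and completeness claims can be checked mechanically. The sketch below is one possible check, not the author's; `validate` is a hypothetical helper and the demo table is deliberately tiny.

```python
import math


def validate(rows, n_rows, n_cols):
    """Return a list of problems; an empty list means the table passed."""
    problems = []
    if len(rows) != n_rows:
        problems.append(f"expected {n_rows} rows, got {len(rows)}")
    for i, row in enumerate(rows):
        if len(row) != n_cols:
            problems.append(f"row {i}: expected {n_cols} columns, got {len(row)}")
        for key, value in row.items():
            is_nan = isinstance(value, float) and math.isnan(value)
            if value is None or is_nan:
                problems.append(f"row {i}: missing value in {key}")
    return problems


# Tiny illustrative tables; the real call would be validate(table, 300, 9).
good = [{"a": 1.0, "b": 2.0}, {"a": 3.0, "b": 4.0}]
bad = [{"a": 1.0, "b": None}]
print(validate(good, 2, 2), validate(bad, 2, 2))
```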