Micro- and macro-averages (of whatever metric) compute slightly different things, and thus their interpretations differ.
A macro-average computes the metric independently for each class and then takes the average (hence treating all classes equally), whereas a micro-average aggregates the contributions of all classes to compute the average metric. In a multi-class classification setup, a micro-average is preferable if you suspect there might be class imbalance (i.e. you may have many more examples of one class than of other classes).
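Concretely, for a metric $M$ that is computed from per-class counts (precision from the per-class true and false positives $TP_c$ and $FP_c$, say), the two averages over classes $c = 1, \dots, C$ are:

$$M_{\text{macro}} = \frac{1}{C}\sum_{c=1}^{C} M(TP_c, FP_c) \qquad\qquad M_{\text{micro}} = M\!\left(\sum_{c=1}^{C} TP_c,\; \sum_{c=1}^{C} FP_c\right)$$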
To illustrate why, take for example precision $\mathrm{Pr} = \frac{TP}{TP + FP}$. Let's imagine you have a One-vs-All (there is only one correct class output per example) multi-class classification system with four classes and the following numbers when tested:
- Class A: 1 TP and 1 FP
- Class B: 10 TP and 90 FP
- Class C: 1 TP and 1 FP
- Class D: 1 TP and 1 FP
You can easily see that $\mathrm{Pr}_A = \mathrm{Pr}_C = \mathrm{Pr}_D = 0.5$, whereas $\mathrm{Pr}_B = 0.1$.
- A macro-average will then compute: $\mathrm{Pr} = \frac{0.5 + 0.1 + 0.5 + 0.5}{4} = 0.4$
- A micro-average will compute: $\mathrm{Pr} = \frac{1 + 10 + 1 + 1}{2 + 100 + 2 + 2} = \frac{13}{106} \approx 0.123$ (both computations are reproduced in the sketch below)
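If you prefer to see the two averages in code, here is a minimal Python sketch; the dictionaries simply mirror the counts above:

```python
# Per-class true/false positive counts from the example above.
tp = {"A": 1, "B": 10, "C": 1, "D": 1}
fp = {"A": 1, "B": 90, "C": 1, "D": 1}

# Macro: compute precision per class, then average the per-class values,
# so every class carries the same weight regardless of its size.
macro = sum(tp[c] / (tp[c] + fp[c]) for c in tp) / len(tp)

# Micro: pool the counts across classes first, then compute one precision,
# so every individual prediction carries the same weight.
micro = sum(tp.values()) / (sum(tp.values()) + sum(fp.values()))

print(f"macro precision: {macro:.3f}")  # macro precision: 0.400
print(f"micro precision: {micro:.3f}")  # micro precision: 0.123
```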
These are quite different values for precision. Intuitively, in the macro-average the "good" precision (0.5) of classes A, C and D contributes to maintaining a "decent" overall precision (0.4). While this is technically true (across classes, the average precision is indeed 0.4), it is a bit misleading, since a large number of examples are not properly classified. These examples predominantly correspond to class B, which contributes only 1/4 towards the average in spite of accounting for 94.3% of your test data (100 of the 106 predictions). The micro-average adequately captures this class imbalance and brings the overall precision down to 0.123, much more in line with the precision (0.1) of the dominating class B.
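For completeness, if you are using scikit-learn, `precision_score` exposes both behaviours through its `average` parameter. The sketch below rebuilds label vectors consistent with the counts above; which true class sits behind each false positive is an arbitrary assumption here (it affects recall, but not precision):

```python
from sklearn.metrics import precision_score

y_true, y_pred = [], []
# True positives: the example's class is predicted correctly.
for cls, n in [("A", 1), ("B", 10), ("C", 1), ("D", 1)]:
    y_true += [cls] * n
    y_pred += [cls] * n
# False positives: cls is predicted for an example of some other class
# (which other class is chosen here is arbitrary as far as precision goes).
for cls, n in [("A", 1), ("B", 90), ("C", 1), ("D", 1)]:
    y_true += ["B" if cls != "B" else "A"] * n
    y_pred += [cls] * n

print(f"{precision_score(y_true, y_pred, average='macro'):.3f}")  # 0.400
print(f"{precision_score(y_true, y_pred, average='micro'):.3f}")  # 0.123
```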