What are fit and overfit

In statistics, goodness of fit refers to how closely a model's predicted values match the observed (true) values.
Overfit: a model that has learned the noise instead of the signal is considered "overfit" because it fits the training dataset well but fits new data poorly.
Underfitting occurs when a model is too simple, either informed by too few features or regularized too much, which makes it inflexible in learning from the dataset.
Simple learners tend to have less variance in their predictions but more bias toward wrong outcomes. Complex learners, on the other hand, tend to have more variance in their predictions. Both bias and variance are forms of prediction error in machine learning. Typically, we can reduce error from bias but might increase error from variance as a result, or vice versa.
This trade-off between too simple (high bias) and too complex (high variance) is a key concept in statistics and machine learning, and one that affects all supervised learning algorithms.
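To make the trade-off concrete, here is a minimal sketch that estimates bias and variance empirically. Everything in it is an illustrative assumption rather than part of the original notes: the signal is sin(x), the noise is Gaussian, the "simple" learner always predicts the training mean, and the "complex" learner is a 1-nearest-neighbor rule. Each learner is retrained on many freshly sampled training sets, and we look at how its prediction at one fixed point x0 behaves across those runs.

```python
import math
import random
import statistics


def true_signal(x):
    # Assumed ground-truth signal for this toy example.
    return math.sin(x)


def sample_training_set(rng, n=30, noise_sd=0.3):
    # Draw n noisy observations of the signal on [0, 2*pi].
    xs = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    ys = [true_signal(x) + rng.gauss(0.0, noise_sd) for x in xs]
    return xs, ys


def predict_mean(xs, ys, x0):
    # Very simple learner: ignores x0 and predicts the training mean.
    return statistics.mean(ys)


def predict_1nn(xs, ys, x0):
    # Very flexible learner: copies the label of the closest training point.
    nearest = min(range(len(xs)), key=lambda i: abs(xs[i] - x0))
    return ys[nearest]


def bias_sq_and_variance(predict, x0, trials=500, seed=0):
    # Retrain on many independent training sets and collect predictions at x0.
    rng = random.Random(seed)
    preds = []
    for _ in range(trials):
        xs, ys = sample_training_set(rng)
        preds.append(predict(xs, ys, x0))
    bias = statistics.mean(preds) - true_signal(x0)
    return bias ** 2, statistics.pvariance(preds)


x0 = 1.5
simple_bias2, simple_var = bias_sq_and_variance(predict_mean, x0)
complex_bias2, complex_var = bias_sq_and_variance(predict_1nn, x0)

print(f"simple  learner: bias^2={simple_bias2:.4f}, variance={simple_var:.4f}")
print(f"complex learner: bias^2={complex_bias2:.4f}, variance={complex_var:.4f}")
```

Under these assumptions the mean predictor should show the larger squared bias and the nearest-neighbor rule the larger variance, which is the pattern described above. The exact numbers depend on the seed, the noise level, and the choice of x0.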
How to detect overfitting

5.1 We can split our initial dataset into separate training and test subsets. If the model performs much better on the training subset than on the test subset, it is likely overfitting.
5.2 Another tip is to start with a very simple model to serve as a benchmark. Then, as you try more complex algorithms, you'll have a reference point to see if the additional complexity is worth it.
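The two tips above can be sketched together in a few lines of plain Python. This is only an illustration under stated assumptions, not the method of any particular library: the data are synthetic (a noisy straight line), the benchmark is a one-feature least-squares line, and the "more complex" model is a 1-nearest-neighbor rule that memorizes the training set. The helper names `fit_line`, `fit_1nn`, and `mse` are made up for this example.

```python
import random


def mse(model, xs, ys):
    # Mean squared error of a prediction function on a dataset.
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def fit_line(xs, ys):
    # Simple benchmark: ordinary least squares with a single feature.
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_1nn(xs, ys):
    # More complex model: memorizes the training data (1-nearest neighbor).
    def predict(x):
        nearest = min(range(len(xs)), key=lambda i: abs(xs[i] - x))
        return ys[nearest]
    return predict


# Synthetic data (assumed for illustration): y = 2x + 1 plus Gaussian noise.
rng = random.Random(42)
n = 1000
xs = [rng.random() for _ in range(n)]
ys = [2.0 * x + 1.0 + rng.gauss(0.0, 1.0) for x in xs]

# Tip 5.1: shuffle, then hold out 30% of the data as a test subset.
indices = list(range(n))
rng.shuffle(indices)
cut = int(0.7 * n)
train_x = [xs[i] for i in indices[:cut]]
train_y = [ys[i] for i in indices[:cut]]
test_x = [xs[i] for i in indices[cut:]]
test_y = [ys[i] for i in indices[cut:]]

# Tip 5.2: fit the simple benchmark first, then the more complex model.
baseline = fit_line(train_x, train_y)
knn = fit_1nn(train_x, train_y)

baseline_train = mse(baseline, train_x, train_y)
baseline_test = mse(baseline, test_x, test_y)
knn_train = mse(knn, train_x, train_y)
knn_test = mse(knn, test_x, test_y)

print(f"baseline: train MSE={baseline_train:.3f}, test MSE={baseline_test:.3f}")
print(f"1-NN:     train MSE={knn_train:.3f}, test MSE={knn_test:.3f}")
```

Two things are worth reading off the output. A large gap between training and test error for the nearest-neighbor model is the overfitting signal from tip 5.1. Comparing its test error with the benchmark's test error answers the question in tip 5.2: on this noisy, nearly linear data, the extra complexity should not pay off.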


Original article: https://www.cnblogs.com/qiulinzhang/p/9513414.html
