|
255 | 255 | "errors made during the data collection process (besides not measuring the\n", |
256 | 256 | "unobserved input feature).\n", |
257 | 257 | "\n", |
258 | | - "One extreme case could happen if there where samples in the dataset with\n", |
259 | | - "exactly the same input feature values but different values for the target\n", |
260 | | - "variable. That is very unlikely in real life settings, but could the case if\n", |
261 | | - "all features are categorical or if the numerical features were discretized\n", |
262 | | - "or rounded up naively. In our example, we can imagine two houses having\n", |
263 | | - "the exact same features in our dataset, but having different prices because\n", |
264 | | - "of the (unmeasured) seller's rush.\n", |
265 | | - "\n", |
266 | | - "Apart from these extreme case, it's hard to know for sure what should qualify\n", |
267 | | - "or not as noise and which kind of \"noise\" as introduced above is dominating.\n", |
268 | | - "But in practice, the best ways to make our predictive models robust to noise\n", |
269 | | - "are to avoid overfitting models by:\n", |
| 258 | + "One extreme case could happen if there where samples in the dataset with exactly\n", |
| 259 | + "the same input feature values but different values for the target variable. That\n", |
| 260 | + "is very unlikely in real life settings, but could be the case if all features\n", |
| 261 | + "are categorical or if the numerical features were discretized or rounded up\n", |
| 262 | + "naively. In our example, we can imagine two houses having the exact same\n", |
| 263 | + "features in our dataset, but having different prices because of the (unmeasured)\n", |
| 264 | + "seller's rush.\n", |
| 265 | + "\n", |
| 266 | + "Apart from this extreme case, it's hard to know for sure what should qualify or\n", |
| 267 | + "not as noise and which kind of \"noise\" as introduced above is dominating. But in\n", |
| 268 | + "practice, the best way to make our predictive models robust to noise is to\n", |
| 269 | + "avoid overfitting models by:\n", |
270 | 270 | "\n", |
271 | 271 | "- selecting models that are simple enough or with tuned hyper-parameters as\n", |
272 | 272 | " explained in this module;\n", |
|
0 commit comments