When you look at a pile of data, sometimes one data point is not like the others. It attracts attention because it is different from the rest of the data.
When I spot something odd in a dataset, I wonder if there is something to learn here. Is this an opportunity to make a discovery or improve a process?
All too often it is tempting to remove the outlier as a mistake, or to drop it because it doesn’t make any sense and ‘messes up’ the analysis.