Anomaly detection problems have many different facets, and the detection techniques can be highly influenced by the way we define anomalies, type of input data, expected output, etc. This leads to wide variations in problem formulations, which need to be addressed through different analytical approaches. At present, there is a fairly rich variety of R software packages supporting anomaly detection tasks within various disciplinary contexts using different analytical techniques. Some of these use an approach to anomaly detection based on a forecast distribution. We locate over 75 R packages with anomaly detection capabilities via a comprehensive online search. We first present a structured and comprehensive discussion on the functionality and capability of these publicly available R packages for anomaly detection. Despite the large number of packages available, there are some anomaly detection challenges that are not supported with existing packages. We reduce this gap by introducing three new R packages for anomaly detection, oddstream, oddwater and stray, with special reference to their capabilities, competitive features and target applications. Package oddstream introduces a framework that provides early detection of anomalous behaviours within a large collection of streaming time series. This includes a novel approach that adapts to non-stationarity in the time series. Package oddwater provides a framework for early detection of outliers in water-quality data from in situ sensors caused by technical issues. Package stray provides a framework to detect anomalies in high dimensional data. Using various synthetic and real datasets, we demonstrate the wide applicability and usefulness of our proposed frameworks.
Impressive work by @pridiltal on anomaly detection in streaming data, demonstrated on several applications: https://t.co/1nVBgHFgKX #useR2019 pic.twitter.com/7eSv0ilf68
— Tim Radtke (@TimRadtke) July 10, 2019
Very interesting presentations on forecasting, outliers, smoothing, features and time series in tidy formats by @mitchoharawild, @iSvetunkov, @robjhyndman, @pridiltal and @thiyangt at the #useR2019 pic.twitter.com/OBAujqXEYg
— Fotios Petropoulos (@fotpetr) July 12, 2019
Great #user2019 talk by @pridiltal ????????????
— Emi Tanaka ???? (@statsgen) September 8, 2019
"Anomaly detection in R"
Very interesting application!
I didn't realise that those slides were not beamer till the end! (the tourrr animation should have given it away)
???? https://t.co/qjmsCGhdee
???? https://t.co/0kYkTWXotD pic.twitter.com/pbfJ9nO7KA