The mining task of outlier detection is essential in many expert and intelligent systems exploited in a wide range of applications, from intrusion detection to molecular biology. In some of such applications the ability to process large amounts of data in a very short time can be critical, for instance in intrusion and fraud detection. This paper explores a solution for the optimisation of an exact, unsupervised outlier detection method by avoiding unnecessary computations, and therefore reducing the running time and making the method usable also in settings where response times are crucial. In particular, we enhance the SolvingSet-based approach by using a mechanism that exploits the knowledge learned during the algorithm execution and avoids a large amount of distance computations. We demonstrate the strength of the proposed solution, named FastSolvingSet, through both theoretical and experimental analysis.

Angiulli, F., Basta, S., Lodi, S., Sartori, C. (2020). Reducing distance computations for distance-based outliers. EXPERT SYSTEMS WITH APPLICATIONS, 147, 1-11 [10.1016/j.eswa.2020.113215].

Reducing distance computations for distance-based outliers

Lodi, Stefano
Penultimo
Membro del Collaboration Group
;
Sartori, Claudio
Ultimo
Membro del Collaboration Group
2020

Abstract

The mining task of outlier detection is essential in many expert and intelligent systems exploited in a wide range of applications, from intrusion detection to molecular biology. In some of such applications the ability to process large amounts of data in a very short time can be critical, for instance in intrusion and fraud detection. This paper explores a solution for the optimisation of an exact, unsupervised outlier detection method by avoiding unnecessary computations, and therefore reducing the running time and making the method usable also in settings where response times are crucial. In particular, we enhance the SolvingSet-based approach by using a mechanism that exploits the knowledge learned during the algorithm execution and avoids a large amount of distance computations. We demonstrate the strength of the proposed solution, named FastSolvingSet, through both theoretical and experimental analysis.
2020
Angiulli, F., Basta, S., Lodi, S., Sartori, C. (2020). Reducing distance computations for distance-based outliers. EXPERT SYSTEMS WITH APPLICATIONS, 147, 1-11 [10.1016/j.eswa.2020.113215].
Angiulli, Fabrizio; Basta, Stefano; Lodi, Stefano; Sartori, Claudio
File in questo prodotto:
File Dimensione Formato  
ESWA-D-19-02678_R1-w-ref-to-pub-and-cc.pdf

accesso aperto

Tipo: Postprint
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione 995.58 kB
Formato Adobe PDF
995.58 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/812328
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 13
social impact