ARTICLE
TITLE

Applying a probabilistic algorithm to spam filtering

Olga V. Okhlupina    
Dmitry S. Murashko    

Abstract

Among common anti-spam methods, a probabilistic machine learning algorithm based on Bayes' theorem holds a special place. The so-called "naive" Bayes classifier assigns a document to the class with the maximum a posteriori probability. Despite advances in machine learning methods, the Bayesian algorithm remains relevant and is still widely used for many tasks, including spam detection. Its main advantages are simplicity, fast training, fairly high accuracy, and reliability. This paper addresses the problem of identifying spam messages with a probabilistic machine learning algorithm. It gives the mathematical justification of the Bayesian algorithm and implements it on a concrete example in Python.
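As a rough illustration of the maximum a posteriori rule mentioned in the abstract, the sketch below trains a small multinomial naive Bayes classifier and labels a message with the class whose log posterior is largest. It is not the paper's code: the toy messages, the whitespace tokenizer, the add-one (Laplace) smoothing, and the names `tokenize`, `train`, `log_posteriors`, and `classify` are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization is enough for a toy example.
    return text.lower().split()


def train(messages):
    """Count documents per class and word occurrences per class.

    messages is a list of (text, label) pairs.
    """
    doc_counts = Counter()
    word_counts = {}
    vocab = set()
    for text, label in messages:
        doc_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for word in tokenize(text):
            counts[word] += 1
            vocab.add(word)
    return doc_counts, word_counts, vocab


def log_posteriors(text, model):
    """Return the unnormalized log posterior log P(c) + sum log P(w | c) per class."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label, n_docs in doc_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # words never seen in training are ignored
            # Add-one smoothing keeps unseen (word, class) pairs from zeroing the product.
            likelihood = (word_counts[label][word] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        scores[label] = score
    return scores


def classify(text, model):
    # Maximum a posteriori decision: pick the class with the highest score.
    scores = log_posteriors(text, model)
    return max(scores, key=scores.get)


# Hypothetical toy training set, invented for this illustration.
TOY_MESSAGES = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday", "ham"),
]

model = train(TOY_MESSAGES)
print(classify("free money", model))
print(classify("monday meeting", model))
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied, and the "naive" part is the assumption that words are conditionally independent given the class.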
