Selecting training documents for better learning

Abdul Mohsen Algarni

doi:10.4172/2153-0602.C1.003

Awards Nomination 20+ Million Readerbase

Google Scholar citation report

Citations : 1498

Journal of Data Mining in Genomics & Proteomics received 1498 citations as per Google Scholar report

Journal of Data Mining in Genomics & Proteomics peer review process verified at publons

25+ Million Website Visitors

Indexed In

Academic Journals Database
Open J Gate
Genamics JournalSeek
JournalTOCs
ResearchBible
Ulrich's Periodicals Directory
Electronic Journals Library
RefSeek
Hamdard University
EBSCO A-Z
OCLC- WorldCat
Scholarsteer
SWB online catalog
Virtual Library of Biology (vifabio)
Publons
MIAR
Geneva Foundation for Medical Education and Research
Euro Pub
Google Scholar

Useful Links

Share This Page

Journal Flyer

Open Access Journals

Selecting training documents for better learning

2^nd International Conference on Big Data Analysis and Data Mining

November 30-December 01, 2015 San Antonio, USA

Abdul Mohsen Algarni

King Khalid University, Saudi Arabia

Posters-Accepted Abstracts: J Data Mining In Genomics & Proteomics

Abstract:

In general, there are two types of feedback documents: Positive feedback documents and negative feedback documents. Term based approaches can extract many features in text documents, but most include noise. It is clear that all feedback documents contain some noise knowledge that affects the quality of the extracted features. The amount of noise is different from one document to another. Therefore, reducing the noise data in the training documents would help to reduce noise in the extracted features. Moreover, we believe that removing some training documents (documents that contain more noise data than useful data) can help to improve the effectiveness of a classifier. Based on that observation, we found that short documents are more important than long documents. Testing that idea, we found that using the advantages of short training documents to improve the quality of extracted features can give a promising result. Moreover, we found that not all training documents are useful for training the classifier.

Biography :

Abdul Mohsen Algarni received a PhD degree in the Faculty of Information Technology at Queensland University of Technology, Brisbane, Australia in 2011. He is currently an Assistant Professor in the Department of Computer Science, King Khalid University. His research interests include web intelligence, data mining, text intelligence, information retrieval, and information systems.

Email: a.algarni@kku.edu.sa

PDF HTML

Journal of Data Mining in Genomics & Proteomics

PMC/PubMed Indexed Articles

Google Scholar citation report

Citations : 1498

Journal of Data Mining in Genomics & Proteomics peer review process verified at publons

25+ Million Website Visitors

Indexed In

Useful Links

Share This Page

Journal Flyer

Open Access Journals

Selecting training documents for better learning

2nd International Conference on Big Data Analysis and Data Mining

November 30-December 01, 2015 San Antonio, USA

Abstract:

Biography :

2^nd International Conference on Big Data Analysis and Data Mining