Implementasi Ekstraksi Fitur untuk Pengelompokan Dokumen Proposal Menggunakan Algoritma NaÃ¯ve Bayes

Dini Nurmalasari; Heri  Ribut Yuliantoro

doi:10.35143/jkt.v8i1.5351

Submitted

14 April 2022

Accepted

26 June 2022

Published

26 June 2022

Download

PDF (Bahasa Indonesia)

Statistic

Read Counter : 1104 Download : 590

Downloads

Download data is not yet available.

Abstract

Text mining is the process of discovering new, previously unknown information from several text documents. Text mining can be applied to the fields of information extraction, topic tracking, document summarization, document categorization or grouping, concept linking or question answering systems. One thing that is often done in implementing text mining is information extraction. Information extraction aims to extract information from unstructured documents into structured data, with the aim of making it easier to analyze the data. In this study, feature extraction will be used to extract features from the Community Service document, using the Frequent Itemset Mining (FIM) algorithm. The features taken are PKM Title, Abstract, Year of Service, Location, and research topic. After obtaining the features, the service topics will be grouped using the Naive Bayes algorithm. The results of this study were tested using a confusion matrix, with an accuracy of 70%. Factors that affect the accuracy results include the amount of training data, the distribution of training data, and the optimization of the algorithm used

Keywords

Text Mining Ektraksi Fitur Algoritma Naive Bayes

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Copyright info for authors

1. Authors hold the copyright in any process, procedure, or article described in the work and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.

2. Authors retain publishing rights to re-use all or portion of the work in different work but can not granting third-party requests for reprinting and republishing the work.

3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) as it can lead to productive exchanges, as well as earlier and greater citation of published work.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

How to Cite

Nurmalasari, D., & Ribut Yuliantoro, H. . (2022). Implementasi Ekstraksi Fitur untuk Pengelompokan Dokumen Proposal Menggunakan Algoritma NaÃ¯ve Bayes. Jurnal Komputer Terapan, 8(1), 194–203. https://doi.org/10.35143/jkt.v8i1.5351

Download Citation

References

Ardanu, F., Himawan, H., & P, D. B. (2013). Pemanfaatan Teknologi Data Mining Dalam Menentukan Efektifitas Penyebaran Brosur.
Dharmayanti, D., Bachtiar, A. M., & Heryandi, A. (2013). Pemodelan Data Warehouse,
(2), 151â€“168.
Fadilah, U., Winarno, W. W., Amborowati, A., Fadilah, U., Winarno, W. W., & Amborowati, A. (2016). Perancangan Data Warehouse Untuk Sistem Akademik STMIK Kadiri Data Warehouse System Design For Academic STMIK Kadiri, 6(2), 217â€“228.
Ilmiah, J., Komputa, I., Volume, E., Issn, F., Cv, D. I., Anugerah, K., â€¦ Bandung, U. (2016a). PEMBANGUNAN PERANGKAT LUNAK DATA WAREHOUSE Jurnal Ilmiah Komputer Dan Informatika ( KOMPUTA ), 1.
Ilmiah, J., Komputa, I., Volume, E., Issn, F., Cv, D. I., Anugerah, K., â€¦ Bandung, U. (2016b). PEMBANGUNAN PERANGKAT LUNAK DATA WAREHOUSE Jurnal Ilmiah Komputer Dan Informatika ( KOMPUTA ). Ok
Mulyati, S., Amini, S., & Juliasari, N. (2014). 104-279-1-PB.Pdf. Jurnal Telematika MKOM, 6 No.1.
Ponniah, P. (2001). Data Warehouse Fundamentals: A Comprehensive Guide For IT Professional.J.Wiley. New York.
M. Ainiyah, D. Nurmalasari, And W. Nengsih, â€œVisualisasi Data Teks Food Reviews Menggunakan Frequent Itemset Mining,â€ J. Aksara Komput. Terap., Vol. 6, No. 2, 2017.
L. Tanjaya, A. Wibowo, And D. Nurmalasari, â€œSistem Pengelompokan E-Journal Berdasarkan Abstrak Menggunakan Text Mining Dan K-Means Clustering,â€ J. Aksara Komput. Terap., Vol. 5, No. 1, 2016
J. Han, J. Pei, And M. Kamber, Data Mining: Concepts And Techniques. Elsevier, 2011.
R. Feldman And J. Sanger, The Text Mining Handbook: Advanced Approaches In Analyzing Unstructured Data. Cambridge University Press, 2007.
E. Muningsih, H. M. Nur, F. F. D. Imaniawan, V. R. Handayani, And F. Endiarto, â€œComparative Analysis On Dimension Reduction Algorithm Of Principal Component Analysis And Singular Value Decomposition For Clustering,â€ In Journal Of Physics: Conference Series, 2020, Vol. 1641, No. 1, P. 012101.
A. Sukma, B. Zaman, And E. Purwanti, â€œInformation Retrieval Document Classification With K-Nearest Neighbor,â€ Rec. Libr. J., Vol. 1, No. 2, Pp. 129â€“138, 2015.
A. N. Asyfa, D. Nurmalasari, And R. P. Sari, â€œIdentifikasi Kinerja Perusahaan Berdasarkan Laporan Keuangan Menggunakan Algoritma K-NN,â€ J. Aksara Komput. Terap., Vol. 5, No. 1, 2016.
S. H. Myaeng, K. S. Han, And H. C. Rim, â€œSome Effective Techniques For Naive Bayes Text Classification,â€ IEEE Trans. Knowl. Data Eng., Vol. 18, No. 11, Pp. 1457â€“1466, 2006
W. Zhang And F. Gao, â€œPerformance Analysis And Improvement Of NaÃ¯ve Bayes In Text Classification Application,â€ 2013 IEEE Conf. Anthol. Nthol. 2013, Pp. 1â€“4, 2013.
M. A. Fauzi, S. Gosario, A. Z. Arifin, And I. S. Prabowo, â€œKlasifikasi Berita Berbahasa Indonesia Menggunakan Seleksi Fitur Dua Tahap Dan Naive Bayes,â€ SYSTEMIC, Vol. 03, No. 02, Pp. 7â€“12, 2017.
Nurhuda, F., Widya Sihwi, S. Dan Doewes, A. (2016) â€œAnalisis Sentimen Masyarakat Terhadap Calon Presiden Indonesia 2014 Berdasarkan Opini Dari Twitter Menggunakan Metode Naive Bayes Classifier,â€ Jurnal Teknologi & Informasi Itsmart, 2(2), Hal. 35

References

Ardanu, F., Himawan, H., & P, D. B. (2013). Pemanfaatan Teknologi Data Mining Dalam Menentukan Efektifitas Penyebaran Brosur.

Dharmayanti, D., Bachtiar, A. M., & Heryandi, A. (2013). Pemodelan Data Warehouse,

(2), 151â€“168.

Fadilah, U., Winarno, W. W., Amborowati, A., Fadilah, U., Winarno, W. W., & Amborowati, A. (2016). Perancangan Data Warehouse Untuk Sistem Akademik STMIK Kadiri Data Warehouse System Design For Academic STMIK Kadiri, 6(2), 217â€“228.

Ilmiah, J., Komputa, I., Volume, E., Issn, F., Cv, D. I., Anugerah, K., â€¦ Bandung, U. (2016a). PEMBANGUNAN PERANGKAT LUNAK DATA WAREHOUSE Jurnal Ilmiah Komputer Dan Informatika ( KOMPUTA ), 1.

Ilmiah, J., Komputa, I., Volume, E., Issn, F., Cv, D. I., Anugerah, K., â€¦ Bandung, U. (2016b). PEMBANGUNAN PERANGKAT LUNAK DATA WAREHOUSE Jurnal Ilmiah Komputer Dan Informatika ( KOMPUTA ). Ok

Mulyati, S., Amini, S., & Juliasari, N. (2014). 104-279-1-PB.Pdf. Jurnal Telematika MKOM, 6 No.1.

Ponniah, P. (2001). Data Warehouse Fundamentals: A Comprehensive Guide For IT Professional.J.Wiley. New York.

M. Ainiyah, D. Nurmalasari, And W. Nengsih, â€œVisualisasi Data Teks Food Reviews Menggunakan Frequent Itemset Mining,â€ J. Aksara Komput. Terap., Vol. 6, No. 2, 2017.

L. Tanjaya, A. Wibowo, And D. Nurmalasari, â€œSistem Pengelompokan E-Journal Berdasarkan Abstrak Menggunakan Text Mining Dan K-Means Clustering,â€ J. Aksara Komput. Terap., Vol. 5, No. 1, 2016

J. Han, J. Pei, And M. Kamber, Data Mining: Concepts And Techniques. Elsevier, 2011.

R. Feldman And J. Sanger, The Text Mining Handbook: Advanced Approaches In Analyzing Unstructured Data. Cambridge University Press, 2007.

E. Muningsih, H. M. Nur, F. F. D. Imaniawan, V. R. Handayani, And F. Endiarto, â€œComparative Analysis On Dimension Reduction Algorithm Of Principal Component Analysis And Singular Value Decomposition For Clustering,â€ In Journal Of Physics: Conference Series, 2020, Vol. 1641, No. 1, P. 012101.

A. Sukma, B. Zaman, And E. Purwanti, â€œInformation Retrieval Document Classification With K-Nearest Neighbor,â€ Rec. Libr. J., Vol. 1, No. 2, Pp. 129â€“138, 2015.

A. N. Asyfa, D. Nurmalasari, And R. P. Sari, â€œIdentifikasi Kinerja Perusahaan Berdasarkan Laporan Keuangan Menggunakan Algoritma K-NN,â€ J. Aksara Komput. Terap., Vol. 5, No. 1, 2016.

S. H. Myaeng, K. S. Han, And H. C. Rim, â€œSome Effective Techniques For Naive Bayes Text Classification,â€ IEEE Trans. Knowl. Data Eng., Vol. 18, No. 11, Pp. 1457â€“1466, 2006

W. Zhang And F. Gao, â€œPerformance Analysis And Improvement Of NaÃ¯ve Bayes In Text Classification Application,â€ 2013 IEEE Conf. Anthol. Nthol. 2013, Pp. 1â€“4, 2013.

M. A. Fauzi, S. Gosario, A. Z. Arifin, And I. S. Prabowo, â€œKlasifikasi Berita Berbahasa Indonesia Menggunakan Seleksi Fitur Dua Tahap Dan Naive Bayes,â€ SYSTEMIC, Vol. 03, No. 02, Pp. 7â€“12, 2017.

Nurhuda, F., Widya Sihwi, S. Dan Doewes, A. (2016) â€œAnalisis Sentimen Masyarakat Terhadap Calon Presiden Indonesia 2014 Berdasarkan Opini Dari Twitter Menggunakan Metode Naive Bayes Classifier,â€ Jurnal Teknologi & Informasi Itsmart, 2(2), Hal. 35

Implementasi Ekstraksi Fitur untuk Pengelompokan Dokumen Proposal Menggunakan Algoritma NaÃ¯ve Bayes

Article Sidebar

Downloads

Main Article Content

Abstract

Keywords

Article Details

Copyright info for authors

References

References

Most read articles by the same author(s)