Enhancing News Article Summarization with Machine Learning

Authors

DOI:

https://doi.org/10.55938/ijgasr.v3i4.152

Keywords:

News Summarization, Machine Learning, Python, Automated Summarization, Feature Extraction

Abstract

The exponential growth of online news content has created a pressing need for automated summarization tools to help process and condense information effectively. This paper presents a machine learning-based approach to summarizing news articles, focusing on techniques that produce concise and coherent summaries. The methodology includes text preprocessing steps such as tokenization, stop-word removal, and stemming, followed by feature extraction and model training using machine learning frameworks. Libraries such as NLTK and TensorFlow are employed to facilitate text processing and the implementation of the summarization model. The proposed approach is evaluated against baseline models, showcasing its ability to generate high-quality summaries efficiently. The research highlights the advantages of machine learning in automating news summarization, saving time and effort for readers and editors. Challenges such as handling nuanced language and context are discussed, and the paper outlines future research directions to address these limitations and further enhance summarization performance. This study contributes to the growing field of automated news summarization by providing a practical, scalable, and effective solution. It underscores the potential of machine learning to revolutionize how news content is consumed and processed, offering valuable insights for advancing this domain.

Downloads

Download data is not yet available.

References

S. Subramanian, B. Tseng, R. Barbieri and E. N. Brown, "Unsupervised Machine Learning Methods for Artifact Removal in Electrodermal Activity," 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Mexico, 2021, pp. 399-402,doi:10.1109/EMBC46164.2021.9630535.

S. Subramanian, B. Tseng, R. Barbieri and E. N. Brown, "Unsupervised Machine Learning Methods for Artifact Removal in Electrodermal Activity," 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Mexico, 2021, pp. 399-402, doi: 10.1109/EMBC46164.2021.9630535. DOI: https://doi.org/10.1109/EMBC46164.2021.9630535

B. Wang and W. Zhang, "Research on Edge Network Topology Optimization Based on Machine Learning," 2023 5th International Conference on Applied Machine Learning (ICAML), Dalian, China, 2023, pp. 41-46, doi: 10.1109/ICAML60083.2023.00018.

B. Wang and W. Zhang, "Research on Edge Network Topology Optimization Based on Machine Learning," 2023 5th International Conference on Applied Machine Learning (ICAML), Dalian, China, 2023, pp. 41-46, doi: 10.1109/ICAML60083.2023.00018. DOI: https://doi.org/10.1109/ICAML60083.2023.00018

W. F. C. Rocha, A. U. Lucena, G. F. Sarmanho, R. C. Félix, S. A. Miqueleti and T. C. Dourado, "Machine learning protocol from ultrasound data for monitoring, predicting, and supporting the analysis of dam slopes," 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Nassau, Bahamas, 2022, pp. 1619-1623, doi: 10.1109/ICMLA55696.2022.00084. DOI: https://doi.org/10.1109/ICMLA55696.2022.00084

N. M. Kailash Varma, S. H. Mattaparty, S. Ismail, J. Thaduri, G. Deep Arora and A. B, "Sentiment Analysis: A Machine Learning Perspective," 2024 First International Conference on Electronics, Communication and Signal Processing (ICECSP), New Delhi, India, 2024, pp. 1-6, doi: 10.1109/ICECSP61809.2024.10698402. DOI: https://doi.org/10.1109/ICECSP61809.2024.10698402

B. -f. Zhang, J. -s. Su and X. Xu, "A Class-Incremental Learning Method for Multi-Class Support Vector Machines in Text Classification," 2006 International Conference on Machine Learning and Cybernetics, Dalian, China, 2006, pp. 2581-2585, doi: 10.1109/ICMLC.2006.258853. DOI: https://doi.org/10.1109/ICMLC.2006.258853

A. Morton, E. Marzban, G. Giannoulis, A. Patel, R. Aparasu and I. A. Kakadiaris, "A Comparison of Supervised Machine Learning Techniques for Predicting Short-Term In-Hospital Length of Stay among Diabetic Patients," 2014 13th International Conference on Machine Learning and Applications, Detroit, MI, USA, 2014, pp. 428-431, doi: 10.1109/ICMLA.2014.76. DOI: https://doi.org/10.1109/ICMLA.2014.76

k

S. C. Mana, G. Kalaiarasi, Y. R, L. S. Helen and R. Senthamil Selvi, "Application of Machine Learning in Healthcare: An Analysis," 2022 3rd International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2022, pp. 1611-1615, doi: 10.1109/ICESC54411.2022.9885296. DOI: https://doi.org/10.1109/ICESC54411.2022.9885296

L. Singelmann, E. Swartz, M. Pearson, R. Striker and E. A. Vazquez, "Design and Development of a Machine Learning Tool for an Innovation-Based Learning MOOC," 2019 IEEE Learning With MOOCS (LWMOOCS), Milwaukee, WI, USA, 2019, pp. 105-109, doi: 10.1109/LWMOOCS47620.2019.8939621 DOI: https://doi.org/10.1109/LWMOOCS47620.2019.8939621

C. Noak and S. Wang, "Analysis of COVID-19 Data Using Advanced Machine Learning Models," 2022 3rd International Conference on Pattern Recognition and Machine Learning (PRML), Chengdu, China, 2022, pp. 430-437, doi: 10.1109/PRML56267.2022.9882191. DOI: https://doi.org/10.1109/PRML56267.2022.9882191

A. A. Gharawi, J. Alsubhi and L. Ramaswamy, "Impact of Labeling Noise on Machine Learning: A Cost-aware Empirical Study," 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Nassau, Bahamas, 2022, pp. 936-939, doi: 10.1109/ICMLA55696.2022.00156. DOI: https://doi.org/10.1109/ICMLA55696.2022.00156

N. Hu, "Classification of Mobile Phone Price Dataset Using Machine Learning Algorithms," 2022 3rd International Conference on Pattern Recognition and Machine Learning (PRML), Chengdu, China, 2022, pp. 438-443, doi: 10.1109/PRML56267.2022.9882236. DOI: https://doi.org/10.1109/PRML56267.2022.9882236

B. Lu and B. K. Tsou, "Combining a large sentiment lexicon and machine learning for subjectivity classification," 2010 International Conference on Machine Learning and Cybernetics, Qingdao, China, 2010, pp. 3311-3316, doi: 10.1109/ICMLC.2010.5580672. DOI: https://doi.org/10.1109/ICMLC.2010.5580672

A. Rawat, H. Maheshwari, M. Khanduja, R. Kumar, M. Memoria and S. Kumar, "Sentiment Analysis of Covid19 Vaccines Tweets Using NLP and Machine Learning Classifiers," 2022 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COM-IT-CON), Faridabad, India, 2022, pp. 225-230, doi: 10.1109/COM-IT-CON54601.2022.9850629. DOI: https://doi.org/10.1109/COM-IT-CON54601.2022.9850629

T. R. N and R. Gupta, "A Survey on Machine Learning Approaches and Its Techniques:," 2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS), Bhopal, India, 2020, pp. 1-6, doi: 10.1109/SCEECS48394.2020.190. DOI: https://doi.org/10.1109/SCEECS48394.2020.190

Y. Shi, Y. Cai and Y. Chen, "A Method of Predicting the Deposited Film Thickness in IC Fabrication Based on Automatic Machine Learning," 2023 6th International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China, 2023, pp. 299-304, doi: 10.1109/ICAIBD57115.2023.10206117. DOI: https://doi.org/10.1109/ICAIBD57115.2023.10206117

A. M. Malik, "Spatial Based Feature Generation for Machine Learning Based Optimization Compilation," 2010 Ninth International Conference on Machine Learning and Applications, Washington, DC, USA, 2010, pp. 925-930, doi: 10.1109/ICMLA.2010.147. DOI: https://doi.org/10.1109/ICMLA.2010.147

I. C. Freeman, A. J. Haigler, S. E. Schmeelk, L. R. Ellrodt and T. L. Fields, "What are they Researching? Examining Industry-Based Doctoral Dissertation Research through the Lens of Machine Learning," 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 2018, pp. 1338-1340, doi: 10.1109/ICMLA.2018.00217. DOI: https://doi.org/10.1109/ICMLA.2018.00217

J. A. Abraham and V. R. Bindu, "Intrusion Detection and Prevention in Networks Using Machine Learning and Deep Learning Approaches: A Review," 2021 International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), Coimbatore, India, 2021, pp. 1-4, doi: 10.1109/ICAECA52838.2021.9675595 DOI: https://doi.org/10.1109/ICAECA52838.2021.9675595

Y. AlShehri and L. Ramaswamy, "SECOE: Alleviating Sensors Failure in Machine Learning-Coupled IoT Systems," 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Nassau, Bahamas, 2022, pp. 743-747, doi: 10.1109/ICMLA55696.2022.00124. DOI: https://doi.org/10.1109/ICMLA55696.2022.00124

B. Thompson, K. O. Elish and R. Steele, "Machine Learning-Based Prediction of Prolonged Length of Stay in Newborns," 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 2018, pp. 1454-1459, doi: 10.1109/ICMLA.2018.00236 DOI: https://doi.org/10.1109/ICMLA.2018.00236

N. Singh, S. Patel and S. Dubey, "An Efficient Machine Learning Technique for Crop Yield Prediction," 2022 IEEE International Conference on Current Development in Engineering and Technology (CCET), Bhopal, India, 2022, pp. 1-4, doi: 10.1109/CCET56606.2022.10079993. DOI: https://doi.org/10.1109/CCET56606.2022.10079993

Published

2025-01-08

How to Cite

Prakash, A. (2025). Enhancing News Article Summarization with Machine Learning. International Journal for Global Academic & Scientific Research, 3(4), 20–34. https://doi.org/10.55938/ijgasr.v3i4.152