Please use this identifier to cite or link to this item: https://repository.seku.ac.ke/handle/123456789/7537
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMokhosi, Refuoe-
dc.contributor.authorShikali, Casper S.-
dc.contributor.authorSethobane, Matello-
dc.date.accessioned2024-04-05T07:44:33Z-
dc.date.available2024-04-05T07:44:33Z-
dc.date.issued2024-03-27-
dc.identifier.citationData in Brief, 54, 110371 27 March 2024en_US
dc.identifier.urihttps://www.sciencedirect.com/science/article/pii/S2352340924003408-
dc.identifier.urihttp://repository.seku.ac.ke/xmlui/handle/123456789/7537-
dc.descriptionhttps://doi.org/10.1016/j.dib.2024.110371en_US
dc.description.abstractSentiment Analysis (SA) is a subset of Natural Language Processing (NLP) which has become a promising research area enabling the provision of language specific services. Although research in high resource languages such as English and Chinese has achieved promising results, research in low resource African languages such as Sesotho is still in its infancy due to limited text and speech datasets. This study contributes in this regard by availing the Sesotho News (SN) dataset, as an annotated dataset for the SA and Aspect Based Sentiment Analysis (ABSA) tasks. This dataset may be used for NLP research to benefit 1.85 million Sesotho speakers in Lesotho and 11.5 million speakers in South Africa. The dataset includes 4651 headlines for the ABSA task and 2401 headlines for the SA task using Lesotho's orthography of Sesotho. The news headlines were collected from Sesotho online newspapers and then annotated for the ABSA and SA tasks. The Spearman's correlation and Cohen's Kappa Index metrics show that there is good correlation between the annotators, implying that the SN dataset is of gold standard.en_US
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.subjectSesotho dataseten_US
dc.subjectNews headlinesen_US
dc.subjectSentiment analysisen_US
dc.subjectAspect based sentiment analysisen_US
dc.subjectNatural language processingen_US
dc.subjectMachine learningen_US
dc.titleA Sesotho news headlines dataset for sentiment analysisen_US
dc.typeArticleen_US
Appears in Collections:School of Science and Computing (JA)

Files in This Item:
File Description SizeFormat 
Mokhosi_A Sesotho news headlines dataset for sentiment analysis.pdfAbstract3.72 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.