Normal view MARC view ISBD view

Proposed model for context topic identification of english and hindi news article through LDA approach with NLP technique (Record no. 17562)

000 -LEADER
fixed length control field	a
003 - CONTROL NUMBER IDENTIFIER
control field	OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20220920153500.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	220920b xxu\|\|\|\|\| \|\|\|\| 00\| 0 eng d
040 ## - CATALOGING SOURCE
Original cataloging agency	AIKTC-KRRC
Transcribing agency	AIKTC-KRRC
100 ## - MAIN ENTRY--PERSONAL NAME
9 (RLIN)	17968
Author	Srivastav, Anukriti
245 ## - TITLE STATEMENT
Title	Proposed model for context topic identification of english and hindi news article through LDA approach with NLP technique
250 ## - EDITION STATEMENT
Volume, Issue number	Vol.103(2), Apr
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc.	New York
Name of publisher, distributor, etc.	Springer
Year	2022
300 ## - PHYSICAL DESCRIPTION
Pagination	591-597p.
520 ## - SUMMARY, ETC.
Summary, etc.	According to the survey, India has the world's second-largest newspaper market, with more than 100 K newspaper outlets, approx 240 million circulation, and 1300 million subscribers or readers. The topic modeling work is increasing day by day, and researchers have published multiple topic modeling papers and have implemented them in different areas like software engineering, political science and medical, etc. LDA topic modeling is used in this research because it has been introduced successfully for topic modeling and classification and it measures the probability of a text-dependent on the bag-of-words scheme without considering the word series. LDA is a common topic modeling algorithm with excellent implementation in the Gensim Python package. However, the challenge is how to extract good quality topics that are simple, separated, and meaningful. The purpose of this research deals with finding the main topics of the same category news articles which are in two different languages (Hindi and English) and then classifying these different language news topics with similarity measurement. In this research, the corpus is constructed with bigram. To achieve the research goal, we have to first build a headline and link extractor that scrap the top news from Google News feeds for both English and Hindi languages (Google News collects news stories that have appeared on different news website which is already accessible in 35 languages over the last 30 days) and then analyses which two news headlines are similar.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	4642
Topical term or geographic name entry element	Humanities and Applied Sciences
700 ## - ADDED ENTRY--PERSONAL NAME
9 (RLIN)	17970
Co-Author	Singh, Satwinder
773 0# - HOST ITEM ENTRY
International Standard Serial Number	2250-2106
Title	Journal of the institution of engineers (India): Series B
856 ## - ELECTRONIC LOCATION AND ACCESS
URL	https://link.springer.com/article/10.1007/s40031-021-00655-w
Link text	Click here
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type	Articles Abstract Database

Holdings
Withdrawn status	Lost status	Source of classification or shelving scheme	Damaged status	Not for loan	Permanent Location	Current Location	Shelving location	Date acquired	Barcode	Date last seen	Price effective from	Koha item type
					School of Engineering & Technology	School of Engineering & Technology	Archieval Section	2022-09-20	2022-1664	2022-09-20	2022-09-20	Articles Abstract Database

Knowledge Reosurces & Relay Centre | AIKTC-KRRC

Proposed model for context topic identification of english and hindi news article through LDA approach with NLP technique (Record no. 17562)