Fake News Research Paper
MACHINE LEARNING
SUJITHA G
Department of Computer Science & Engineering
Government College of Engineering, Dharmapuri
ABSTRACT
Long Short-Term Memory (LSTM) networks have performed well in sentiment analysis tasks. The usual approach is to
use LSTMs to combine word embeddings into a text representation. However, word embeddings carry more semantic
information than sentiment information, so using word embeddings alone to represent words is inadequate for
sentiment analysis tasks. To solve this problem, we propose a lexicon-enhanced LSTM model. The model first uses a
sentiment lexicon as additional information to pre-train a word sentiment classifier, and then obtains sentiment
embeddings for words, including words that are not in the lexicon. Combining a word's sentiment embedding with its
word embedding makes the word representation more accurate. In addition, we define a new method to find the
attention vector in general sentiment analysis without a target, which can improve the LSTM's ability to capture
global sentiment information. The results of experiments on English and Chinese datasets show that our model
achieves comparable or better results than existing models.
3. PROPOSED SYSTEM
In the proposed system, we propose a lexicon-enhanced LSTM model (LE-LSTM) that integrates a sentiment lexicon
into the LSTM to obtain more sentiment information about words. By using the sentiment lexicon to train a word
sentiment classifier, we can obtain the sentiment embedding of each word. The concatenation of each word's
embedding and its sentiment embedding, used as the LSTM's input, can perform well in sentiment analysis tasks.

The proposed system is sub-divided into the following modules:
1. Fake News Spotter UI
2. Preparation and Exploration of the FN Dataset
2.1. Import dataset
2.2. Read dataset
2.3. Explore datasets
3. Classification of Fake News
3.1. Preprocessing

Preprocessing is a text data standardization process used to obtain consistent results. By default, it removes
non-alphanumeric characters from the text. Some techniques commonly used in NLP are also applied:

Stop words: We removed from the samples the words that are generally agreed not to contribute to the learning
process behind the problem; for example, articles and prepositions.

Stemming: This technique was used to reduce words to their roots.

Tokenization and padding: As usual in text processing tasks, we perform tokenization and, when necessary,
padding for the representation of words and sentences.

Figure 4.1: Training and Validation accuracy
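
The preprocessing steps of module 3.1 (removal of non-alphanumeric characters, stop-word removal, stemming, tokenization and padding) can be illustrated with a minimal sketch. It is not the implementation used in this work. The stop-word list, the suffix-stripping rule (a crude stand-in for a real stemmer such as Porter's), the example headlines, and the names clean, stem, preprocess, build_vocab and encode_and_pad are all assumptions introduced only for illustration.

```python
import re

# Assumption: a tiny illustrative stop-word list; the paper does not give its list.
STOP_WORDS = {"a", "an", "the", "in", "on", "of", "to", "and", "is", "are"}

# Assumption: a crude suffix stripper standing in for a real stemmer (e.g. Porter).
SUFFIXES = ("ing", "ed", "es", "s")


def clean(text):
    """Lowercase the text and replace non-alphanumeric characters with spaces."""
    return re.sub(r"[^a-z0-9\s]", " ", text.lower())


def stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Clean, tokenize on whitespace, drop stop words, then stem each token."""
    return [stem(tok) for tok in clean(text).split() if tok not in STOP_WORDS]


def build_vocab(token_lists):
    """Map each token to a positive integer id; id 0 is reserved for padding."""
    vocab = {}
    for tokens in token_lists:
        for tok in tokens:
            vocab.setdefault(tok, len(vocab) + 1)
    return vocab


def encode_and_pad(tokens, vocab, max_len, pad_id=0):
    """Convert tokens to ids, truncate to max_len, and right-pad with pad_id."""
    ids = [vocab[tok] for tok in tokens if tok in vocab][:max_len]
    return ids + [pad_id] * (max_len - len(ids))


# Hypothetical example headlines, not taken from the FN dataset.
headlines = [
    "Scientists discovered water on the Moon!",
    "The Moon is made of cheese, experts claimed.",
]
tokenized = [preprocess(h) for h in headlines]
vocab = build_vocab(tokenized)
padded = [encode_and_pad(t, vocab, max_len=6) for t in tokenized]
```

After padding, every headline is an integer sequence of the same length, which is the form an embedding layer followed by an LSTM typically expects. A real pipeline would likely replace the toy stemmer and stop-word list with those of an established NLP library.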