0% found this document useful (0 votes)
723 views

Informatics Practices Project Synopsis Title: Imdb Movie Analysis System

This document provides a synopsis for an IMDb movie analysis project created by Aadya Kishore, a class 12 student. The project analyzes movie data from IMDb using Python. Key features include a user-friendly interface, fast access to the IMDb movie database, and data visualization using graphs. The project aims to simplify movie selection for viewers by analyzing metrics like vote count, ratings, and languages. Python modules like Pandas, NumPy, and Matplotlib are used to manipulate the IMDb movie data and CSV files are used to store the data.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
723 views

Informatics Practices Project Synopsis Title: Imdb Movie Analysis System

This document provides a synopsis for an IMDb movie analysis project created by Aadya Kishore, a class 12 student. The project analyzes movie data from IMDb using Python. Key features include a user-friendly interface, fast access to the IMDb movie database, and data visualization using graphs. The project aims to simplify movie selection for viewers by analyzing metrics like vote count, ratings, and languages. Python modules like Pandas, NumPy, and Matplotlib are used to manipulate the IMDb movie data and CSV files are used to store the data.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 24

INFORMATICS PRACTICES

PROJECT SYNOPSIS
TITLE: IMDb MOVIE
ANALYSIS SYSTEM
NAME – AADYA KISHORE

CLASS - XII-C

ROLL NO.- 18

YEAR - 2021-2022

TEACHER’S NAME - MS. SIMRAT KAUR

TEAM MEMBERS’ NAMES:-

1. AADYA KISHORE
2. JAI BANSAL
3. ANUJ KUMAR
INDEX
S NO. PARTICULARS

1 CERTIFICATE

2 ACKNOWLEDGEMENT

3 INTRODUCTION

4 OBJECTIVE

5 FEATURES OF THE PROJECT

6 PYTHON MODULES & BUILT IN FUNCTIONS

7 PYTHON AS FRONT END

8 CSV AS BACK END

9 SOURCE CODE

10 OUTPUT

11 BIBLIOGRAPHY
CERTIFICATE
This is to certify that AADYA KIHORE student
of St. Xavier’s School class Xll C has
successfully completed the Informatics
Practices Project titled:

ANALYSIS AND VISUALISATION OF MOVIES


BASED ON IMDb RATINGS

Along with her group members Anuj Kumar


and Jai Bansal under the guidance of
Ms. Simrat Kaur (I.P. teacher)
during the year 2021-22.

-------------------------
Teacher’s signature
ACKNOWLEDGEMENT
I would like to express my special thanks to
our school St. Xavier’s School, Principal Fr.
Anil Kispotta, S.J. , my I.P. teacher Ms. Simrat
Kaur and to the management team of our
school who gave me the golden opportunity
to do this wonderful project on the topic -
IMDB MOVIE ANALYSIS SYSTEM which also
helped me in doing a lot of research and I
came to know about so many new things.
Secondly, I would also like to thank my
parents and friends who helped me a lot in
finishing this project within the limited time.
INTRODUCTION
IMDb is an acronym for Internet Movie
Database. It is an online database of
information related to films, television
programs, home videos, video games, and
streaming content online – including cast,
production crew and personal biographies, plot
summaries, trivia, ratings, and fan and critical
reviews. Originally a fan-operated website, the
database is now owned and operated by
IMDb.com, Inc., a subsidiary of Amazon.
The movie and talent pages of IMDb are
accessible to all internet users, but a
registration process is necessary to contribute
information to the site. Most data in the
database is provided by volunteer contributors.
The site enables registered users to submit new
material and edits to existing entries. Users with
a proven track record of submitting data are
given instant approval for additions or
corrections to cast, credits, and other
demographics of media product and
personalities.

OBJECTIVE
The objective of this project is to apply
our programming knowledge into real-
world situations and to explore how
programming skills help in developing a
good software.
Most of the people love watching movies
but the major quandary is choosing which
movie to watch…Our project’s major aim
is to simplify the selection of movies so
that viewers can watch the desired movies
without wasting a whole lot of time on
deciding what to watch…
So In our project we have tried to simplify
the selection process by internally
analysing the vote count, vote average,
overview of each movie and the language
in which it is aired the most and by
presenting the final data / output to the
users in an uncomplicated manner i.e. in
the form of data frames and different
graphs.

ADVANTAGES of IMDb
Detailed contact and representation information:
Updated contact info (email, phone number, street address, website)
and representation details (agents, managers, publicists, legal
representatives, production company, etc.) for thousands of
individuals and companies in the entertainment industry.
Please note that it is our top priority to have the most up-to-date and
accurate information available. If you find that information in our
database is out-of-date or inaccurate, please contact us and we'll be
glad to help.
IMDb page management:
Promote yourself on the world’s most popular and authoritative
source for movie, TV and celebrity content. IMDbPro members can
claim their page to select a primary image, add up to 200 images,
upload demo reels, add professional information and additional
credits, keep contact and representation info current, manage client
rosters, and more.
Expanded title database:
Expanded In-Development listings for films, television and web
series, and other projects in pre-production, production, and post-
production status. Depending on the status of the title, listings will
include comprehensive details such as filmmakers, cast, crew,
company credits, distributors, plot summary, synopsis, release
details, filming locations, technical specs, box office, and media.
Casting service:
Talent can browse hundreds of casting notices and apply directly to
roles with their IMDbPro profile. New notices are posted daily and
there are opportunities for feature films, television series, student
films and more. Refine the search with advanced filters and apply
from anywhere, anytime, on almost any device.
Casting Directors and Filmmakers can directly cast talent by posting
and managing breakdowns in the Casting Notices section. In
addition, matches for roles can be discovered by searching our
database of 6 million people and using advanced filters such as
STARmeter, performance skills, physical attributes and more. The
Find Matches feature generates a list of profiles based on selected
criteria and customized list building features make it easy to track
the people who are most interesting.
STARmeter, MOVIEmeter, and COMPANYmeter ranking:
Additional meter details include rank movement and tracking over
time. The information is available in an easy-to-use chart format that
can be filtered based on month, year, frequency, scale, release
events, and more.
Box office information:
We can find extensive box office information for titles, people, and
companies. Searches can be tailored and sorted by country, ranking,
weekend or daily gross revenue, and more. This information is
powered by box office mojo for the most comprehensive and
accurate data yet, and available on IMDb.
Features of the Project

 User friendly interface.


 Time saving.
 Fast and easy access to database.
 No technical errors.
 Easy manipulation of data.
 Quick data analysis with various
options
like sorting data, finding minimum or
maximum value, etc.
 Various graphs can be drawn to know
different relations about the data.
WHY WE CHOSE THIS PROJECT?

- As of June 2021, IMDb had


approximately 8 million titles (including
episodes) and 10.4 million
personalities in its database as well as
83 million registered users. Therefore
large data was available which helped
us through the analysis and
visualisation process.
- Due to avalbility of large data, we could
get numerous areas to analysis.
- We got opportunity to test our skills on
each data type (string, integer, float) as it
contains various data of different data
types.
- It provides a search function, editorial
content, users review and more to help
people find the best content.
APPLICATIONS:
o You can see details of any
movie you want.
o To see which category of
movies have the highest rating.
o To see the average reviews and
vote count of each movie.
o Will help to see the language
of each movie too.
o You can also add, remove or
manipulate the data of the imdb
movie title, ratings, overview, etc.
HARDWARE AND SOFTWARE
REQUIREMENTS

1. Processors: AMD Ryzen 5 3500U with


Radeon Vega Mobile Gfx 2.10 GHz
2. Operating system: Windows 10
3. System type: 64-bit operating system
x-64based processor
4. Python version: 3.9.1 or higher
5. MS word
6. Printer
PYTHON MODULES & BUILT
IN FUNCTIONS

MODULES
 PANDAS - Pandas is a python data analysis
library that provides data structures and
functions for data manipulation and
analysis.
 NUMPY - numpy stands for numerical
python , it is a library consisting of
multidimensional array objects and a
collection of routines for processing those
arrays.
 MATPLOTLIB – matplotlib is a python
library that provides many interfaces and
functionality for 2-D graphics in various
forms.
FUNCTIONS
Functions supported by this program are :
1. read.csv() – this function is used to read
data from a csv file to form a dataframe.

2. head() – head function is used to fetch n


number of
rows from a pandas object.
3. tail() – tail functions returns last n rows
from a
pandas object.
5. int() – the int functions converts the
specified value
into an integer number.
6. input() – the input function presents a
prompt to the
users , gets input from the user and returns
the data
in a format interpreted by python.
7. append() – the append method adds an
item to the
end of the list.
8. sort_values() – sort_values arranges the
value in a
pandas object in ascending or descending
order.
9. sort_index() – sort_index arranges the
index in a
pandas object in ascending or descending
order.
10. drop() – drop function is used to drop
specified
labels from rows and columns.
11. rename() – rename a function is used to
change
name of any row or column individually.
12. min() – min function find out the
minimum value
from a given set of data.
13. max() - max function find out the
maximum value
from a given set of data.
14. mean() – the mean function returns the
computed mean (average) from a set of
values for the requested axis.

15. median() – median function returns the


middle
number from the set of values for the
requested axis.
16. mode() – mode function returns the mode
value
(the value that appears most often ) from a
set of values.
17. groupby() – groupby function allows you
to split
your data in separate groups to perform
computations for better analysis.
18. plot() – plot function is used to plot a line
chart in
matplotlib for the given set of data.
19. bar() - bar function is used to create a bar
graph in
matplotlib for the given set of data.
20. hist() – hist function allows us to create
and plot
histogram from a given sequence of numbers.
21. title() – title function is used to add a title
to your plot.
22. xlabel() – xlabel is used to set labels for
the xaxis.
23. ylabel() - ylabel is used to set labels for
the yaxis.
24. grid() – grid function is used to add grid to
your plot.
25. legend() – legend function adds a legend
to your plot.
26. show() – show function is used in the end
to show a plot as per the given specifications .
DEVELOPMENT TOOLS &
TECHNOLOGIES

PYTHON CSV

Front End Back End


(Interface visible to the user) (Database)
Python as front end
Python is a simple, open-source and object-
oriented coding
language. It is one of the programming
languages that are easy
to learn as it is a dynamic type, high-level, and
interpreted coding language. This is also used
for debugging of errors and motivate for
instant growth of application prototypes and
using it as a language to program with. 
Python language supports a cross-platform
operating system that is used to build
different applications that create a more
convenient environment for the users. There
is some renowned application of this
language like BitTorrent, YouTube, DropBox,
and much more.
Features:
 Python is compact and very easy to use
object oriented language.
 Python is an expressive language – fewer
line of codes and simpler syntax
 It is an interpreted language not a
compiled language
 Python can run equally well on variety of
platforms like windows , LINUX/UNIX , etc.
 Python language is freely available along
with its source Code
 Python has evolved into a powerful,
complete and useful language over these
years.
CSV as back end
CSV stands for Comma Separated Values.
It is a popular way of representing and
storing tabular, column oriented data in a
persistent storage.It is very simple and
contains only text, separated by commas,
or a long text with strings and lines or
some other definite character. It helps
files to be easily transferred and
transformed into any other format that
you need. It is used when data has a strict
tabular structure or to transfer large
database between programs.

Features:
 CSV is human readable and easy to edit
manually
 It is simple to implement and parse
 It is processed by almost all existing
applications
 It provides a straight-forward information
schema
 It is faster to handle
 It is smaller in size
 It is considered to be standard format

We choose csv because:


 CSV files are plain-text files, making them
easier for the website developer to create
 Since they're plain text, they're easier to import
into a spreadsheet or another storage database,
regardless of the specific software you're using
 To better organize large amounts of data
BIBLIOGRAPHY
 Informative practices textbook – NCERT
 Informative practices textbook -Sumita Arora for
class 12
 www.kaggle.com

 www.youtube.com

You might also like