This document discusses data science tools. It defines data science as combining computer science, statistics, predictive analytics, and deep learning to extract, process, and analyze structured or unstructured data. Examples of popular data science tools mentioned include R, MATLAB, Python, and Spark. R was designed for statistical operations and is used widely in organizations for data analysis. MATLAB is a numerical computing environment invented for students and used in various fields like machine learning and signal processing. Spark is used for big data workloads and offers APIs in Java, Python, and R. Excel is also highlighted as a powerful tool for data science tasks like calculations, data processing, visualization, and complex formula creation.
Download as PPTX, PDF, TXT or read online on Scribd
Download as pptx, pdf, or txt
0 ratings0% found this document useful (0 votes)
44 views7 pages
1.5 - Matlab
This document discusses data science tools. It defines data science as combining computer science, statistics, predictive analytics, and deep learning to extract, process, and analyze structured or unstructured data. Examples of popular data science tools mentioned include R, MATLAB, Python, and Spark. R was designed for statistical operations and is used widely in organizations for data analysis. MATLAB is a numerical computing environment invented for students and used in various fields like machine learning and signal processing. Spark is used for big data workloads and offers APIs in Java, Python, and R. Excel is also highlighted as a powerful tool for data science tasks like calculations, data processing, visualization, and complex formula creation.
Download as PPTX, PDF, TXT or read online on Scribd
Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1/ 7
1.
5 DATA SCIENCE TOOLS
1.5.1 WHAT IS DATA SCIENCE TOOLS
11.5.2 EXAMPLES OF DATA SCIENCE TOOLS
1.5.1 WHAT IS DATA SCIENCE TOOLS
DEFINITION: Combining computer science, statistics, predictive analytics, and
deep learning, it is used to drill down into complex data by extracting, processing, and analysing structured or unstructured data to efficiently generate meaningful information.
The purpose of using current and advanced technologies is to make
data science faster, deeper, and more productive by blending together hundreds of algorithms that allow you to organize and clean the data
there are numerous data science
tools and techniques that provide scientists with an easier, more digestible workflow and powerful results. 1..5.2 TYPE OF DATA SICENCE TOOLS
✔ DEVELOPED AT NORTH CAROLINA ✔ A MULTI-PARADIGN NUMERICAL
STATE UNIVERSITY COMPUTING ENVIRONMENT FOR ✔ DESIGNED FOR STATISTICAL PROCESSING MATHEMATICAL OPERATIONS INFORMATION ✔ USED BY LARGE ORGANIZATIONS ✔ INVENTED BY MATHEMATICIAN AND TO ANALYSE DATA COMPUTER PROGRAMMER CLEVE MOLER ✔ OFFERS NUMEROUS STATISTICAL FOR HIS STUDENT LIBRARIES AND TOOLS ✔ MANY ENGINEERS AND SCIENTISTS USE ✔ HIGHLY EXPENSIVE AND HAS THIS FOR LEARNING AND MACHINE STRONG SUPPORT FROM LEARNING,SIGNAL PROCESSING AND COMPANY COMMUNICATIONS,CONTROL SYSTEM AND ETC ✔ USED FOR BIG DATA DESIGNED TO HANDLE BATCH AND STREAM WORKLOADS PROCESSING ✔ OFFERS VARIOUS API’S ⮚ BATCH PROCESSING THAT ARE ANALYSIS HAPPENS ON A SET OF DATA PROGRAMMABLE IN WHICH HAVE BEEN STORED OVER A PERIOD OF JAVA,PYTHON AND R TIME ✔ ADVANTAGES: EX: BILLING SYSTEM (i) FAST: RUN FAST ⮚ STREAM PROCESSING ANALYTIC QUERIES HAPPENS WHEN A DATA FLOWS THROUGH A AGAINST DATA OF ANY SYSTEM.DATA WILL ANALYSED AND ACTIONS SIZE WILL BE TAKEN WITHIN SHORT PERIOD OF TIME (II)DEVELOPER FRIENDLY: SUPPORT JAVA,PYTHON ✔ Powerful tool for data science ✔ Mostly for spreadsheet calculations and today used for data processing,visualization and complex calculations ✔ Can create own functions and formula ✔ It is also an ideal choice for creating powerful data visualizations and spreadsheet THANK YOU ;)