Codeless Time Series Analysis with KNIME: A practical guide to implementing forecasting models for time series analysis applications

Ebook, 677 pages, 4 hours

Author: KNIME AG
ISBN: 9781803239972

By KNIME AG, Corey Weisinger, Maarit Widmann and Daniele Tonini

About this ebook

This book will take you on a practical journey, teaching you how to implement solutions for many use cases involving time series analysis techniques.
This learning journey is organized in a crescendo of difficulty, starting from the easiest yet effective techniques applied to weather forecasting, then introducing ARIMA and its variations, moving on to machine learning for audio signal classification, training deep learning architectures to predict glucose levels and electrical energy demand, and ending with an approach to anomaly detection in IoT. There’s no time series analysis book without a solution for stock price predictions and you’ll find this use case at the end of the book, together with a few more demand prediction use cases that rely on the integration of KNIME Analytics Platform and other external tools.
By the end of this time series book, you’ll have learned about popular time series analysis techniques and algorithms, KNIME Analytics Platform, its time series extension, and how to apply both to common use cases.

Language: English

Publisher: Packt Publishing

Release date: Aug 19, 2022

ISBN: 9781803239972



Book preview

BIRMINGHAM—MUMBAI

Codeless Time Series Analysis with KNIME

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Group Product Manager: Reshma Raman

Publishing Product Manager: Reshma Raman

Senior Editor: Nithya Sadanandan

Technical Editor: Pradeep Sahu

Copy Editor: Safis Editing

Project Coordinator: Deeksha Thakkar

Proofreader: Safis Editing

Indexer: Manju Arasan

Production Designer: Prashant Ghare

Marketing Coordinator: Priyanka Mhatre

First published: July 2022

Production reference: 1220722

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-80323-206-5

www.packt.com

Thanks to my colleagues at KNIME for the technical support and encouragement, especially to Andisa Dewi and Tobias Kötter for the Taxi Demand Prediction application, and Rosaria Silipo, Phil Winters, and Iris Adä for the Anomaly Detection application.

– Maarit Widmann

I would like to thank the KNIME team for including me in this great project. Especially thanks to Rosaria Silipo, for her trust and support, and to my co-authors Maarit and Corey, for taking this long journey with me.

– Daniele Tonini

Contributors

About the authors

Corey Weisinger is a data scientist with KNIME in Austin, Texas. He studied mathematics at Michigan State University focusing on actuarial techniques and functional analysis. Before coming to work for KNIME, he worked as an analytics consultant for the auto industry in Detroit, Michigan. He currently focuses on signal processing and numeric prediction techniques and is the author of the From Alteryx to KNIME guidebook.

Maarit Widmann is a data scientist and an educator at KNIME: the instructor behind the KNIME self-paced courses and a teacher of the KNIME courses. She is the author of the From Modeling to Model Evaluation eBook and she publishes regularly on the KNIME blog and on Medium. She holds a master’s degree in data science and a bachelor’s degree in sociology.

Daniele Tonini is an experienced advisor and educator in the field of advanced business analytics and machine learning. In the last 15 years, he designed and deployed predictive analytics systems, and data quality management and dynamic reporting tools, mainly for customer intelligence, risk management, and pricing applications. He is an Academic Fellow at Bocconi University (Department of Decision Science) and SDA Bocconi School of Management (Decision Sciences & Business Analytics Faculty). He’s also an adjunct professor in data mining at Franklin University, Switzerland. He currently teaches statistics, predictive analytics for data-driven decision making, big data and databases, market research, and data mining.

About the reviewers

Miguel Infestas Maderuelo has a Ph.D. in applied economics and has developed his career around data analytics in different fields (digital marketing, data mining, academic research, and so on). His last project is as a founder of a digital marketing agency, applying analytics on digital data to optimize digital communication.

Rosaria Silipo, Ph.D., now head of data science evangelism at KNIME, has spent 25+ years in applied AI, predictive analytics, and machine learning at Siemens, Viseca, Nuance Communications, and private consulting. Sharing her practical experience in a broad range of industries and deployments, including IoT, customer intelligence, financial services, social media, and cybersecurity, Rosaria has authored 50+ technical publications, including her recent books Guide to Intelligent Data Science (Springer) and Codeless Deep Learning with KNIME (Packt).

Table of Contents

Preface

Part 1: Time Series Basics and KNIME Analytics Platform

Chapter 1: Introducing Time Series Analysis

Understanding TSA

Exploring time series properties and examples

Continuous and discrete time series

Independence and serial correlation

Time series examples

TSA goals and applications

Goals of TSA

Domains of applications and use cases

Exploring time series forecasting techniques

Quantitative forecasting properties and techniques

Summary

Questions

Chapter 2: Introduction to KNIME Analytics Platform

Exploring the KNIME software

Introducing KNIME Analytics Platform for creating data science applications

Introducing KNIME Server for productionizing data science applications

Introducing nodes and workflows

Introducing nodes

Introducing workflows

Searching for and sharing resources on the KNIME Hub

Building your first workflow

Creating a new workflow (group)

Reading and transforming data

Filtering rows

Visualizing data

Building a custom interactive view

Documenting workflows

Configuring the time series integration

Introducing the time series components

Configuring Python in KNIME

Summary

Questions

Chapter 3: Preparing Data for Time Series Analysis

Introducing different sources of time series data

Time granularity and time aggregation

Defining time granularity

Finding the right time granularity

Aggregating time series data

Equal spacing and time alignment

Explaining the concept of equal spacing

Missing value imputation

Defining the different types of missing values

Introducing missing value imputation techniques

Summary

Questions

Chapter 4: Time Series Visualization

Technical requirements

Introducing an energy consumption time series

Describing raw energy consumption data

Clustering energy consumption data

Introducing line plots

Displaying simple dynamics with a line plot

Interpreting the dynamics of a time series based on a line plot

Building a line plot in KNIME

Introducing lag plots

Introducing insights derived from a lag plot

Building a lag plot in KNIME

Introducing seasonal plots

Comparing seasonal patterns in a seasonal plot

Building a seasonal plot in KNIME

Introducing box plots

Inspecting variability of data in a box plot

Building a box plot in KNIME

Summary

Questions

Chapter 5: Time Series Components and Statistical Properties

Technical requirements

Trend and seasonality components

Trend

Seasonality

Decomposition

Autocorrelation

Stationarity

Summary

Questions

Part 2: Building and Deploying a Forecasting Model

Chapter 6: Humidity Forecasting with Classical Methods

Technical requirements

The importance of predicting the weather

Other IoT sensors

The use case

Streaming humidity data from an Arduino sensor

What is an Arduino?

Moving data to KNIME

Storing the data to create a training set

Resampling and granularity

Aligning data timestamps

Missing values

Aggregation techniques

Training and deployment

Types of classic models available in KNIME

Training a model in KNIME

Available deployment options

Building the workflow

Writing model predictions to a database

Summary

Questions

Chapter 7: Forecasting the Temperature with ARIMA and SARIMA Models

Recapping regression

Defining a regression

Introducing the (S)ARIMA models

Requirements of the (S)ARIMA model

How to configure the ARIMA or SARIMA model

Fitting the model and generating forecasts

The data

Summary

Further reading

Questions

Chapter 8: Audio Signal Classification with an FFT and a Gradient-Boosted Forest

Technical requirements

Why do we want to classify a signal?

Windowing your data

Windowing your data in KNIME

What is a transform?

The Fourier transform

Discrete Fourier Transform (DFT)

Fast Fourier Transform (FFT)

Applying the Fourier transform in KNIME

Preparing data for modeling

Reducing dimensionality

Training a Gradient Boosted Forest

Applying the Fourier transform in KNIME

Applying the Gradient Boosted Trees Learner

Deploying a Gradient Boosted Forest

Summary

Questions

Chapter 9: Training and Deploying a Neural Network to Predict Glucose Levels

Technical requirements

Glucose prediction and the glucose dataset

Glucose prediction

The glucose dataset

A quick introduction to neural networks

Artificial neurons and artificial neural networks

The backpropagation algorithm

Other types of neural networks

Training a feedforward neural network to predict glucose levels

KNIME Deep Learning Keras Integration

Building the network

Training the network

Scoring the network and creating the output rule

Deploying an FFNN-based alarm system

Summary

Questions

Chapter 10: Predicting Energy Demand with an LSTM Model

Technical requirements

Introducing recurrent neural networks and LSTMs

Recapping recurrent neural networks

The architecture of the LSTM unit

Forget Gate

Input Gate

Output Gate

Encoding and tensors

Input data

Reshaping the data

Training an LSTM-based neural network

The Keras Network Learner node

Deploying an LSTM network for future prediction

Scoring the forecasts

Summary

Questions

Chapter 11: Anomaly Detection – Predicting Failure with No Failure Examples

Technical requirements

Introducing the problem of anomaly detection in predictive maintenance

Introducing the anomaly detection problem

IoT data preprocessing

Exploring anomalies visually

Detecting anomalies with a control chart

Introducing a control chart

Implementing a control chart

Predicting the next sample in a correctly working system with an auto-regressive model

Introducing an auto-regressive model

Training an auto-regressive model with the linear regression algorithm

Deploying an auto-regressive model

Summary

Questions

Part 3: Forecasting on Mixed Platforms

Chapter 12: Predicting Taxi Demand on the Spark Platform

Technical requirements

Predicting taxi demand in NYC

Connecting to the Spark platform and preparing the data

Introducing the Hadoop ecosystem

Accessing the data and loading it into Spark

Introducing the Spark compatible nodes

Training a random forest model on Spark

Exploring seasonalities via line plots and auto-correlation plot

Preprocessing the data

Training and testing the random forest model on Spark

Building the deployment application

Predicting the trip count in the next hour

Predicting the trip count in the next 24 hours

Summary

Questions

Chapter 13: GPU Accelerated Model for Multivariate Forecasting

Technical requirements

From univariate to multivariate – extending the prediction problem

Building and training the multivariate neural architecture

Enabling GPU execution for neural networks

Setting up a new GPU Python environment

Switching Python environments dynamically

Building the deployment application

Summary

Questions

Chapter 14: Combining KNIME and H2O to Predict Stock Prices

Technical requirements

Introducing the stock price prediction problem

Describing the KNIME H2O Machine Learning Integration

Starting a workflow running on the H2O platform

Introducing the H2O nodes for machine learning

Accessing and preparing data within KNIME

Accessing stock market data from Yahoo Finance

Preparing the data for modeling on H2O

Training an H2O model from within KNIME

Optimizing the number of predictor columns

Training, applying, and testing the optimized model

Consuming the H2O model in the deployment application

Summary

Questions

Final note

Answers

Chapter 1

Chapter 2

Chapter 3

Chapter 4

Chapter 5

Chapter 6

Chapter 7

Chapter 8

Chapter 9

Chapter 10

Chapter 11

Chapter 12

Chapter 13

Chapter 14

Other Books You May Enjoy

Preface

This book gives an overview of the basics of time series data and time series analysis and of KNIME Analytics Platform and its time series integration. It shows how to implement practical solutions for a wide range of use cases, from demand prediction to signal classification and signal forecasting, and from price prediction to anomaly detection. It also demonstrates how to integrate other tools in addition to KNIME Analytics Platform within the same application.

The book teaches the common preprocessing steps for time series data, as well as statistics-based and machine learning-based forecasting techniques, all of which are needed to master the field of time series analysis. The book also points you to examples implemented in KNIME Analytics Platform, a visual programming tool that is accessible and fast to learn. This removes the common time and skill barrier of learning to code.

Who this book is for

This book is for data analysts and data scientists who want to develop forecasting applications on time series data. The first part of the book targets beginners in time series analysis by introducing the main concepts of time series analysis and visual exploration and preprocessing of time series data. The subsequent parts of the book challenge both beginners and advanced users by introducing real-world time series analysis applications.

What this book covers

Chapter 1, Introducing Time Series Analysis, explains what a time series is, states some classic time series problems, and introduces the two historical approaches: statistics and machine learning.

Chapter 2, Introduction to KNIME Analytics Platform, explains the basic concepts of KNIME Analytics Platform and its time series integration. This chapter covers installation, an introduction to the platform, and a first workflow example.

Chapter 3, Preparing Data for Time Series Analysis, introduces the common first steps in a time series analysis project. It explores different sources of time series data and shows time alignment, time aggregation, and missing value imputation as common preprocessing steps.

Chapter 4, Time Series Visualization, explores time series visualization. It provides an exploration of the most common visualization techniques to visually represent and display the time series data: from the classic line plot to the lag plot, and from the seasonal plot to the box plot.

Chapter 5, Time Series Components and Statistical Properties, introduces common concepts and measures for descriptive statistics of time series, including the decomposition of a time series, autocorrelation measures and plots, and the stationarity property.

Chapter 6, Humidity Forecasting with Classical Methods, completes a classic time series analysis use case: forecasting. It introduces some simple yet powerful classical methods, which often solve the time series analysis problem quickly without much computational expense.

Chapter 7, Forecasting the Temperature with ARIMA and SARIMA Models, delves into the ARIMA and SARIMA models. It aims at predicting tomorrow’s temperatures with the whole range of ARIMA models: AR, ARMA, ARIMA, and SARIMA.

Chapter 8, Audio Signal Classification with an FFT and a Gradient Boosted Forest, introduces a use case for signal classification. It classifies audio signals with a Gradient Boosted Forest model, using the FFT to transform the raw audio signals before modeling.

Chapter 9, Training and Deploying a Neural Network to Predict Glucose Levels, gives an example of a critical prediction problem: predicting the glucose level for a timely insulin intervention. This chapter also introduces neural networks.

Chapter 10, Predicting Energy Demand with an LSTM Model, introduces recurrent neural networks based on Long Short-Term Memory (LSTM) layers, which are advanced predictors when temporal context is involved. It tests whether a recurrent LSTM-based neural network improves prediction accuracy considerably over an ARIMA model.

Chapter 11, Anomaly Detection – Predicting Failure with No Failure Examples, tackles the problem of anomaly detection in predictive maintenance by introducing approaches that work exclusively on the data from a correctly working system.

Chapter 12, Predicting Taxi Demand on the Spark Platform, implements a solution to the demand prediction problem via a Random Forest to run on a Spark platform in an attempt to make the solution more scalable.

Chapter 13, GPU Accelerated Model for Multivariate Forecasting, extends the demand prediction problem to a multivariate one by taking exogenous time series into account as well, and makes it scalable by training the recurrent neural network on a GPU-enabled machine.

Chapter 14, Combining KNIME and H2O to Predict Stock Prices, describes the integration of KNIME Analytics Platform with H2O, another open source platform, to implement a solution for stock price prediction.

To get the most out of this book

This book will introduce the basics of the open source visual programming tool KNIME Analytics Platform and time series analysis. Basic knowledge of data transformations is assumed, while no coding skills are required thanks to the codeless implementation of the examples. Python installation is required for using the time series integration in KNIME.

The installation of some use case-specific extensions and integrations will be indicated and instructed in the respective chapters. We will introduce KNIME Server for enterprise features in Chapter 2, Introduction to KNIME Analytics Platform, but all practical examples are implemented in the open source KNIME Analytics Platform.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book’s GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Codeless-Time-Series-Analysis-with-KNIME and https://hub.knime.com/knime/spaces/Codeless%20Time%20Series%20Analysis%20with%20KNIME/latest/~GxjXX6WmLi-WjLNx/. If there’s an update to the code, it will be updated in the GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

We also provide a PDF file that has color images of the screenshots and diagrams used in this book. You can download it here: https://packt.link/2RomT.

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: For example, the ../sales.csv workflow relative path reads the sales.csv file located in the same workflow group as the executing workflow.

Bold: Indicates a new term, an important word, or words that you see onscreen. For instance, words in menus or dialog boxes appear in bold. Here is an example: If you want to do that, you will need to unlink it via the component’s context menu by selecting Component | Disconnect Link.

Tips or Important Notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at [email protected] and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at [email protected] with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share Your Thoughts

Once you’ve read Codeless Time Series Analysis with KNIME, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Part 1: Time Series Basics and KNIME Analytics Platform

By the end of this part, you will know what a time series is, how to preprocess, visualize, and explore it, and how to configure and use KNIME Analytics Platform for time series analysis. The following are the chapters included in this part:

Chapter 1, Introducing Time Series Analysis

Chapter 2, Introduction to KNIME Analytics Platform

Chapter 3, Preparing Data for Time Series Analysis

Chapter 4, Time Series Visualization

Chapter 5, Time Series Components and Statistical Properties

Chapter 1: Introducing Time Series Analysis

In this introductory chapter, we’ll examine the concept of time series, explore some examples and case studies, and then understand how Time Series Analysis (TSA) can be useful in different frameworks and applications. Finally, we’ll provide a brief overview of the forecasting models used over the years, highlighting their key features, which will be further explored in the following chapters.

In this chapter, we will cover the following topics:

Understanding TSA and its importance within data analytics

Time series properties and examples

TSA goals and applications

Overview of the main forecasting techniques used over the years

By the end of the chapter, you will have a good understanding of the key aspects of TSA, gaining the foundation to explore the subsequent chapters of the book with greater confidence.

Understanding TSA

When analyzing business data, it’s quite common to focus on what happened at a particular point in time: sales figures at the end of the month, customer characteristics at the end of the year, conversion results at the end of a marketing campaign, and more. Even in the development of the most sophisticated ML models, in most cases, we collect information that refers to different objects at a specific instant in time (or by taking a few snapshots of historical data). This approach, which is absolutely valid and correct for many applications, not only in business, uses cross-sectional data as the basis for analytics: data collected by observing many subjects (such as individuals, companies, shops, countries, equipment, and more) at one point or period of time.

Although ignoring the temporal factor is widespread and rooted in common practice, there are several situations where analyzing the temporal evolution of a phenomenon provides more complete and interesting results. In fact, it's only by analyzing the temporal dynamics of the data that we can identify some peculiar characteristics of the phenomenon under analysis, be it sales or consumption data, a physical parameter, or a macroeconomic index. These characteristics, such as trends, periodic fluctuations, level changes, anomalous observations, and turning points, act over time and can have an effect in the short or long term, so it is often important to be able to measure them precisely. Furthermore, it is only by analyzing data over time that it is possible to provide a reliable quantitative estimate of what might occur in the future (whether immediate or not). Since economic conditions are constantly changing over time, data analysts must be able to assess and predict the effects of these changes in order to suggest the most appropriate actions to take for the future.

For these reasons, TSA can be a very useful tool in the hands of business analysts and data scientists when it comes to both describing the patterns of a phenomenon along the time axis and providing a reliable forecast for it. Through the use of the right tools, TSA can significantly expand the understanding of any variable of interest (typically numerical) such as sales, financial KPIs, logistic metrics, sensors’ measurements, and more. More accurate and less biased forecasts that have been obtained through quantitative TSA can be one of the most effective drivers of performance in many fields and industries.

In the next sections of this chapter, we will provide definitions, examples, and some additional elements to gain a further understanding of how to recognize some key features of time series and how to approach their analyses in a structured way.

Exploring time series properties and examples

A general definition of a time series is the following:

A Time Series is a collection of observations made sequentially through time, whose dynamics are often characterized by short/long period fluctuations and/or long period direction.

This definition highlights two fundamental aspects of a time series: the fact that observations are a function of time and that, as a consequence of this fact, some typical temporal features are often observed. The fluctuations and the long period direction of the series are just some of these features, as there might be other relevant aspects to take into consideration such as autocorrelation, stationarity, and the order of integration. We will explore these aspects in more detail in future chapters. In this section, we will focus on the distinction between discrete time series and continuous time series, on the concept of independence between observations, and finally, we will show some examples of real-world time series.

Continuous and discrete time series

A Time Series is defined as continuous when observations are collected continuously over time, that is, there can be an infinite number of observations in a given time range. Typically, continuous time series data is sampled at irregular time intervals. Consider the measurement of a patient’s blood pressure in a hospital done at varying time points during the day, not equally spaced. This happens because, in some settings, regular monitoring at fixed intervals is not possible. For instance, in Figure 1.1, there are four medical continuous time series, relative to the health parameters of four patients:

Mean blood pressure

Heart rate

Temperature

Glucose data

As evident from the graphs, there are some temporal ranges where the measures are not present, for example, the temperature and glucose between approximately 20 hours and 30 hours of the monitoring period. There are other time points where data is collected more frequently than in other periods. These time series features are due to the fact that the data has been collected manually by the physician or by the nurse, not at fixed moments of the day. Therefore, this type of time series is inherently irregularly sampled:

Figure 1.1 – Four continuous, irregularly sampled, medical time series

A time series is defined as discrete when observations are collected regularly at specific times, typically equally spaced (for example, hourly, daily, weekly, or yearly data points).

A time series of this type can be natively discrete, such as the annual budget data of a company, or it can be created through the aggregation or accumulation of a numerical variable in equal time intervals, for example, the monthly sales of a supermarket or the number of daily passengers in a train station. A continuous time series can be discretized by binning/grouping the original data, which yields a discrete time series.

Classical TSA focuses on discrete time series because they are more common in real-world applications and easier to analyze. Therefore, in this book, we mainly deal with discrete time series, where observations are collected at equal intervals. When we consider irregularly sampled time series, first, we will try to transform them into regularly sampled data points.
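Although the book implements this step with KNIME nodes rather than code, the idea of turning an irregularly sampled series into a regularly sampled one can be sketched in a few lines of Python. The following is only an illustration under stated assumptions: the function name resample_mean, the choice of the mean as the aggregation, and the blood pressure readings are all hypothetical and are not taken from the book.

```python
from datetime import datetime, timedelta
from statistics import mean


def resample_mean(observations, start, step, n_bins):
    """Average irregular (timestamp, value) pairs into n_bins equal-width bins.

    Bins that receive no observation are returned as None, which makes
    the resulting missing values explicit.
    """
    buckets = [[] for _ in range(n_bins)]
    for timestamp, value in observations:
        # Dividing one timedelta by another with // gives an integer bin index.
        index = (timestamp - start) // step
        if 0 <= index < n_bins:
            buckets[index].append(value)
    return [mean(bucket) if bucket else None for bucket in buckets]


# Hypothetical readings collected at irregular times (illustrative values only).
readings = [
    (datetime(2022, 1, 1, 8, 5), 92.0),
    (datetime(2022, 1, 1, 8, 40), 96.0),
    (datetime(2022, 1, 1, 9, 15), 90.0),
    (datetime(2022, 1, 1, 11, 50), 88.0),
]

# Four hourly bins starting at 08:00.
hourly = resample_mean(
    readings, datetime(2022, 1, 1, 8, 0), timedelta(hours=1), 4
)
print(hourly)
```

The hour between 10:00 and 11:00 contains no reading, so its bin is empty. Equal spacing therefore often goes hand in hand with missing value handling, which Chapter 3 covers.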

Independence and serial correlation

One of the most distinctive characteristics of a time series is the mutual dependence between the observations, generally called serial correlation or autocorrelation.

In many statistical models, observations are assumed to be generated by a random sampling process and to be independent of each other (consider the linear regression model). Typically, this assumption turns out to be inconsistent with time series data, where simply collecting the data sequentially, along the time axis, generally produces observations that are not independent of each other.
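One way to make serial correlation concrete is to compute the sample lag-1 autocorrelation, which relates each observation to the one immediately before it. The sketch below is a minimal illustration and not the book's method: the function name lag1_autocorrelation and the two toy series are assumptions chosen for the example.

```python
def lag1_autocorrelation(series):
    """Sample lag-1 autocorrelation of a numeric sequence.

    Computed as the sum of products of neighbouring deviations from the
    mean, divided by the sum of squared deviations. A constant series has
    zero variance and is not handled here.
    """
    n = len(series)
    average = sum(series) / n
    deviations = [x - average for x in series]
    numerator = sum(a * b for a, b in zip(deviations, deviations[1:]))
    denominator = sum(d * d for d in deviations)
    return numerator / denominator


# A steadily increasing series: each value resembles its predecessor.
trend = list(range(10))

# A series that flips sign at every step: each value opposes its predecessor.
alternating = [1, -1] * 5

r_trend = lag1_autocorrelation(trend)
r_alternating = lag1_autocorrelation(alternating)
print(r_trend, r_alternating)
```

The trending series gives a clearly positive value and the alternating series a clearly negative one. Both are far from the value near zero that would be expected if the observations were independent of each other.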

Think of the daily sales of an e-commerce company. It’s

Enjoying the preview?

Page 1 of 1

Codeless Time Series Analysis with KNIME: A practical guide to implementing forecasting models for time series analysis applications

About this ebook

KNIME AG

Related authors

Related to Codeless Time Series Analysis with KNIME

Related ebooks

Hands-On Artificial Intelligence for Beginners: An introduction to AI concepts, algorithms, and their implementation

Hands-On Predictive Analytics with Python: Master the complete predictive analytics process, from problem definition to model deployment

Data Labeling in Machine Learning with Python: Explore modern ways to prepare labeled data for training and fine-tuning ML and generative AI models

Machine Learning with LightGBM and Python: A practitioner's guide to developing production-ready machine learning systems

Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch

Go Machine Learning Projects: Eight projects demonstrating end-to-end machine learning and predictive analytics applications in Go

The Pandas Workshop: A comprehensive guide to using Python for data analysis with real-world case studies

A Handbook of Mathematical Models with Python: Elevate your machine learning projects with NetworkX, PuLP, and linalg

Data Forecasting and Segmentation Using Microsoft Excel: Perform data grouping, linear predictions, and time series machine learning statistics without using code

Machine Learning with the Elastic Stack.: Gain valuable insights from your data with Elastic Stack's machine learning features

Deep Learning By Example: A hands-on guide to implementing advanced machine learning algorithms and neural networks

The Deep Learning Architect's Handbook: Build and deploy production-ready DL solutions leveraging the latest Python techniques

Hands-On Data Preprocessing in Python: Learn how to effectively prepare data for successful data analytics

Python Machine Learning By Example: The easiest way to get into machine learning

Feature Engineering Made Easy: Identify unique features from your dataset in order to build powerful machine learning systems

Mastering Predictive Analytics with scikit-learn and TensorFlow: Implement machine learning techniques to build advanced predictive models using Python

The Handbook of NLP with Gensim: Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data

Hands-On Machine Learning with ML.NET: Getting started with Microsoft ML.NET to implement popular machine learning algorithms in C#

Agile Machine Learning with DataRobot: Automate each step of the machine learning life cycle, from understanding problems to delivering value

Hands-On Neural Network Programming with C#: Add powerful neural network capabilities to your C# enterprise applications

Machine Learning Fundamentals: Use Python and scikit-learn to get up and running with the hottest developments in machine learning

Time Series Analysis on AWS: Learn how to build forecasting models and detect anomalies in your time series data

Practical Data Analysis - Second Edition

Machine Learning Engineering with Python: Manage the production life cycle of machine learning models using MLOps with practical examples

Learning Data Mining with Python

Modern Time Series Forecasting with Python: Explore industry-ready time series forecasting using modern machine learning and deep learning

Engineering MLOps: Rapidly build, test, and manage production-ready machine learning life cycles at scale

Practical Convolutional Neural Networks: Implement advanced deep learning models using Python

Introduction to Machine Learning and Neural Classification

15 Math Concepts Every Data Scientist Should Know: Understand and learn how to apply the math behind data science algorithms

Data Visualization For You

Data Analytics for Beginners: Introduction to Data Analytics

Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work

Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence

How to Lie with Maps

Data Visualization: A Practical Introduction

Data Analytics & Visualization All-in-One For Dummies

The Applied SQL Data Analytics Workshop - Second Edition: Develop your practical skills and prepare to become a professional data analyst, 2nd Edition

Hands-On Data Science for Marketing: Improve your marketing strategies with machine learning using Python and R

Effective Data Storytelling: How to Drive Change with Data, Narrative and Visuals

Visual Analytics with Tableau

Teach Yourself VISUALLY Power BI

Python For Beginners. Learn Data Science in 5 Days the Smart Way and Remember it Longer. With Easy Step by Step Guidance & Hands on Examples. (Python Crash Course-Programming for Beginners): Python for Beginners

How to be Clear and Compelling with Data: Principles, Practice and Getting Beyond the Basics

Google Analytics 4 Migration Quick Guide 2022: Universal Analytics disappears in July 2023 - are you ready?

Python Automation Cookbook: 75 Python automation ideas for web scraping, data wrangling, and processing Excel, reports, emails, and more, 2nd Edition

Data Structures & Algorithms Interview Questions You'll Most Likely Be Asked

Learn Power BI: A comprehensive, step-by-step guide for beginners to learn real-world business intelligence

Salesforce Reporting and Dashboards

Data Visualization Guide

The Big Book of Dashboards: Visualizing Your Data Using Real-World Business Scenarios

How to Become a Data Analyst: My Low-Cost, No Code Roadmap for Breaking into Tech

Data Visualization with Excel Dashboards and Reports

Visualizing Graph Data

LaTeX Graphics with TikZ: A practitioner's guide to drawing 2D and 3D images, diagrams, charts, and plots

Learning PySpark

Chatgpt | Generative AI - The Step-By-Step Guide For OpenAI & Azure OpenAI In 36 Hrs.

Excel 2024: Mastering Charts, Functions, Formula and Pivot Table in Excel 2024 as a Beginner with Step by Step Guide

Jupyter Cookbook: Over 75 recipes to perform interactive computing across Python, R, Scala, Spark, JavaScript, and more

DAX Patterns: Second Edition

Mastering Python for Data Science

Book preview

Codeless Time Series Analysis with KNIME - KNIME AG