Scalable Data Streaming with Amazon Kinesis: Design and secure highly available, cost-effective data streaming applications with Amazon Kinesis
Ebook, 537 pages, 4 hours


About this ebook

Amazon Kinesis is a collection of secure, serverless, durable, and highly available purpose-built data streaming services. These data streaming services provide APIs and client SDKs that enable you to produce and consume data at scale.
Scalable Data Streaming with Amazon Kinesis begins with a quick overview of the core concepts of data streams, along with the essentials of the AWS Kinesis landscape. You'll then explore the requirements of the use cases shown throughout the book to help you get started and cover the key pain points encountered in the data stream life cycle. As you advance, you'll get to grips with the architectural components of Kinesis, understand how they are configured to build data pipelines, and delve into the applications that connect to them for consumption and processing. You'll also build a Kinesis data pipeline from scratch and learn how to implement and apply practical solutions. Moving on, you'll learn how to configure Kinesis on a cloud platform. Finally, you'll learn how other AWS services can be integrated into Kinesis. These services include Amazon Redshift, Amazon DynamoDB, Amazon S3, Amazon Elasticsearch Service, and third-party applications such as Splunk.
By the end of this AWS book, you'll be able to build and deploy your own Kinesis data pipelines with Kinesis Data Streams (KDS), Kinesis Data Firehose (KDF), Kinesis Video Streams (KVS), and Kinesis Data Analytics (KDA).

Language: English
Release date: Mar 31, 2021
ISBN: 9781800564336


    Book preview

    Scalable Data Streaming with Amazon Kinesis - Tarik Makota


    BIRMINGHAM—MUMBAI

    Scalable Data Streaming with Amazon Kinesis

    Copyright © 2021 Packt Publishing

    All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

    Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author(s), nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

    Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

    Group Product Manager: Kunal Parikh

    Publishing Product Manager: Devika Battike

    Senior Editor: Mohammed Yusuf Imaratwale

    Content Development Editors: Sean Lobo and Tazeen Shaikh

    Technical Editor: Devanshi Deepak Ayare

    Copy Editor: Safis Editing

    Project Coordinator: Aparna Ravikumar Nair

    Proofreader: Safis Editing

    Indexer: Tejal Daruwale Soni

    Production Designer: Shankar Kalbhor

    First published: March 2021

    Production reference: 1300321

    Published by Packt Publishing Ltd.

    Livery Place

    35 Livery Street

    Birmingham

    B3 2PB, UK.

    ISBN 978-1-80056-540-1

    www.packt.com

    Contributors

    About the authors

    Tarik Makota hails from a small town in Bosnia. He is a principal solutions architect with AWS, a builder, a writer, and the self-proclaimed best fly fisherman at AWS. Never a perfect student, he managed to earn an MSc in software development and management from RIT. When he is not doing the cloud or writing, Tarik spends most of his time fly fishing in pursuit of slippery trout. He feeds his addiction by spending summers in Montana. Tarik lives in New Jersey with his family, Mersiha, Hana, and two exceptionally perfect dogs.

    Brian Maguire is a solutions architect at AWS, where he is focused on helping customers build solutions in the cloud. He is a technologist, writer, teacher, and student who loves learning. Brian lives in New Hope, Pennsylvania, with his family, Lorna, Ciara, Chris, and several cats.

    Danny Gagne is a solutions architect at AWS. He has extensive experience in the design and implementation of large-scale, high-performance analysis systems. He lives in New York City.

    Rajeev Chakrabarti is a principal developer advocate with the Amazon Kinesis and the Amazon MSK team. He has worked for many years in the big data and data streaming space. Before joining the Amazon Kinesis team, he was a streaming specialist solutions architect helping customers build streaming pipelines. He lives in New Jersey with his family, Shaifalee and Anushka.

    About the reviewers

    Ritesh Gupta works as a software development manager with AWS, leading the control plane and data plane teams on the Kinesis Data Streams service. He has over 20 years of experience in leading and delivering geographically distributed web-scale applications and highly available distributed systems supporting millions of transactions per second; he has 10 years of experience in managing engineers and managers. Prior to Amazon, he worked at Microsoft, EA Games, Dell, and a few successful start-ups. His technical expertise cuts across building web-scale applications, enterprise software, and big data. I thank my wife, Jyothi, and daughter, Udita, for putting up with the late-night learning sessions that have allowed me to be where I am.

    Randy Ridgley is an experienced technology generalist working with organizations in the media and entertainment, casino gaming, and public sector fields that are looking to adopt cloud technologies. He started his journey into software development at a young age, building BASIC programs on the Commodore 64. In his professional career, he started by building Windows applications, eventually graduating to Linux with multiple programming languages. Currently, you can find Randy spending most of his time building end-to-end real-time streaming solutions on AWS using serverless technologies and IoT.

    Table of Contents

    Preface

    Section 1: Introduction to Data Streaming and Amazon Kinesis

    Chapter 1: What Are Data Streams?

    Introducing data streams

    Sources of data

    The value of real-time data in analytics

    Decoupling systems

    Challenges associated with distributed systems

    Transactions per second

    Scaling

    Latency

    Fault tolerance/high availability

    Overview of messaging concepts

    Overview of core messaging components

    Messaging concepts

    Examples of data streaming

    Application log processing

    Internet of Things

    Real-time recommendations

    Video streams

    Summary

    Further reading

    Chapter 2: Messaging and Data Streaming in AWS

    Amazon Kinesis Data Streams (KDS)

    Encryption, authentication, and authorization

    Producing and consuming records

    Data delivery guarantees

    Integration with other AWS services

    Monitoring

    Amazon Kinesis Data Firehose (KDF)

    Encryption, authentication, and authorization

    Monitoring

    Producers

    Delivery destinations

    Transformations

    Amazon Kinesis Data Analytics (KDA)

    Amazon KDA for SQL

    Amazon Kinesis Data Analytics for Apache Flink (KDA Flink)

    Amazon Kinesis Video Streams (KVS)

    Amazon Simple Queue Service (SQS)

    Amazon Simple Notification Service (SNS)

    Amazon SNS integrations with other AWS services

    Encryption at rest

    Amazon MQ for Apache ActiveMQ

    IoT Core

    Device software

    Control services

    Analytics services

    Amazon Managed Streaming for Apache Kafka (MSK)

    Apache Kafka

    Amazon MSK

    Amazon EventBridge

    Service comparison summary

    Summary

    Chapter 3: The SmartCity Bike-Sharing Service

    The mission for sustainable transportation

    SmartCity new mobile features

    SmartCity data pipeline

    SmartCity data lake

    SmartCity operations and analytics dashboard

    SmartCity video

    The AWS Well-Architected Framework

    Summary

    Further reading

    Section 2: Deep Dive into Kinesis

    Chapter 4: Kinesis Data Streams

    Technical requirements

    Discovering Amazon Kinesis Data Streams

    Creating streams and shards

    Creating a stream producer application

    Creating a stream consumer application

    Data pipelines with Amazon Kinesis Data Streams

    Data pipeline design (simple)

    Data pipeline design (intermediate)

    Data pipeline design (full design)

    Designing for scalable and reliable analytics pipelines

    Monitoring and scaling with Amazon Kinesis Data Streams

    X-Ray tracing with Amazon Kinesis Data Streams

    Scaling up with Amazon Kinesis Data Streams

    Securing Amazon Kinesis Data Streams

    Implementing least-privilege access

    Summary

    Further reading

    Chapter 5: Kinesis Firehose

    Technical requirements

    Setting up the AWS account

    Using a local development environment

    Using an AWS Cloud9 development environment

    Code examples

    Discovering Amazon Kinesis Firehose

    Understanding KDF delivery streams

    Understanding encryption in KDF

    Using data transformation in KDF with a Lambda function

    Understanding delivery stream destinations

    Amazon S3

    Amazon Redshift

    Amazon Elasticsearch Service

    Splunk destination

    HTTP endpoint destination

    Understanding data format conversion in KDF

    Deserialization

    Schema

    Serializer

    Data format conversion errors

    Understanding monitoring in KDF

    Use-case example – Bikeshare station data pipeline with KDF

    Steps to recreate the example

    Summary

    Further reading

    Chapter 6: Kinesis Data Analytics

    Technical requirements

    AWS account setup

    AWS CDK

    Java and Java IDE

    Code examples

    Discovering Amazon KDA

    Working on SmartCity bike share analytics use cases

    Creating operational insights using SQL Engine

    Core concepts and capabilities

    Creating operational insights using Apache Flink

    Options for running Flink applications in AWS Cloud

    Flink applications on KDA

    Building bike ride analytic applications

    Setting up a producer application

    Building a KDA SQL application

    Building a KDA Flink application

    Monitoring KDA applications

    Summary

    Further reading

    Blogs

    Workshops

    Chapter 7: Amazon Kinesis Video Streams

    Technical requirements

    AWS account setup

    Using a local development environment

    Code examples

    Understanding video fundamentals

    Containers

    Codecs

    Discovering Amazon Kinesis Video Streams WebRTC

    Core concepts and connection patterns

    Creating a signaling channel

    Establishing a connection

    Discovering Amazon KVS

    Key components of KVS

    Stream

    Kinesis producer

    Consuming

    Creating a stream

    Producing

    Integration with Rekognition

    Building video-enabled applications with KVS

    Summary

    Further reading

    Section 3: Integrations

    Chapter 8: Kinesis Integrations

    Technical requirements

    AWS account setup

    AWS CLI

    Kinesis Data Generator

    Code examples

    Amazon services that can produce data to send to Kinesis

    Amazon Connect

    Amazon Aurora database activity

    DynamoDB activity

    Processing Kinesis data with Apache Spark

    Amazon services that consume data from Kinesis

    Serverless data lake

    Amazon services that transform Kinesis data

    Routing events with EventBridge

    Third-party integrations with Kinesis

    Splunk

    Summary

    Further reading

    Why subscribe?

    Other Books You May Enjoy

    Preface

    Amazon Kinesis is a collection of secure, serverless, durable, and highly available purpose-built data streaming services. These data streaming services provide APIs and client SDKs to enable you to produce and consume data at scale.

    Scalable Data Streaming with Amazon Kinesis begins with a quick overview of the core concepts of data streams along with the essentials of the AWS Kinesis landscape. You'll then explore the requirements of the use cases shown throughout the book to help you get started, and cover the key pain points encountered in the data stream life cycle. As you advance, you'll get to grips with the architectural components of Kinesis, understand how they are configured to build data pipelines, and delve into the applications that connect to them for consumption and processing. You'll also build a Kinesis data pipeline from scratch and learn how to implement and apply practical solutions. Moving on, you'll learn how to configure Kinesis on a cloud platform. Finally, you'll learn how other AWS services can be integrated into Kinesis. These services include Amazon Redshift, Amazon DynamoDB, Amazon S3, Amazon Elasticsearch Service, and third-party applications such as Splunk.

    By the end of this AWS book, you'll be able to build and deploy your own Kinesis data pipelines with Kinesis Data Streams (KDS), Kinesis Data Firehose (KDF), Kinesis Video Streams (KVS), and Kinesis Data Analytics (KDA).

    Who this book is for

    This book is for solutions architects, developers, system administrators, data engineers, and data scientists looking to evaluate and choose the most performant, secure, scalable, and cost-effective data streaming technology to overcome their data ingestion and processing challenges on AWS. Prior knowledge of cloud architectures on AWS, data streaming technologies, and architectures is expected.

    What this book covers

    Chapter 1, What Are Data Streams?, covers core streaming concepts so that you will have a detailed understanding of their application in distributed systems.

    Chapter 2, Messaging and Data Streaming in AWS, takes a brief look at the ecosystem of AWS services in the messaging space. After reading this chapter, you will have a good understanding of the various services, be able to differentiate them, and understand the strengths of each service.

    Chapter 3, The SmartCity Bike-Sharing Service, reviews the existing bike-sharing application and how the city plans to modernize it. This chapter will provide the background information for the examples used throughout the book.

    Chapter 4, Kinesis Data Streams, teaches concepts and capabilities, common deployment patterns, monitoring and scaling, and how to secure KDS. We will step through a data streaming solution that will ingest, process, and feed data from multiple SmartCity data systems.

    Chapter 5, Kinesis Firehose, teaches the concepts, common deployment patterns, monitoring and scaling, and security in KDF.

    Chapter 6, Kinesis Data Analytics, covers the concepts and capabilities, approaches for common deployment patterns, monitoring and scaling, and security in KDA. You will learn how real-time streaming data can be queried like a database with SQL or code.

    Chapter 7, Amazon Kinesis Video Streams, explores the concepts, monitoring and scaling, security, and deployment patterns for real-time communication and data ingestion. We will step through a solution that will provide real-time access to a video stream and ingest video data for the SmartCity data system.

    Chapter 8, Kinesis Integrations, reviews how to integrate Kinesis with several Amazon services, such as Amazon Redshift, Amazon DynamoDB, AWS Glue, Amazon Aurora, Amazon Athena, and other third-party services such as Splunk. We will integrate a wide variety of services to create a serverless data lake.

    To get the most out of this book

    All of the examples in the chapters in this book are run using an AWS account to access services such as Amazon Kinesis, DynamoDB, and Amazon S3. Readers will need a Windows, Mac, or Linux computer with an internet connection. Many of the examples in the book use a command-line terminal such as PuTTY, macOS Terminal, GNOME Terminal, or iTerm2 to run commands and change configuration. The examples written in Python are written for the Python 3 interpreter and may not work with Python 2. For the examples written for the Java platform, readers are encouraged to use Java version 11 and AWS Java SDK version 1.11. We make extensive use of the AWS CLI v2 and will also use Docker for some examples. In addition to software, a webcam or IP camera and Android device will be needed to fully execute some of the examples.

    If you are using the digital version of this book, we advise you to type the code yourself or access the code via the GitHub repository (link available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

    Download the example code files

    You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Streaming-Data-Solutions-with-Amazon-Kinesis. In case there's an update to the code, it will be updated on the existing GitHub repository.

    We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

    Download the color images

    We also provide a PDF file that has color images of the screenshots/diagrams used in this book. You can download it here: https://static.packt-cdn.com/downloads/9781800565401_ColorImages.pdf.

    Conventions used

    There are a number of text conventions used throughout this book.

    Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: In this command, we'll send the test2.mkv file we downloaded to the KVS stream.

    A block of code is set as follows:

    aws glue create-database --database-input {\"Name\":\"smartcitybikes\"}

    aws glue create-table --database-name smartcitybikes --table-input file://SmartCityGlueTable.json

    When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

    mediaSource.start();

    Any command-line input or output is written as follows:

    aws rekognition start-stream-processor --name kvsprocessor

    Bold: Indicates a new term, an important word, or words that you see onscreen. For example, words in menus or dialog boxes appear in the text like this. Here is an example: Once you have entered the appropriate information, all that's left is to click Create signaling channel.

    Tips or important notes

    Appear like this.

    Get in touch

    Feedback from our readers is always welcome.

    General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at [email protected].

    Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report it to us. Please visit www.packtpub.com/support/errata, select your book, click on the Errata Submission Form link, and enter the details.

    Piracy: If you come across any illegal copies of our works in any form on the Internet, we would be grateful if you would provide us with the location address or website name. Please contact us at [email protected] with a link to the material.

    If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

    Reviews

    Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

    For more information about Packt, please visit packt.com.

    Section 1: Introduction to Data Streaming and Amazon Kinesis

    In this section, you will be introduced to the concept of data streams and how they are used to create scalable data solutions. 

    This section comprises the following chapters:

    Chapter 1, What Are Data Streams?

    Chapter 2, Messaging and Data Streaming in AWS

    Chapter 3, The SmartCity Bike-Sharing Service

    Chapter 1: What Are Data Streams?

    A data stream is a system in which data continuously flows from multiple sources, just as water flows through a stream. The data is often produced and collected simultaneously in a continuous flow of many small files or records. Data streams are utilized by a wide range of business, medical, government, social media, and mobile applications. These applications include financial applications for the stock market and e-commerce ordering systems that collect orders and track their fulfillment and delivery.

    In the entertainment space, live data is produced by sensing devices embedded in player equipment, video game players generate data at massive scale, and new social media posts appear thousands of times per second. Governments also leverage streaming data and geospatial services to monitor land, wildlife, and other activities.

    Data volume and velocity are increasing at faster rates, creating new challenges in data processing and analytics. This book will detail these challenges and demonstrate how Amazon Kinesis can be used to address them. We will begin by discussing key concepts related to messaging in a technology-agnostic form to provide a solid foundation for building your Kinesis knowledge.

    Incorporating data streams into your application architecture will allow you to deliver high-performance solutions that are secure, scalable, and fast. In this chapter, we will cover core streaming concepts so that you will have a detailed understanding of their application to distributed systems. You will learn what a data stream is, how to leverage data streams to scale, and examine a number of high-level use cases.

    This chapter covers the following topics:

    Introducing data streams

    Challenges associated with distributed systems

    Overview of messaging concepts

    Examples of data streaming

    Introducing data streams

    Data streams are a way of storing a sequence of messages. They enable us to design systems where we think about state as a series of events instead of only entities and values, or rows and columns in a database. This shift in mindset and technology enables real-time analytics, which extract value from data by acting on it before it goes stale. Data streams also enable organizations to design and develop resilient software based on microservice architectures by helping them decouple systems. We will begin with an overview of streaming data sources, why real-time data analysis is valuable, and how streams can be used architecturally to decouple systems. We will then review the core challenges associated with distributed systems and conclude with an overview of key messaging concepts and some high-level examples. Messages can contain a wide variety of information and come from different sources, so let's look at the primary sources and data formats.
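    Before we do, here is a minimal sketch of what appending a single event to a stream can look like in practice, using the AWS SDK for Python (boto3). The stream name, region, and payload are illustrative placeholders rather than values used elsewhere in this book:

    import json

    import boto3

    # A minimal sketch, not production code: each record is appended to
    # the stream as an immutable event, and the partition key determines
    # which shard of the stream receives the record.
    kinesis = boto3.client("kinesis", region_name="us-east-1")

    event = {"deviceid": "device001", "temp": 68.4}

    kinesis.put_record(
        StreamName="example-stream",
        Data=json.dumps(event).encode("utf-8"),
        PartitionKey=event["deviceid"],
    )

    Later chapters cover production-grade producers and consumers in depth; for now, the point is simply that state is captured as an ordered sequence of small, self-describing events.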

    Sources of data

    Data steadily proliferates from sources such as social media, IoT devices, web clickstreams, application logs, and video cameras. This data poses challenges to most systems because it is typically high-velocity, intermittent, and bursty, which makes downstream systems difficult to provision and design adequately. Payloads are generally small, except when they contain audio or video data, and come in a variety of formats.

    In this book, we will be focusing on three data formats. These formats include the following:

    JavaScript Object Notation (JSON)

    Log files

    Time-encoded binary files such as video

    JSON streams

    JSON has become the dominant format for message serialization over the past 10 years. It is a lightweight data interchange format that is easy for humans to read and write and is based on JavaScript object syntax. It has two data structures: hash tables and lists. A hash table consists of key-value pairs, {key: value}, where the keys must be unique. A list is a set of values in a specific order, [value 1, value 2]. The following is a sample IoT JSON message:

    {
        "deviceid": "device001",
        "eventTime": -192778200,
        "temp": 68.4,
        "humidity": 77.3,
        "coords": {
            "latitude": 32.779039,
            "longitude": -96.808660
        }
    }
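    To connect these structures back to code, here is a short sketch of reading a message like the one above with Python's standard json module; the hash table becomes a dictionary, and the nested coords object becomes a nested dictionary:

    import json

    message = ('{"deviceid": "device001", "temp": 68.4, '
               '"coords": {"latitude": 32.779039, "longitude": -96.808660}}')

    # Parse the JSON text into Python data structures.
    record = json.loads(message)
    print(record["deviceid"])            # device001
    print(record["coords"]["latitude"])  # 32.779039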

    Log file streams

    Log files come in a variety of formats. Common ones include Apache Commons Logging, Apache Combined Log, Apache Error Log, and RFC 3164 syslog. They are plain text, and each line, delineated by a newline ('\n') character, is usually a separate log entry. The following sample log line records an HTTP GET request and contains the IP address (10.13.37.01), the datetime of the request, the HTTP verb, the URL fragment, the HTTP version, the response code, and the size of the response.

    The sample log line in Apache Commons Logging format is as follows:

    10.13.37.01 - - [03/Sep/2017:12:00:01 +0830] GET /mailman/listinfo/test HTTP/1.1 200 2457
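    As a sketch of how these fields can be recovered in code, the following Python snippet parses the sample line with a regular expression; the pattern is written for this exact layout and is not a general-purpose log parser:

    import re

    line = ("10.13.37.01 - - [03/Sep/2017:12:00:01 +0830] "
            "GET /mailman/listinfo/test HTTP/1.1 200 2457")

    # Named groups mirror the fields described in the text: IP address,
    # datetime, HTTP verb, URL fragment, HTTP version, response code, size.
    pattern = re.compile(
        r"(?P<ip>\S+) \S+ \S+ \[(?P<datetime>[^\]]+)\] "
        r"(?P<verb>\S+) (?P<url>\S+) (?P<version>\S+) "
        r"(?P<status>\d{3}) (?P<size>\d+)"
    )

    match = pattern.match(line)
    if match:
        print(match.group("ip"), match.group("status"))  # 10.13.37.01 200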

    Time-encoded binary streams

    Time-encoded binary streams consist of a time series of records where each record is related to the adjacent records (prior and subsequent records). These can be used for a wide variety of sensor data, from audio streams and RADAR signals to video streams. Throughout this book, the primary focus will be video streams and their applications.

    Figure 1.1 – Time-encoded video data

    As shown in Figure 1.1, video streams are composed of fragments, where each fragment is a self-contained sequence of media frames. There are no dependencies between fragments. We will discuss video streams in more detail in Chapter 7, Amazon Kinesis Video Streams. Now that we've covered the types of data that we'll be processing, let's take a step back to understand the value of real-time data in analytics.

    The value of real-time data in analytics

    Analysis is done to support decision making by individuals, organizations, or computer programs. Traditionally, data analysis has been done on batches of data, usually in long-running jobs that occur overnight or periodically at predetermined times: nightly, weekly, quarterly, and so on. This not only limits the scope of actions available to decision makers, but also provides them with only a representation of the past environment. Information is now available seconds after it is produced, so we need to design systems that provide decision makers with the freshest data available to make timely decisions.

    The OODA loop (Observe, Orient, Decide, Act) is a conceptual decision-making framework that describes how decisions are made when reacting to an event. By breaking it down into these four components,
