Real-time Analytics with Storm and Cassandra

Ebook, 424 pages, 2 hours

Name: Real-time Analytics with Storm and Cassandra
Author: Shilpi Saxena
ISBN: 9781784390006

About this ebook

About This Book

Create your own data processing topology and implement it in various real-time scenarios using Storm and Cassandra
Build highly available and linearly scalable applications using Storm and Cassandra that will process voluminous data at lightning speed
A pragmatic, example-oriented guide to implementing various applications built with Storm and Cassandra

Who This Book Is For

If you want to efficiently use Storm and Cassandra together and excel at developing production-grade, distributed real-time applications, then this book is for you. No prior knowledge of using Storm and Cassandra together is necessary. However, a background in Java is expected.

Language: English

Publisher: Packt Publishing

Release date: Mar 27, 2015

Book preview

Real-time Analytics with Storm and Cassandra - Shilpi Saxena

Real-time Analytics with Storm and Cassandra

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Let's Understand Storm

Distributed computing problems

Real-time business solution for credit or debit card fraud detection

Aircraft Communications Addressing and Reporting system

Healthcare

Other applications

Solutions for complex distributed use cases

The Hadoop solution

A custom solution

Licensed proprietary solutions

Other real-time processing tools

A high-level view of various components of Storm

Delving into the internals of Storm

Quiz time

Summary

2. Getting Started with Your First Topology

Prerequisites for setting up Storm

Components of a Storm topology

Spouts

Bolts

Streams

Tuples – the data model in Storm

Executing a sample Storm topology – local mode

WordCount topology from the Storm-starter project

Executing the topology in the distributed mode

Set up Zookeeper (V 3.3.5) for Storm

Setting up Storm in the distributed mode

Launching Storm daemons

Executing the topology from Command Prompt

Tweaking the WordCount topology to customize it

Quiz time

Summary

3. Understanding Storm Internals by Examples

Customizing Storm spouts

Creating FileSpout

Tweaking WordCount topology to use FileSpout

The SocketSpout class

Anchoring and acking

The unreliable topology

Stream groupings

Local or shuffle grouping

Fields grouping

All grouping

Global grouping

Custom grouping

Direct grouping

Quiz time

Summary

4. Storm in a Clustered Mode

The Storm cluster setup

Zookeeper configurations

Cleaning up Zookeeper

Storm configurations

Storm logging configurations

The Storm UI

Section 1

Section 2

Section 3

Section 4

The visualization section

Storm monitoring tools

Quiz time

Summary

5. Storm High Availability and Failover

An overview of RabbitMQ

Installing the RabbitMQ cluster

Prerequisites for the setup of RabbitMQ

Setting up a RabbitMQ server

Testing the RabbitMQ server

Creating a RabbitMQ cluster

Enabling the RabbitMQ UI

Creating mirror queues for high availability

Integrating Storm with RabbitMQ

Creating a RabbitMQ feeder component

Wiring the topology for the AMQP spout

Building high availability of components

High availability of the Storm cluster

Guaranteed processing of the Storm cluster

The Storm isolation scheduler

Quiz time

Summary

6. Adding NoSQL Persistence to Storm

The advantages of Cassandra

Columnar database fundamentals

Types of column families

Types of columns

Setting up the Cassandra cluster

Installing Cassandra

Multiple data centers

Prerequisites for setting up multiple data centers

Installing Cassandra data centers

Introduction to CQLSH

Introduction to CLI

Using different client APIs to access Cassandra

Storm topology wired to the Cassandra store

The best practices for Storm/Cassandra applications

Quiz time

Summary

7. Cassandra Partitioning, High Availability, and Consistency

Consistent hashing

One or more node goes down

One or more node comes back up

Replication in Cassandra and strategies

Cassandra consistency

Write consistency

Read consistency

Consistency maintenance features

Quiz time

Summary

8. Cassandra Management and Maintenance

Cassandra – gossip protocol

Bootstrapping

Failure scenario handling – detection and recovery

Cassandra cluster scaling – adding a new node

Cassandra cluster – replacing a dead node

The replication factor

The nodetool commands

Cassandra fault tolerance

Cassandra monitoring systems

JMX monitoring

Datastax OpsCenter

Quiz time

Summary

9. Storm Management and Maintenance

Scaling the Storm cluster – adding new supervisor nodes

Scaling the Storm cluster and rebalancing the topology

Rebalancing using the GUI

Rebalancing using the CLI

Setting up workers and parallelism to enhance processing

Scenario 1

Scenario 2

Scenario 3

Storm troubleshooting

The Storm UI

Storm logs

Quiz time

Summary

10. Advance Concepts in Storm

Building a Trident topology

Understanding the Trident API

Local partition manipulation operation

Functions

Filters

partitionAggregate

Sum aggregate

CombinerAggregator

ReducerAggregator

Aggregator

Operations related to stream repartitioning

Data aggregations over the streams

Grouping over a field in a stream

Merge and join

Examples and illustrations

Quiz time

Summary

11. Distributed Cache and CEP with Storm

The need for distributed caching in Storm

Introduction to memcached

Setting up memcache

Building a topology with a cache

Introduction to the complex event processing engine

Esper

Getting started with Esper

Integrating Esper with Storm

Quiz time

Summary

A. Quiz Answers

Chapter 1

Chapter 2

Chapter 3

Chapter 4

Chapter 5

Chapter 6

Chapter 7

Chapter 8

Chapter 9

Chapter 10

Chapter 11

Index

Real-time Analytics with Storm and Cassandra

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: March 2015

Production reference: 1240315

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78439-549-0

www.packtpub.com

Credits

Author

Shilpi Saxena

Reviewers

Sourav Gulati

Saurabh Gupta

Ranjeet Kumar Jha

Mark Kerzner

Sonal Raj

Commissioning Editor

Akram Hussain

Acquisition Editor

Larissa Pinto

Content Development Editor

Shweta Pant

Technical Editor

Saurabh Malhotra

Copy Editors

Pranjali Chury

Merilyn Pereira

Project Coordinator

Shipra Chawhan

Proofreaders

Simran Bhogal

Maria Gould

Paul Hindle

Indexer

Mariammal Chettiyar

Graphics

Sheetal Aute

Valentina D'silva

Abhinash Sahu

Production Coordinator

Manu Joseph

Cover Work

Manu Joseph

About the Author

Shilpi Saxena is a seasoned professional who is a leader in management with an edge as a technology evangelist. She is an engineer with exposure to a variety of domains (machine-to-machine space, health care, telecom, hiring, and manufacturing). She has experience in all aspects of the conception and execution of enterprise solutions. She has been architecting, managing, and delivering solutions in the big data space for the last 3 years, handling high-performance, geographically distributed teams of elite engineers.

Shilpi has more than 12 years of experience (3 of them in the big data space) in the development and execution of various facets of enterprise solutions, in both the product and services dimensions of the software industry. An engineer by degree and profession, she has worn varied hats (developer, technical leader, product owner, tech manager, and so on) and has seen all the flavors the industry has to offer.

She has architected and worked through some of the pioneering production implementations in big data on Storm and Impala with autoscaling in AWS.

To know more about her, visit her LinkedIn profile at http://in.linkedin.com/pub/shilpi-saxena/4/552/a30.

I would like to thank my husband, Sachin Saxena, and my mother, Manju Saxena, for their constant support and encouragement while writing this book. A sincere word of thanks to Impetus and all my mentors, who gave me a chance to innovate and learn as part of the big data group.

About the Reviewers

Sourav Gulati is an MCA and has been working in the IT industry for about 5 years. He has worked on technologies such as Java and Unix shell scripting and has also worked on big data technologies such as Hadoop, Cassandra, Storm, and so on. Initially, he started working for Tech Mahindra in 2010 and then moved to Impetus in 2012. Currently, he is working as a senior software engineer at Impetus.

I would really like to thank Shilpi Saxena and Packt Publishing for giving me the chance to be a part of this book. This book is packed with practical knowledge and experience. I would also like to wish Shilpi a lot of success with this book.

Saurabh Gupta is the lead software engineer at Impetus Technologies and has around 8 years of experience in IT. He started his career with Java/J2EE and headed toward NoSQL and big data technologies. He loves to read about new technologies or tools on the market. He believes that there are no secrets to success, but rather that it is the result of preparation, hard work, and learning from failure.

I want to thank my wife, Nalini, and the rest of my family, who supported and encouraged me in spite of all the time it kept me away from them.

Ranjeet Kumar Jha has over 12 years of experience in various phases of project life cycles, including the development and design phases, and has also been part of production support for Java/J2EE and big data-based applications. He has more than 6 years of experience as a technical architect in Java technologies and more than 3 years in big data stacks. He has worked in various domains such as finance, insurance, e-commerce, digital media, and online advertisements.

Ranjeet has worked as a programmer, designer, and mentor and now works as an architect in all types of projects related to Java, especially J2EE and big data.

His LinkedIn profile is available at https://www.linkedin.com/in/jharanjeet.

His certifications include:

OCM-JEA 5 (Oracle Certified Master, Java Enterprise Architect) with a 94 percent score in 2011

OCE-WSD (Oracle Certified Expert, JAVA EE 6 Web Services Developer) in 2013

SCJP (Sun Certified Java Programmer) in 2004

SCWCD (Sun Certified Web Component Developer) in 2004

Java Development with Apache Cassandra from DataStax in 2014

MongoDB for Java Developers from MongoDB University in 2014

The companies he has worked for include the following:

EtechAces Consulting and Marketing Pvt Ltd. Gurgaon (Delhi NCR)

Times Internet Ltd (TimesGroup), Noida (Delhi NCR)

Ebusinessware Inc (now Xoriant Corporation), Gurgaon (Delhi NCR)

WIPRO, Gurgaon (Delhi NCR)

AgreeYa Solution Pvt Ltd, Noida (Delhi NCR)

INCA Informatics, Noida (Delhi NCR)

I would like to thank my family—my wife, Anila Jha, and two kids, Anushka Jha and Tanisha Jha, for their constant support, encouragement, and patience. Without you, I wouldn't have achieved so much! Love you all immensely.

Mark Kerzner holds degrees in law, math, and computer science. He is a software architect who has been working on Hadoop-based systems since 2008. Mark is a cofounder of Elephant Scale, a big data training and consulting company. He is a coauthor of the open source books Hadoop Illuminated and HBase Design Patterns, both by Packt Publishing. He has also authored and coauthored other books and patents, which can be found at http://www.amazon.com.

I would like to acknowledge the help of my colleagues, in particular, Sujee Maniyam, and last but not least, my multitalented family.

Sonal Raj is a hacker, Pythonista, big data believer, and a technology dreamer. He has a passion for design and is an artist at heart. He blogs about technology, design, and gadgets at http://www.sonalraj.com/. When not working on projects, he can be found traveling, stargazing, or reading.

He has pursued engineering in computer science and loves to work on community projects. He has been a research fellow at SERC, IISc, Bangalore, and has taken up projects on graph computations using Neo4j and Storm. Sonal has been a speaker at PyCon India and local meets on Neo4j and has also published articles and research papers in leading magazines and international journals. He has contributed to several open source projects.

Sonal has been actively involved in the development of machine learning frameworks and has worked on technologies such as NoSQL databases including MongoDB and streaming using Apache Spark. He is currently working at Goldman Sachs.

I am grateful to the author for patiently listening to my critiques and I'd like to thank the open source community for keeping their passion alive and contributing to such remarkable projects. A special thank you to my parents, without whom I would never have grown to love learning as much as I do.

www.PacktPub.com

Support files, eBooks, discount offers, and more

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www2.packtpub.com/books/subscription/packtlib

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.

Why subscribe?

Fully searchable across every book published by Packt

Copy and paste, print, and bookmark content

On demand and accessible via a web browser

Free access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.

Preface

Storm, initially a project from the house of Twitter, has graduated to become an Apache project and has accordingly been renamed from Twitter Storm to Apache Storm. It is the brainchild of Nathan Marz and has since been adopted by distributions such as Cloudera's Distribution Including Apache Hadoop (CDH) and the Hortonworks Data Platform (HDP).

Apache Storm is a highly scalable, distributed, fast, and reliable real-time computing system designed to process very high-velocity data. Cassandra complements this computing capability by providing lightning-fast reads and writes, and this is the best combination currently available for data
