ClickHouse Data Warehouse 101

ClickHouse Data Warehouse 101
The First Billion Rows

Alexander Zaitsev and Robert Hodges
About Us
Robert Hodges - Altinity CEO Alexander Zaitsev - Altinity CTO
30+ years on DBMS plus Expert in data warehouse with

virtualization and security. petabyte-scale deployments.
Previously at VMware and Altinity Founder; Previously at

Continuent LifeStreet (Ad Tech business)
Altinity Background
● Premier provider of software and services for ClickHouse

● Incorporated in UK with distributed team in US/Canada/Europe
● Main US/Europe sponsor of ClickHouse community
● Offerings:
○ Enterprise support for ClickHouse and ecosystem projects
○ Software (Kubernetes, cluster manager, tools & utilities)
○ POCs/Training
ClickHouse
Overview
ClickHouse is a powerful data warehouse that handles
many use cases
Understands SQL
a b c d a b c d
Runs on bare metal to cloud
Stores data in columns
Parallel and vectorized execution

a b c d a b c d
Scales to many petabytes
Is Open source (Apache 2.0)
Is WAY fast!
Tables are split into indexed, sorted parts for fast queries
Index Columns
Part Indexed
Table
Sorted
Index Columns Compressed

Part
Part
If one server is not enough -- ClickHouse can scale out easily
ClickHouse
tripdata_dist tripdata
(Distributed) (MergeTable)
SELECT ...
FROM ClickHouse
tripdata_dist
Result Set
ClickHouse
Getting Started:
Data Loading
Installation: Use packages on Linux host
$ sudo apt -y install clickhouse-client=19.6.2 \
clickhouse-server=19.6.2 \
clickhouse-common-static=19.6.2
...
$ sudo systemctl start clickhouse-server
...
$ clickhouse-client
11e99303c78e :) select version()
...
┌─version()─┐
│ 19.6.2.11 │
└───────────┘
Decision tree for ClickHouse basic schema design
Fields Yes Types Yes Use scalar

are are columns with
fixed? known? specific type
No No
Use array
Use scalar Select
columns to
columns with partition key
store key
String type and sort order
value pairs
Tabular data structure typically gives the best results
CREATE TABLE tripdata (

`pickup_date` Date DEFAULT
Scalar columns
toDate(tpep_pickup_datetime),
ìd` UInt64, Specific datatypes
`vendor_id` String,
`tpep_pickup_datetime` DateTime, Time-based partition key
`tpep_dropoff_datetime` DateTime,
...
) ENGINE = MergeTree Sort key to index parts
PARTITION BY toYYYYMM(pickup_date)
ORDER BY (pickup_location_id, dropoff_location_id, vendor_id)
Use clickhouse-client to load data quickly from files
CSV Input Data
"Pickup_date","id","vendor_id","tpep_pickup_datetime"…
"2016-01-02",0,"1","2016-01-02 04:03:29","2016-01-02…
"2016-01-29",0,"1","2016-01-29 12:00:51","2016-01-29…
"2016-01-09",0,"1","2016-01-09 17:22:05","2016-01-09…
Reading CSV Input with Headers

clickhouse-client --database=nyc_taxi_rides --query='INSERT
INTO tripdata FORMAT CSVWithNames' < data.csv
Reading Gzipped CSV Input with Headers

gzip -d -c | clickhouse-client --database=nyc_taxi_rides
--query='INSERT INTO tripdata FORMAT CSVWithNames'
Wouldn’t it be nice to run in parallel over a lot of input files?
Altinity Datasets project does exactly that!

● Dump existing schema definitions and data to files
● Load files back into a database
● Data dump/load commands run in parallel
See https://github.com/Altinity/altinity-datasets
How long does it take to load 1.3B rows?
$ time ad-cli dataset load nyc_taxi_rides --repo_path=/data1/sample-data
Creating database if it does not exist: nyc_timed
Executing DDL: /data1/sample-data/nyc_taxi_rides/ddl/taxi_zones.sql
. . .
Loading data: table=tripdata, file=data-200901.csv.gz
. . .
Operation summary: succeeded=193, failed=0
real 11m4.827s
user 63m32.854s
sys 2m41.235s
(Amazon md5.2xlarge: Xeon(R) Platinum 8175M, 8vCPU, 30GB RAM, NVMe SSD)
Do we really have 1B+ table?
:) select count() from tripdata;
SELECT count()
FROM tripdata
┌────count()─┐
│ 1310903963 │
└────────────┘
1 rows in set. Elapsed: 0.324 sec. Processed 1.31 billion rows, 1.31 GB (4.05
billion rows/s., 4.05 GB/s.)
1,310,903,963/11m4s = 1,974,253 rows/sec!!!

Getting Started
on Queries
Let’s try to predict maximum performance
SELECT avg(number)
FROM
(
SELECT number
FROM system.numbers
LIMIT 1310903963 system.numbers -- internal
) generator for testing
┌─avg(number)─┐
│ 655451981 │
└─────────────┘
million rows/s., 3.07 GB/s.)
Now we try with the real data
SELECT avg(passenger_count)
FROM tripdata
┌─avg(passenger_count)─┐
│ 1.6817462943317076 │
└──────────────────────┘
1 rows in set. Elapsed: ?
Guess how fast?

Now we try with the real data
FROM tripdata
│ 1.6817462943317076 │
└──────────────────────┘
Even faster!!!!
Data type and cardinality matters
What if we add a filter
FROM tripdata
WHERE toYear(pickup_date) = 2016
│ 1.6571129913837774 │
└──────────────────────┘
1 rows in set. Elapsed: 0.162 sec. Processed 131.17 million rows, 393.50 MB (811.05
million rows/s., 2.43 GB/s.)
What if we add a group by
SELECT
pickup_location_id AS location_id,
avg(passenger_count),
count()
FROM tripdata
GROUP BY location_id LIMIT 10
...
10 rows in set. Elapsed: 0.251 sec. Processed 131.17 million rows, 655.83 MB
(522.62 million rows/s., 2.61 GB/s.)
What if we add a join
SELECT
zone,
count()
FROM tripdata
INNER JOIN taxi_zones ON taxi_zones.location_id = pickup_location_id
GROUP BY zone
LIMIT 10
10 rows in set. Elapsed: 0.803 sec. Processed 131.17 million rows, 655.83 MB (163.29
million rows/s., 816.44 MB/s.)
Yes, ClickHouse is FAST!
https://tech.marksblogg.com/benchmarks.html
Optimization
Techniques
How to make ClickHouse
even faster
You can optimize
Server settings
Schema
Column storage
Queries
You can optimize
SELECT avg(passenger_count) Default is a half of
FROM tripdata
SETTINGS max_threads = 1
available cores --
good enough
...
1 rows in set. Elapsed: 4.855 sec. Processed 1.31 billion rows, 1.31 GB
(270.04 million rows/s., 270.04 MB/s.)
FROM tripdata
SETTINGS max_threads = 8
...
Schema optimizations
Data types
Index
Dictionaries
Arrays
Materialized Views and aggregating engines

Data Types matter!
https://www.percona.com/blog/2019/02/15/clickhouse-performance-uint32-vs-uint64-vs-float32-vs-float64/
MaterializedView with SummingMergeTree
CREATE MATERIALIZED VIEW tripdata_mv
ENGINE = SummingMergeTree MaterializedView
PARTITION BY toYYYYMM(pickup_date)
ORDER BY (pickup_location_id, dropoff_location_id, vendor_id) AS works as an INSERT
SELECT trigger
pickup_date,
vendor_id,
pickup_location_id,
dropoff_location_id,
sum(passenger_count) AS passenger_count_sum,
sum(trip_distance) AS trip_distance_sum,
sum(fare_amount) AS fare_amount_sum,
sum(tip_amount) AS tip_amount_sum,
sum(tolls_amount) AS tolls_amount_sum,
sum(total_amount) AS total_amount_sum, SummingMergeTree
count() AS trips_count automatically
FROM tripdata
GROUP BY aggregates data in
pickup_date, the background
vendor_id,
pickup_location_id,
dropoff_location_id
INSERT INTO tripdata_mv SELECT
pickup_date,
vendor_id, Note, no group by!
pickup_location_id,
dropoff_location_id,
passenger_count, SummingMergeTree
trip_distance,
fare_amount, automatically
tip_amount, aggregates data in
tolls_amount,
total_amount, the background
1
FROM tripdata;
Ok.
0 rows in set. Elapsed: 303.664 sec. Processed 1.31 billion rows,

50.57 GB (4.32 million rows/s., 166.54 MB/s.)
SELECT count()
FROM tripdata_mv
┌──count()─┐
│ 20742525 │
└──────────┘
1 rows in set. Elapsed: 0.015 sec. Processed 20.74 million rows, 41.49 MB (1.39 billion
rows/s., 2.78 GB/s.)
SELECT
zone,
sum(passenger_count_sum)/sum(trips_count),
sum(trips_count)
FROM tripdata_mv
GROUP BY zone
LIMIT 10
10 rows in set. Elapsed: 0.036 sec. Processed 3.23 million rows, 64.57 MB (89.14 million
Realtime Aggreation with Materialized Views
Summing
MergeTree
Summing
INSERTS Raw Data MergeTree
Summing
MergeTree
Column storage optimizations
Compression
LowCardinality
Column encodings
LowCardinality example. Another 1B rows.
:) create table test_lc ( LowCardinality
a String, a_lc LowCardinality(String) DEFAULT a) Engine = MergeTree encodes column
PARTITION BY tuple() ORDER BY tuple(); with a dictionary
encoding
:) INSERT INTO test_lc (a) SELECT
concat('openconfig-interfaces:interfaces/interface/subinterfaces/subinter
face/state/index', toString(rand() % 1000))
FROM system.numbers LIMIT 1000000000; Storage is

dramatically
┌─table───┬─name─┬─type───────────────────┬─compressed─┬─uncompressed─┐
reduced
│ test_lc │ a │ String │ 4663631515 │ 84889975226 │
│ test_lc │ a_lc │ LowCardinality(String) │ 2010472937 │ 2002717299 │
└─────────┴──────┴────────────────────────┴────────────┴──────────────┘
LowCardinality example. Another 1B rows
:) select a a, count(*) from test_lc group by a order by count(*) desc limit 10;
┌─a────────────────────────────────────────────────────────────────────────────────────┬─count()─┐
│ openconfig-interfaces:interfaces/interface/subinterfaces/subinterface/state/index396 │ 1002761 │
...
│ openconfig-interfaces:interfaces/interface/subinterfaces/subinterface/state/index5 │ 1002203 │
└──────────────────────────────────────────────────────────────────────────────────────┴─────────┘
10 rows in set. Elapsed: 11.627 sec. Processed 1.00 billion rows, 92.89 GB (86.00 million
Faster
:) select a_lc a, count(*) from test_lc group by a order by count(*) desc limit 10;
...
Array example. Another 1B rows
create table test_array ( Arrays efficiently model 1-to-N
s String, relationship
a Array(LowCardinality(String)) default arrayDistinct(splitByChar(',', s))
) Engine = MergeTree PARTITION BY tuple() ORDER BY tuple();

Note the use of complex default
expression
INSERT INTO test_array (s)
WITH ['Percona', 'Live', 'Altinity', 'ClickHouse', 'MySQL', 'Oracle', 'Austin', 'Texas',

'PostgreSQL', 'MongoDB'] AS keywords
SELECT concat(keywords[((rand(1) % 10) + 1)], ',',
keywords[((rand(2) % 10) + 1)], ',',
keywords[((rand(3) % 10) + 1)], ',',
keywords[((rand(4) % 10) + 1)])
FROM system.numbers LIMIT 1000000000;

Array efficiently models 1-to-N
Data sample: relationship
┌─s────────────────────────────────────┬─a────────────────────────────────────────────┐
│ Texas,ClickHouse,Live,MySQL │ ['Texas','ClickHouse','Live','MySQL'] │
│ Texas,Oracle,Altinity,PostgreSQL │ ['Texas','PostgreSQL','Oracle','Altinity'] │
│ Percona,MySQL,MySQL,Austin │ ['MySQL','Percona','Austin'] │
│ PostgreSQL,Austin,PostgreSQL,Percona │ ['PostgreSQL','Percona','Austin'] │
│ Altinity,Percona,Percona,Percona │ ['Altinity','Percona'] │
Storage:
┌─table──────┬─name─┬─type──────────────────────────┬────────comp─┬──────uncomp─┐
│ test_array │ s │ String │ 11239860686 │ 31200058000 │
│ test_array │ a │ Array(LowCardinality(String)) │ 4275679420 │ 11440948123 │
└────────────┴──────┴───────────────────────────────┴─────────────┴─────────────┘
:) select count() from test_array where s like '%ClickHouse%';
┌───count()─┐
│ 343877409 │
└───────────┘
:) select count() from test_array where has(a,'ClickHouse');
┌───count()─┐
│ 343877409 │
└───────────┘
Well, ‘like’ is very efficient, but we reduced I/O a lot.

* has() will be optimized by dev team
Query optimization example. JOIN optimization
SELECT
zone,
count()
FROM tripdata
INNER JOIN taxi_zones ON taxi_zones.location_id =
pickup_location_id
GROUP BY zone
LIMIT 10
10 rows in set. Elapsed: 0.803 sec. Processed 131.17 million rows,

655.83 MB (163.29 million rows/s., 816.44 MB/s.)
Can we do it any faster?

Query optimization example. JOIN optimization
SELECT
zone, Subquery minimizes data
sum(pc_sum) / sum(pc_cnt) AS pc_avg, scanned in parallel; joins
sum(pc_cnt)
on GROUP BY results
FROM
(
SELECT
pickup_location_id,
sum(passenger_count) AS pc_sum,
count() AS pc_cnt
FROM tripdata
GROUP BY pickup_location_id
)
GROUP BY zone LIMIT 10
10 rows in set. Elapsed: 0.248 sec. Processed 131.17 million rows, 655.83
MB (529.19 million rows/s., 2.65 GB/s.)
ClickHouse
Integrations
...And a nice set of supporting ecosystem tools
Client libraries: JDBC, ODBC, Python, Golang, ...
Kafka table engine to ingest from Kafka queues
Visualization tools: Grafana, Tableau, Tabix, SuperSet
Data science stack integration: Pandas, Jupyter Notebooks
Kubernetes ClickHouse operator

Integrations with MySQL
MySQL External Dictionaries (pull data from MySQL to CH)
MySQL Table Engine and Table Function (query/insert)
Binary Log Replication
ProxySQL supports ClickHouse
ClickHouse supports MySQL wire protocol (in June release)

..and with PostgreSQL
ODBC External Dictionaries (pull data from PostgreSQL to CH)
ODBC Table Engine and Table Function (query/insert)
Logical Replication: https://github.com/mkabilov/pg2ch
Foreign Data Wrapper:

https://github.com/Percona-Lab/clickhousedb_fdw
ClickHouse Operator -- an easy way to manage ClickHouse
DWH in Kubernetes
https://github.com/Altinity/clickhouse-operator
Where to get more information
● ClickHouse Docs: https://clickhouse.yandex/docs/en/

● Altinity Blog: https://www.altinity.com/blog
● Meetups and presentations: https://www.altinity.com/presentations
○ 2 April -- Madrid, Spain ClickHouse Meetup
○ 7 May -- Limassol, Cyprus ClickHouse Meetup
○ 28-30 May -- Austin, TX Percona Live 2019
○ 4 June -- San Francisco ClickHouse Meetup
○ 8 June -- Beijing ClickHouse Meetup
○ September -- ClickHouse Paris Meetup
Contacts:
Questions? [email protected]
Visit us at:
Thank you! https://www.altinity.com
Read Our Blog:

https://www.altinity.com/blog

ClickHouse Data Warehouse 101 - The First Billion Rows

Uploaded by

Copyright:

Available Formats

ClickHouse Data Warehouse 101 - The First Billion Rows

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

ClickHouse Data Warehouse 101 - The First Billion Rows

Uploaded by

Copyright:

Available Formats

The First Billion Rows

Robert Hodges - Altinity CEO Alexander Zaitsev - Altinity CTO

30+ years on DBMS plus Expert in data warehouse with

Previously at VMware and Altinity Founder; Previously at

● Premier provider of software and services for ClickHouse

Stores data in columns

Parallel and vectorized execution

Is Open source (Apache 2.0)

Index Columns Compressed

Fields Yes Types Yes Use scalar

CREATE TABLE tripdata (

Reading CSV Input with Headers

Reading Gzipped CSV Input with Headers

Altinity Datasets project does exactly that!

1,310,903,963/11m4s = 1,974,253 rows/sec!!!

1 rows in set. Elapsed: ?

Guess how fast?

Materialized Views and aggregating engines

0 rows in set. Elapsed: 303.664 sec. Processed 1.31 billion rows,

FROM system.numbers LIMIT 1000000000; Storage is

│ test_lc │ a_lc │ LowCardinality(String) │ 2010472937 │ 2002717299 │

) Engine = MergeTree PARTITION BY tuple() ORDER BY tuple();

WITH ['Percona', 'Live', 'Altinity', 'ClickHouse', 'MySQL', 'Oracle', 'Austin', 'Texas',

SELECT concat(keywords[((rand(1) % 10) + 1)], ',',

keywords[((rand(2) % 10) + 1)], ',',

keywords[((rand(3) % 10) + 1)], ',',

keywords[((rand(4) % 10) + 1)])

FROM system.numbers LIMIT 1000000000;

:) select count() from test_array where has(a,'ClickHouse');

Well, ‘like’ is very eﬃcient, but we reduced I/O a lot.

10 rows in set. Elapsed: 0.803 sec. Processed 131.17 million rows,

Can we do it any faster?

Client libraries: JDBC, ODBC, Python, Golang, ...

Kafka table engine to ingest from Kafka queues

Visualization tools: Grafana, Tableau, Tabix, SuperSet

Data science stack integration: Pandas, Jupyter Notebooks

Kubernetes ClickHouse operator

MySQL External Dictionaries (pull data from MySQL to CH)

MySQL Table Engine and Table Function (query/insert)

Binary Log Replication

ProxySQL supports ClickHouse

ClickHouse supports MySQL wire protocol (in June release)

ODBC External Dictionaries (pull data from PostgreSQL to CH)

ODBC Table Engine and Table Function (query/insert)

Logical Replication: https://github.com/mkabilov/pg2ch

Foreign Data Wrapper:

● ClickHouse Docs: https://clickhouse.yandex/docs/en/

Read Our Blog:

You might also like