
Big Table

Uploaded by

sherlimca
Copyright
© All Rights Reserved

BigTable

 Bigtable is a compressed, high-performance, proprietary data storage system built on
the Google File System.
 Bigtable is now used by a number of Google applications, such as web indexing,
MapReduce (which is often used for generating and modifying data stored in Bigtable),
Google Maps, Google Book Search, "My Search History", Google Earth, Blogger.com,
Google Code hosting, YouTube, and Gmail.
 Bigtable is one of the prototypical examples of a wide column store.
 It maps two arbitrary string values (a row key and a column key) and a timestamp (hence
a three-dimensional mapping) to an associated arbitrary byte array.
 It is not a relational database and is better described as a sparse, distributed,
multi-dimensional sorted map.
 Bigtable is designed to scale into the petabyte range across "hundreds or thousands of
machines, and to make it easy to add more machines [to] the system and automatically
start taking advantage of those resources without any reconfiguration".
 For example, Google's copy of the web can be stored in a bigtable where the row key is
a domain-reversed URL, and columns describe various properties of a web page, with
one particular column holding the page itself.
 The page column can hold several versions of the same web page, each timestamped by
when it was fetched.
 Each cell of a bigtable can have zero or more timestamped versions of the data. Another
function of the timestamp is to allow for both versioning and garbage collection of
expired data.
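
The mapping described above can be sketched in a few lines of Python. This is only a toy in-memory model, not how Bigtable is implemented: the class name VersionedMap, its methods, the "contents:" column name, and the keep-the-newest-N garbage collection policy are all assumptions made for illustration.

```python
from collections import defaultdict


class VersionedMap:
    """Toy model of the (row key, column key, timestamp) -> bytes mapping."""

    def __init__(self):
        # (row, column) -> {timestamp: value}; empty cells are never stored
        self._cells = defaultdict(dict)

    def put(self, row, column, timestamp, value):
        self._cells[(row, column)][timestamp] = value

    def get(self, row, column, timestamp=None):
        """Return the newest value at or before `timestamp` (newest overall if None)."""
        versions = self._cells.get((row, column), {})
        candidates = [t for t in versions if timestamp is None or t <= timestamp]
        return versions[max(candidates)] if candidates else None

    def collect_garbage(self, keep_latest):
        """Keep only the `keep_latest` newest versions of every cell."""
        for versions in self._cells.values():
            for old in sorted(versions, reverse=True)[keep_latest:]:
                del versions[old]


# Hypothetical sample data: two fetches of the same page.
web = VersionedMap()
web.put("com.example.www", "contents:", 1, b"<html>v1</html>")
web.put("com.example.www", "contents:", 2, b"<html>v2</html>")

latest = web.get("com.example.www", "contents:")      # newest version
as_of_1 = web.get("com.example.www", "contents:", 1)  # version as of timestamp 1

web.collect_garbage(keep_latest=1)  # the older version is now dropped
```

A cell that was never written simply has no entry, which is what makes the map sparse.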
Google Big Table

 Google Bigtable is a wide-column database and is well suited to sparse data (tables in
which many cells are empty).
 Each value in Google Bigtable is addressed by a row key, a column key, and a timestamp.
 Each table has only one index, the row key.
 Rows are sorted lexicographically by row key, from the lowest to highest byte
string.
 Column keys are grouped into sets called column families.
 Each column is identified by a combination of a column family and a column
qualifier, which is a unique name within the column family.
 Timestamps can be assigned automatically by Bigtable.
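
As a rough illustration of the row key and column key rules above, the sketch below builds domain-reversed row keys, sorts them by their bytes, and splits a column key into family and qualifier. The helper names and the sample URLs are hypothetical, and the "family:qualifier" spelling is assumed here as the written form of a column key.

```python
from urllib.parse import urlsplit


def reversed_domain_key(url):
    """Build a row key such as 'com.example.www/index.html' from a URL."""
    parts = urlsplit(url)
    host = ".".join(reversed(parts.hostname.split(".")))
    return host + parts.path


def split_column(column):
    """Split 'family:qualifier' into its two parts."""
    family, _, qualifier = column.partition(":")
    return family, qualifier


urls = [
    "http://www.example.com/index.html",
    "http://maps.example.com/",
    "http://www.another.org/",
    "http://example.com/about",
]

# Sorting on the encoded keys compares raw bytes, lowest byte string first.
row_keys = sorted((reversed_domain_key(u) for u in urls),
                  key=lambda k: k.encode("utf-8"))

for key in row_keys:
    print(key)
```

Because the domain is reversed, the three example.com keys end up next to each other in the sort order, ahead of the another.org key.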
As the diagram shows, all client requests go through a front-end server before they are sent
to a Cloud Bigtable node.

The nodes are organized into a Cloud Bigtable cluster, which belongs to a Cloud Bigtable instance.

Each node in the cluster handles a subset of the requests to the cluster. By adding nodes to
the cluster, you can increase the number of simultaneous requests that the cluster can handle.

A Cloud Bigtable table is sharded into blocks of contiguous rows, called tablets, to
balance the workload. The tablets are stored on Colossus, Google's file system and the
successor to the Google File System.
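
The idea of sharding a sorted table into tablets can be sketched as follows. This is a simplified model under stated assumptions: tablets are cut at a fixed row count (a real system would split on size and load), and the function names and the "serverNN" row keys are invented for the example.

```python
import bisect


def split_into_tablets(sorted_keys, rows_per_tablet):
    """Cut a sorted list of row keys into contiguous blocks (toy 'tablets')."""
    return [sorted_keys[i:i + rows_per_tablet]
            for i in range(0, len(sorted_keys), rows_per_tablet)]


def tablet_for(row_key, tablets):
    """Return the index of the tablet whose key range covers `row_key`."""
    start_keys = [tablet[0] for tablet in tablets]
    # The last tablet whose first key is <= row_key owns the key;
    # keys below the first start key fall into tablet 0.
    return max(bisect.bisect_right(start_keys, row_key) - 1, 0)


keys = sorted(f"server{n:02d}" for n in range(10))
tablets = split_into_tablets(keys, rows_per_tablet=4)

for index, tablet in enumerate(tablets):
    print(index, tablet[0], "..", tablet[-1])
```

Since rows are kept in sorted order, locating the tablet for a key needs only a binary search over the tablets' first keys.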

 Cloud Bigtable tables are sparse. Empty cells do not take up any space.
 All operations are atomic at the row level. For example, if you update two rows in a table,
it is possible that one row will be updated successfully and the other update will fail.
 You can use Cloud Bigtable to store and query all of the following types of data.
 Time series data, such as CPU and memory usage over time for multiple servers.
 Marketing Data, such as purchase histories and customer preferences.
 Financial data, such as transactional histories, stock prices, currency exchange rates.
 Internet of Things, such as usage reports from energy meters and home appliances.
 Graph data, such as information about how users are connected to one another.
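
The row-level atomicity noted above can be illustrated with a small simulation. It is a sketch only: the table is a plain dictionary, the function names are hypothetical, and a type check stands in for whatever might make a real write fail.

```python
import copy


def apply_row_mutation(table, row_key, mutation):
    """Apply all changes to one row, or none of them if any change is invalid."""
    draft = copy.deepcopy(table.get(row_key, {}))
    for column, value in mutation.items():
        if not isinstance(value, bytes):
            raise TypeError(f"{column}: value must be bytes")
        draft[column] = value
    table[row_key] = draft  # the row is swapped in only after every change succeeded


def mutate_rows(table, mutations):
    """Apply each row's mutation independently and report a per-row outcome."""
    results = {}
    for row_key, mutation in mutations.items():
        try:
            apply_row_mutation(table, row_key, mutation)
            results[row_key] = "ok"
        except TypeError as err:
            results[row_key] = f"failed: {err}"
    return results


table = {}
results = mutate_rows(table, {
    "server01": {"metrics:cpu": b"0.71", "metrics:mem": b"0.40"},
    "server02": {"metrics:cpu": b"0.15", "metrics:mem": 0.52},  # invalid on purpose
})
print(results)
```

Here the update to server01 is applied in full, while the update to server02 leaves no partial row behind, so a caller has to check the outcome of each row separately.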
Practical : Google Big Table in Google Cloud Platform

Step 1

Step 2 : Create an instance


The instance name is given as "Sample BigTable Instance".
Click Done. The storage for the instance is created.
Once we have created the Bigtable instance, we have to activate Cloud Shell to install the
HBase shell.
Establishing a connection to Google Cloud Shell

Now my Cloud Shell has been activated. Google Cloud Platform provides an HBase shell. You can
use HBase commands to connect to and work with Bigtable.

In order to use the HBase shell, we need to fetch the Google Cloud Bigtable repository using
git clone, with the URL shown in the picture above.

Now my HBase shell is set up. It uses the project ID and the instance ID.

Creating the HBase shell.


There are currently no tables in HBase.

Create the table student. Insert data into the student table.


View data

Command: scan 'student'

The output shows the column family with student number, name, and address, along with the
timestamp of each cell.

This is how we can create and query a table in Bigtable.
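
The create, insert, and scan steps above can also be mimicked in plain Python to see what each one does to the data. This is an in-memory stand-in, not the HBase shell or a Bigtable client: the MiniTable class, the "info" column family, and the sample student values are all hypothetical.

```python
import time


class MiniTable:
    """In-memory stand-in for a table with column families."""

    def __init__(self, name, families):
        self.name = name
        self.families = set(families)
        self.rows = {}  # row key -> {"family:qualifier": (timestamp_ms, value)}

    def put(self, row_key, column, value, timestamp=None):
        family = column.split(":", 1)[0]
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        # A timestamp is assigned automatically when the caller gives none.
        ts = int(time.time() * 1000) if timestamp is None else timestamp
        self.rows.setdefault(row_key, {})[column] = (ts, value)

    def scan(self):
        """Yield (row key, column, timestamp, value) in row-key order."""
        for row_key in sorted(self.rows):
            for column in sorted(self.rows[row_key]):
                ts, value = self.rows[row_key][column]
                yield row_key, column, ts, value


# Create the table, insert data, then view it.
student = MiniTable("student", families=["info"])
student.put("1", "info:name", "Asha", timestamp=1000)
student.put("1", "info:address", "Chennai", timestamp=1000)
student.put("2", "info:name", "Ravi")

for row_key, column, ts, value in student.scan():
    print(f"{row_key}  column={column}, timestamp={ts}, value={value}")
```

As in the scan output described above, every cell is listed with its row key, its column, and its timestamp.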
