Legacy HBase client-side metrics
This page describes a set of client-side metrics that has been replaced with built-in client-side metrics. This page is maintained as a reference for any developers who still use the legacy metrics.
We recommend using the built-in client-side metrics described on the Client-side metrics overview instead of the legacy metrics described on this page.
Enable metrics
This information is provided for customers who are using the legacy metrics. To enable the newer, built-in client-side metrics, see Set up client-side metrics.
The Cloud Bigtable HBase client for Java used Dropwizard Metrics to collect and report client-side metrics. Because collecting metrics can add a very small amount of latency (single-digit microseconds) to each request, metrics were not enabled by default. The following sections explain how to enable the legacy client-side metrics.
Use the Log4j reporter
The simplest way to enable the legacy client-side metrics is to add the following line to your Log4j configuration file:
log4j.category.com.google.cloud.bigtable.metrics=DEBUG
This configuration setting turns on metrics collection and logs Bigtable metrics with an SLF4J logger.
Use other reporters
You can use other types of reporters by updating your application's code. For example, here is a reporter that sends metrics to a Graphite server:
Graphite pickledGraphite = new PickledGraphite(new InetSocketAddress("graphite.example.com", 2004));
DropwizardMetricRegistry registry = new DropwizardMetricRegistry();
GraphiteReporter reporter =
GraphiteReporter.forRegistry(registry.getRegistry())
.convertRatesTo(TimeUnit.SECONDS)
.convertDurationsTo(TimeUnit.MILLISECONDS)
.filter(MetricFilter.ALL)
.build(pickledGraphite);
reporter.start(1, TimeUnit.MINUTES);
BigtableClientMetrics.setMetricRegistry(registry);
Dropwizard Metrics provides reporters for JMX, console logging, SLF4J, and CSV. There are also a variety of third-party reporters available.
Available metrics
This section describes the metrics that are available when the legacy HBase client-side metrics are enabled. Each metric has one of the following types:
- Counter: A cumulative count per Java virtual machine (JVM).
- Meter: Count information plus throughput information (a count per the last minute, 5 minutes, or 15 minutes)
- Timer: Meter information plus latency information (such as median, mean, and 95th percentile)
Only certain metrics are collected for each type of request. See "Example:
Metrics for a Put
request" for an example of the metrics that
are collected during a Put
request.
Channel-level metrics
Type | Name | Description |
---|---|---|
Counter | google-cloud-bigtable.sessions.active |
How many BigtableSession s are opened. Each HBase
Connection has a single BigtableSession . |
Counter | google-cloud-bigtable.grpc.channel.active |
How many lower-level gRPC/Netty channels are opened. Each
BigtableSession has many gRPC channels. |
General RPC metrics
Type | Name | Description |
---|---|---|
Counter | google-cloud-bigtable.grpc.rpc.active |
The number of remote procedure calls (RPCs) that are currently active. |
Meter | google-cloud-bigtable.grpc.rpc.performed |
RPC throughput. |
Data method metrics
Data method metrics are collected for the following data methods:
ReadRows
: Implements gets and scans.MutateRow
: Implements puts and deletes.MutateRows
: Implements bulk writes.CheckAndMutateRow
: Implements HBase'scheckAnd*
methods.ReadModifyWrite
: Implements HBase'sAppend
andIncrement
methods.SampleRowKeys
: Retrieves region information that is used for MapReduce operations.
Type | Name | Description |
---|---|---|
Timer | google-cloud-bigtable.grpc.method.[METHOD_TYPE].operation.latency |
The length of time that individual operations take. Operations include the total amount of latency of all RPCs that are performed. (Usually only one RPC is performed. Client-side retries can cause the same RPC to be performed more than once if there is a transient error.) |
Timer | google-cloud-bigtable.grpc.method.ReadRows.firstResponse.latency |
The length of time that it takes to receive the first response to a scan request. |
Meter | google-cloud-bigtable.grpc.method.[METHOD_TYPE].retries.performed |
The number of retries that were performed. |
Meter | google-cloud-bigtable.grpc.method.[METHOD_TYPE].failures |
The number of non-retryable failures. |
Meter | google-cloud-bigtable.grpc.method.[METHOD_TYPE].retries.exhausted |
The number of times that retrying was aborted because too many retries had failed. |
Bulk metrics
Bulk metrics are provided for methods that return more than one response, such as a bulk write.
Type | Name | Description |
---|---|---|
Meter | google-cloud-bigtable.scanner.results |
The throughput of individual rows returned by a scan. |
Meter | google-cloud-bigtable.bulk-mutator.mutations.added |
The throughput of individual mutations added for each
MutateRows request. |
Meter | google-cloud-bigtable.bulk-mutator.mutations.retried |
The number of individual mutations retried over time. |
Bigtable table metrics
Converting Bigtable objects to HBase objects can add to the
latency of a request. The following timers can be correlated with the specified
*.operation.latency
timers to measure the cost of the conversion.
Type | Name | Description |
---|---|---|
Timer | google-cloud-bigtable.table.put.latency |
The length of time that individual Put operations take.
Correlates with
google-cloud-bigtable.grpc.method.MutateRow.operation.latency . |
Timer | google-cloud-bigtable.table.get.latency |
The length of time that individual Get operations take.
Correlates with
google-cloud-bigtable.grpc.method.ReadRows.operation.latency . |
Example: Metrics for a Put
request
When client-side metrics are enabled, the following metrics are collected for a
successful Put
request that is not retried:
- Counter:
google-cloud-bigtable.grpc.rpc.active
- Meter:
google-cloud-bigtable.grpc.rpc.performed
- Timer:
google-cloud-bigtable.grpc.method.MutateRow.operation.latency
- Timer:
google-cloud-bigtable.table.put.latency
Collecting these metrics adds approximately 1 microsecond (1/1000 of a
millisecond) to the Put
operation. The overall Put
operation could be as
fast as 2 to 3 milliseconds, assuming that the operation includes about 1 KB of
data.