Basics¶

The AgentCheck base class contains the logic that all Checks inherit.

In addition to the integrations themselves, other classes also inherit from AgentCheck.

Getting Started¶

The Datadog Agent looks for __version__ and a subclass of AgentCheck at the root of every Check package.

Below is an example of the __init__.py file for a hypothetical Awesome Check:

from .__about__ import __version__
from .check import AwesomeCheck

__all__ = ['__version__', 'AwesomeCheck']

The version is used in the Agent's status output (if no __version__ is found, it will default to 0.0.0):

=========
Collector
=========

  Running Checks
  ============== 

    AwesomeCheck (0.0.1)
    -------------------
      Instance ID: 1234 [OK]
      Configuration Source: file:/etc/datadog-agent/conf.d/awesomecheck.d/awesomecheck.yaml
      Total Runs: 12
      Metric Samples: Last Run: 242, Total: 2,904
      Events: Last Run: 0, Total: 0
      Service Checks: Last Run: 0, Total: 0
      Average Execution Time : 49ms
      Last Execution Date : 2020-10-26 19:09:22.000000 UTC
      Last Successful Execution Date : 2020-10-26 19:09:22.000000 UTC

...
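The version fallback described above can be pictured with a short sketch. This is not the Agent's actual loading code: `check_version` is a hypothetical helper, and the two module objects are stand-ins for real Check packages.

```python
import types


def check_version(package):
    """Return the package's __version__, or '0.0.0' when it defines none."""
    return getattr(package, '__version__', '0.0.0')


# Stand-ins for two Check packages (hypothetical names).
awesome = types.ModuleType('awesome')
awesome.__version__ = '0.0.1'
legacy = types.ModuleType('legacy')  # defines no __version__

print(check_version(awesome))
print(check_version(legacy))
```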

Checks¶

AgentCheck contains functions that you use to execute Checks and submit data to Datadog.

Metrics¶

Metrics are what each integration collects from your system. For more information on metrics, see the Metric Types documentation. Each integration lists its metrics in its metadata.csv file. You can also set up custom metrics, so if an integration doesn’t offer a metric out of the box, you can usually add it.

Gauge¶

The gauge metric submission type represents a snapshot of events in one time interval. This representative snapshot value is the last value submitted to the Agent during a time interval. A gauge can be used to take a measure of something reporting continuously—like the available disk space or memory used.

For more information, see the API documentation.
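As a rough illustration of the "last value wins" behavior, the sketch below models one time interval in plain Python. `flush_gauge` is a hypothetical helper, not part of the Agent; a real check would submit each sample with `self.gauge(...)`.

```python
def flush_gauge(samples):
    """Return what a gauge reports for one interval: the last sample, if any."""
    return samples[-1] if samples else None


# Free disk space (GiB) sampled three times within one interval.
disk_free = [71.5, 70.2, 68.9]
print(flush_gauge(disk_free))  # only the final sample survives the interval
```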

Count¶

The count metric submission type represents the total number of event occurrences in one time interval. A count can be used to track the total number of connections made to a database or the total number of requests to an endpoint. This number of events can increase or decrease over time—it is not monotonically increasing.

For more information, see the API documentation.
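A count adds up everything submitted during the interval instead of keeping only the last value. The sketch below is a simplified model with a hypothetical `flush_count` helper; it is not the Agent's aggregation code.

```python
def flush_count(samples):
    """Return what a count reports for one interval: the sum of all samples."""
    return sum(samples)


# Requests observed by three submissions within the same interval.
requests_seen = [3, 5, 4]
print(flush_count(requests_seen))

# Individual submissions may be negative, so the total can also go down.
print(flush_count([3, -1]))
```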

Monotonic Count¶

Similar to Count, Monotonic Count represents the total number of event occurrences in one time interval. However, the tracked value can only increase.

For more information, see the API documentation.
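With a monotonic count, a check typically submits a raw, ever-growing counter, and the increase between consecutive samples is what gets reported. The sketch below models that idea under the assumption that a decrease is treated as a counter reset and contributes nothing. `flush_monotonic_count` is a hypothetical helper, not the Agent's implementation.

```python
def flush_monotonic_count(samples, previous=None):
    """Sum the increases between consecutive raw counter samples.

    Returns (total_increase, last_sample) so the caller can carry the last
    sample over into the next interval.
    """
    total = 0
    for value in samples:
        # Assumption: a drop means the counter was reset, so it is skipped.
        if previous is not None and value >= previous:
            total += value - previous
        previous = value
    return total, previous


first_total, last_seen = flush_monotonic_count([2, 3, 6, 7])
second_total, last_seen = flush_monotonic_count([10, 11], previous=last_seen)
print(first_total, second_total)
```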

Rate¶

The rate metric submission type represents the total number of event occurrences per second in one time interval. A rate can be used to track how often something is happening—like the frequency of connections made to a database or the flow of requests made to an endpoint.

For more information, see the API documentation.
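A rate boils down to a change in value divided by the elapsed time. The sketch below shows that arithmetic for two consecutive samples; `rate_per_second` and its parameters are hypothetical names chosen for illustration, not Agent functions.

```python
def rate_per_second(previous_value, previous_ts, value, ts):
    """Return the per-second change between two timestamped samples."""
    elapsed = ts - previous_ts
    if elapsed <= 0:
        return None  # no meaningful rate without elapsed time
    return (value - previous_value) / elapsed


# A connection counter read twice, 15 seconds apart.
connections_rate = rate_per_second(1000, 0.0, 1300, 15.0)
print(connections_rate)
```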

Histogram¶

The histogram metric submission type represents the statistical distribution of a set of values calculated Agent-side in one time interval. Datadog’s histogram metric type is an extension of the StatsD timing metric type: the Agent aggregates the values that are sent in a defined time interval and produces different metrics which represent the set of values.

For more information, see the API documentation.
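To make "different metrics which represent the set of values" more concrete, the sketch below derives a few summary values from the samples of one interval. The set of aggregates the Agent actually produces is configurable, and its exact calculations may differ; `summarize` and the four keys shown here are only illustrative.

```python
import statistics


def summarize(values):
    """Summarize one interval's samples the way a histogram might."""
    return {
        'max': max(values),
        'median': statistics.median(values),
        'avg': sum(values) / len(values),
        'count': len(values),
    }


# Request latencies (ms) submitted during one interval.
latencies_ms = [10, 20, 30, 40, 100]
summary = summarize(latencies_ms)
print(summary)
```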

Historate¶

Similar to the histogram metric, the historate represents a statistical distribution over one time interval, but it is computed from rate metrics.

For more information, see the API documentation.

Service Checks¶

Service checks are a type of monitor used to track the uptime status of a service. For more information, see the Service checks guide.

For more information, see the API documentation.
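A service check carries one of four statuses, exposed on the base class as `AgentCheck.OK`, `AgentCheck.WARNING`, `AgentCheck.CRITICAL`, and `AgentCheck.UNKNOWN`. The sketch below mirrors those values as plain integers and picks a status from a response time. The `status_for` helper and its thresholds are hypothetical; in a real check the result would be passed to `self.service_check(...)`.

```python
# Plain-integer mirror of AgentCheck.OK, WARNING, CRITICAL and UNKNOWN.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def status_for(response_seconds, warn_after=1.0, critical_after=5.0):
    """Map a response time to a service check status (illustrative thresholds)."""
    if response_seconds is None:
        return UNKNOWN  # nothing was measured
    if response_seconds >= critical_after:
        return CRITICAL
    if response_seconds >= warn_after:
        return WARNING
    return OK


print(status_for(0.2), status_for(2.5), status_for(9.0), status_for(None))
```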

Events¶

Events are informational messages about your system that are consumed by the events stream so that you can build monitors on them.

For more information, see the API documentation.
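An event is submitted as a dictionary. The sketch below builds one with a commonly used subset of keys; treat the exact key set as an assumption and confirm it against the API documentation. `build_event` and the `awesome` source name are hypothetical, and a real check would pass the result to `self.event(...)`.

```python
import time


def build_event(title, text, alert_type='info', tags=None):
    """Assemble an event payload as a plain dictionary."""
    return {
        'timestamp': int(time.time()),
        'event_type': 'awesome',        # hypothetical event type
        'source_type_name': 'awesome',  # hypothetical source name
        'msg_title': title,
        'msg_text': text,
        'alert_type': alert_type,       # e.g. 'info', 'warning', 'error', 'success'
        'tags': list(tags or []),
    }


event = build_event(
    'Replica lag detected',
    'Replica db-2 is behind the primary.',
    alert_type='warning',
    tags=['foo:bar'],
)
print(event['msg_title'], event['alert_type'], event['tags'])
```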

Namespacing¶

Within every integration, you can specify the value of __NAMESPACE__:

from datadog_checks.base import AgentCheck


class AwesomeCheck(AgentCheck):
    __NAMESPACE__ = 'awesome'

...

This is an optional addition, but it makes submissions easier since it prefixes every metric with the __NAMESPACE__ automatically. In this case it would prepend awesome. to each metric submitted to Datadog.

If you wish to ignore the namespace for any reason, you can pass the optional Boolean raw=True to each submission:

self.gauge('test', 1.23, tags=['foo:bar'], raw=True)

...

This submits a gauge metric named test with a value of 1.23, tagged with foo:bar, ignoring the namespace.
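The effect of __NAMESPACE__ and raw=True on the final metric name can be modeled in a few lines. `format_metric_name` is a hypothetical stand-in written for this page, not the base class's own method.

```python
def format_metric_name(name, namespace='', raw=False):
    """Prefix name with the namespace unless raw is set or no namespace exists."""
    if raw or not namespace:
        return name
    return '{}.{}'.format(namespace, name)


print(format_metric_name('test', namespace='awesome'))            # namespaced
print(format_metric_name('test', namespace='awesome', raw=True))  # left as-is
```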

Check Initializations¶

In the AgentCheck class, there is a useful property called check_initializations, which you can use to execute functions that are called once before the first check run. You can populate check_initializations in the __init__ function of an integration. For example, you could use it to parse configuration information before running a check. Below is an example from the Airflow check:

from datadog_checks.base import AgentCheck


class AirflowCheck(AgentCheck):
    def __init__(self, name, init_config, instances):
        super(AirflowCheck, self).__init__(name, init_config, instances)

        self._url = self.instance.get('url', '')
        self._tags = self.instance.get('tags', [])

        # The Agent only makes one attempt to instantiate each AgentCheck, so any errors occurring
        # in `__init__` are logged just once, making them difficult to spot. Therefore,
        # potential configuration errors are emitted as part of the check run phase.
        # The configuration is only parsed once if it succeeds; otherwise it's retried.
        self.check_initializations.append(self._parse_config)

...
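The retry behavior mentioned in the comment above can be sketched with a small stand-in class. `MiniCheck` is hypothetical and only models the idea: an initialization is dropped once it succeeds and retried on the next run if it raises. The real AgentCheck run loop does considerably more and reports errors differently.

```python
from collections import deque


class MiniCheck:
    """Hypothetical stand-in for AgentCheck; models check_initializations only."""

    def __init__(self, instance):
        self.instance = instance
        self.parse_calls = 0
        self.runs = 0
        self.check_initializations = deque([self._parse_config])

    def _parse_config(self):
        self.parse_calls += 1
        if not self.instance.get('url'):
            raise ValueError('url is required')
        self._url = self.instance['url']

    def check(self):
        self.runs += 1

    def run(self):
        """Run pending initializations, then the check; return an error string."""
        while self.check_initializations:
            initialization = self.check_initializations.popleft()
            try:
                initialization()
            except Exception as e:
                # Put it back so the next run retries it.
                self.check_initializations.appendleft(initialization)
                return str(e)
        self.check()
        return ''


check = MiniCheck({})
first = check.run()                  # fails: no url configured yet
check.instance['url'] = 'http://localhost:8080'
second = check.run()                 # parsing succeeds, the check runs
third = check.run()                  # parsing is not repeated
print(repr(first), repr(second), check.parse_calls, check.runs)
```

Because the failure surfaces on every run until the configuration is fixed, it is easier to notice than an error logged once during `__init__`.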

Last update: October 29, 2020