Skip to main content
This page is a lookup reference for dataset statistics—the column-level metrics the platform computes for each dataset. For an explanation of why statistics matter and how they work, see Dataset Statistics.

Statistics reference

Each statistic is computed per column. The computation column shows the underlying SQL-level operation.
StatisticDescriptionComputation
valueCountTotal number of non-null values in the columnCOUNT(column)
nullValueCountNumber of null values in the columnCOUNT(*) - COUNT(column)
nanValueCountNumber of NaN (not-a-number) values—floating-point columns onlyCOUNT_IF(IS_NAN(column))
approxCountDistinctApproximate number of distinct values, using HyperLogLog for efficiency on large datasetsAPPROX_COUNT_DISTINCT(column)
countDistinctExact number of distinct valuesCOUNT(DISTINCT column)
lowerBoundMinimum value in the columnMIN(column)
upperBoundMaximum value in the columnMAX(column)
histogramFrequency distribution of values across distinct bucketsAggregation over value frequencies
meanArithmetic mean—numeric columns onlyAVG(column)
standardDeviationPopulation standard deviation—numeric columns onlySTDDEV(column)
columnStoredBytesBytes of storage consumed by the columnStorage metadata lookup
completenessRatio of non-null values to total rows (0 to 1)COUNT(column) / COUNT(*)

Type compatibility matrix

Not all statistics apply to all data types. The table below shows which statistics are computed for each type.
TypeSupported statistics
DoubleTypeAll 12: valueCount, nullValueCount, nanValueCount, approxCountDistinct, countDistinct, lowerBound, upperBound, histogram, mean, standardDeviation, columnStoredBytes, completeness
LongType11 (all except nanValueCount): valueCount, nullValueCount, approxCountDistinct, countDistinct, lowerBound, upperBound, histogram, mean, standardDeviation, columnStoredBytes, completeness
StringType9: valueCount, nullValueCount, approxCountDistinct, countDistinct, lowerBound, upperBound, histogram, columnStoredBytes, completeness
BooleanType9: valueCount, nullValueCount, approxCountDistinct, countDistinct, lowerBound, upperBound, histogram, columnStoredBytes, completeness
TimestampTzType9: valueCount, nullValueCount, approxCountDistinct, countDistinct, lowerBound, upperBound, histogram, columnStoredBytes, completeness
ArrayType4: valueCount, nullValueCount, columnStoredBytes, completeness
ObjectType4: valueCount, nullValueCount, columnStoredBytes, completeness
DoubleType is the only type that supports nanValueCount, since NaN is a floating-point concept. mean and standardDeviation are limited to numeric types (DoubleType and LongType).