This guide covers the UI-based configuration. You can also manage statistics configuration programmatically through the API or the TypeScript SDK.
Prerequisites
- A dataset you own or have write access to
- The dataset must be on an active data plane
Opening the configuration panel
Open the actions menu
Click the Actions button on the dataset details page to open the actions dropdown.
Setting default statistics
The Defaults section controls which statistics are computed for all eligible fields across the dataset. Select or deselect individual statistics, or use the category-level and global toggles to select groups at once. Available statistics are organized into five categories:| Category | Statistics |
|---|---|
| Counts | Value count, Null value count, NaN value count |
| Bounds | Lower bound, Upper bound |
| Distinctness | Approx count distinct, Count distinct |
| Distribution | Histogram, Mean, Standard deviation |
| Quality | Completeness |
double and long). The UI automatically filters out incompatible statistics when you configure per-field overrides. See the type compatibility matrix for the full mapping.
Histogram options
When you enable Histogram, additional options appear:| Option | Description | Default |
|---|---|---|
| Max bins | Maximum number of histogram buckets (2–100,000) | 50 |
| Overflow | How to handle values exceeding the bin count — none or truncate | none |
Setting a refresh schedule
The Refresh Trigger section controls when statistics are recomputed. Choose one of three options:| Trigger | Behavior |
|---|---|
| Manual | Statistics are only recomputed when you manually trigger an update from the actions menu |
| On update | Statistics are automatically recomputed whenever new data is ingested into the dataset |
| Cron | Statistics are recomputed on a UTC cron schedule (e.g., 0 0 * * * for daily at midnight) |
Adding field overrides
By default, all eligible fields inherit the statistics you set in the Defaults section. You can override this at two levels:Namespace scope overrides
Expand the Dataset Field Overrides or Rosetta Stone Overrides section and enable the scope override toggle. This lets you set different default statistics for all fields within that namespace, overriding the global defaults.Per-field overrides
Within each namespace section, click Add Field Override to configure statistics for a specific column or attribute. Each override row lets you:- Select a field from the dropdown
- Choose which statistics to compute for that field
- Optionally configure histogram options specific to that field
For non-primitive fields (objects and arrays), only Null value count is available as a self-statistic. Statistics on nested child fields within objects and arrays are not yet configurable from the UI.
value_count and histogram, select just those two — it won’t also inherit the defaults. For details on how inheritance works, see the inheritance model in the reference.
Saving and deleting
- Click Save to create or update the configuration. A confirmation toast appears on success.
- In edit mode, click Delete Configuration to remove the configuration entirely. This stops automatic statistics computation for the dataset.
Manually triggering a statistics update
Separately from the configuration, you can trigger an immediate one-off recomputation. From the dataset actions menu under the Stats group, click Update statistics. This runs thedatasets_calculate_column_stats job regardless of the refresh trigger setting.
Related content
Dataset Statistics
Why statistics matter and where they appear in the platform
Dataset Statistics Reference
Full API reference, configuration schema, and validation errors
Managing Datasets (SDK)
Manage datasets programmatically including reading statistics
Job Types
Job types including statistics computation

