The Machine Learning (ML) Service provides a common runtime for evaluating machine learning models on device. The service wraps the TensorFlow Lite runtime and supplies the infrastructure needed to deploy trained models. Chromium communicates with ML Service via a Mojo interface.
Trained models must first be provided to ML Service by following these instructions. The model can then be loaded and used from Chromium via the client library provided at //chromeos/services/machine_learning/public/cpp/.
The following metrics are currently recorded by the daemon process to understand its resource costs in the wild:
TODO(alanlxl): Add further metrics, ideally per-model, to understand the performance levels that clients are experiencing: