from release zip 0.10.2
Find a file Use this template
Robin L. M. Cheung, MBA 508d5d6215
Some checks failed
Build LiteRT-LM / build-linux-x86_64 (push) Failing after 7m45s
Build LiteRT-LM / build-windows-x86_64 (push) Failing after 14m55s
Build LiteRT-LM / build-android-arm64 (push) Failing after 15m4s
Fix runs-on labels: remove self-hosted, match registered runner labels
2026-04-21 03:23:11 -04:00
.claude-code-history Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.forgejo/workflows Fix runs-on labels: remove self-hosted, match registered runner labels 2026-04-21 03:23:11 -04:00
.github/workflows Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
build_config Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
c Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
cmake Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
cxxbridge_cmd Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
docs Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
kotlin Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
prebuilt Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
python Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
runtime Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
rust Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
schema Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
scripts Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
src Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
tools/test Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.bazeliskrc Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.bazelrc Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.bazelversion Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.gitattributes Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
.gitignore Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
__init__.py Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
android_ndk_env.bzl Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.antlr4 Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.llguidance Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.miniaudio Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.minizip Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.minja Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.nanobind_json Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.sentencepiece Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.stb Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD.tokenizers_cpp Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
BUILD_SYSTEM.pdf Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
cargo-bazel-lock.json Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
Cargo.lock Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
Cargo.toml Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
CMakeLists.txt Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
CMakePresets.json Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
CONTRIBUTING.md Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
LICENSE Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance_grammar Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance_numeric Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance_parser Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance_perf Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.llguidance_regexvec Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.minja Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.nanobind_json Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.rules_rust Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.sentencepiece Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.tensorflow Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
PATCH.toktrie Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
README.md Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
requirements.txt Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
rust_cxx_bridge.bzl Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
version.bzl Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00
WORKSPACE Initial import: LiteRT-LM 0.10.2 2026-04-20 19:52:23 -04:00

LiteRT-LM

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

🔗 Product Website

🔥 What's New: Gemma 4 support with LiteRT-LM

Deploy Gemma 4 across a broad range of hardware with stellar performance (blog).

👉 Try on Linux, macOS, Windows (WSL) or Raspberry Pi with the LiteRT-LM CLI:

litert-lm run  \
   --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
   gemma-4-E2B-it.litertlm \
   --prompt="What is the capital of France?"

🌟 Key Features

  • 📱 Cross-Platform Support: Android, iOS, Web, Desktop, and IoT (e.g. Raspberry Pi).
  • 🚀 Hardware Acceleration: Peak performance via GPU and NPU accelerators.
  • 👁️ Multi-Modality: Support for vision and audio inputs.
  • 🔧 Tool Use: Function calling support for agentic workflows.
  • 📚 Broad Model Support: Gemma, Llama, Phi-4, Qwen, and more.


🚀 Production-Ready for Google's Products

LiteRT-LM powers on-device GenAI experiences in Chrome, Chromebook Plus, Pixel Watch, and more.

You can also try the Google AI Edge Gallery app to run models immediately on your device.

Install the app today from Google Play Install the app today from App Store
Get it on Google Play Download on the App Store

📰 Blogs & Announcements

Link Description
Bring state-of-the-art agentic skills to the edge with Gemma 4 Deploy Gemma 4 in-app and across a broader range of devices with stellar performance and broad reach using LiteRT-LM.
On-device GenAI in Chrome, Chromebook Plus and Pixel Watch Deploy language models on wearables and browser-based platforms using LiteRT-LM at scale.
On-device Function Calling in Google AI Edge Gallery Explore how to fine-tune FunctionGemma and enable function calling capabilities powered by LiteRT-LM Tool Use APIs.
Google AI Edge small language models, multimodality, and function calling Latest insights on RAG, multimodality, and function calling for edge language models.

🏃 Quick Start

Quick Try (No Code)

Try LiteRT-LM immediately from your terminal without writing a single line of code using uv:

uv tool install litert-lm

litert-lm run \
  --from-huggingface-repo=google/gemma-3n-E2B-it-litert-lm \
  gemma-3n-E2B-it-int4 \
  --prompt="What is the capital of France?"

📚 Supported Language APIs

Ready to get started? Explore our language-specific guides and setup instructions.

Language Status Best For... Documentation
Kotlin Stable Android apps & JVM Android (Kotlin) Guide
Python Stable Prototyping & Scripting Python Guide
C++ Stable High-performance native C++ Guide
Swift 🚀 In Dev Native iOS & macOS (Coming Soon)

🏗️ Build From Source

This guide shows how you can compile LiteRT-LM from source. If you want to build the program from source, you should checkout the stable Latest
Release tag.


📦 Releases

  • v0.10.1: Deploy Gemma 4 with stellar performance (blog) and introduce LiteRT-LM CLI.
  • v0.9.0: Improvements to function calling capabilities, better app performance stability.
  • v0.8.0: Desktop GPU support and Multi-Modality.
  • v0.7.0: NPU acceleration for Gemma models.

For a full list of releases, see GitHub Releases.