Template

from release zip 0.10.2

Find a file Use this template

Robin L. M. Cheung, MBA 508d5d6215 Some checks failed Build LiteRT-LM / build-linux-x86_64 (push) Failing after 7m45s Details Build LiteRT-LM / build-windows-x86_64 (push) Failing after 14m55s Details Build LiteRT-LM / build-android-arm64 (push) Failing after 15m4s Details Fix runs-on labels: remove self-hosted, match registered runner labels		2026-04-21 03:23:11 -04:00
.claude-code-history	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.forgejo/workflows	Fix runs-on labels: remove self-hosted, match registered runner labels	2026-04-21 03:23:11 -04:00
.github/workflows	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
build_config	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
c	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
cmake	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
cxxbridge_cmd	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
docs	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
kotlin	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
prebuilt	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
python	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
runtime	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
rust	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
schema	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
scripts	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
src	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
tools/test	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.bazeliskrc	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.bazelrc	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.bazelversion	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.gitattributes	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
.gitignore	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
__init__.py	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
android_ndk_env.bzl	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.antlr4	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.llguidance	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.miniaudio	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.minizip	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.minja	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.nanobind_json	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.sentencepiece	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.stb	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD.tokenizers_cpp	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
BUILD_SYSTEM.pdf	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
cargo-bazel-lock.json	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
Cargo.lock	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
Cargo.toml	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
CMakeLists.txt	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
CMakePresets.json	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
CONTRIBUTING.md	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
LICENSE	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance_grammar	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance_numeric	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance_parser	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance_perf	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.llguidance_regexvec	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.minja	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.nanobind_json	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.rules_rust	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.sentencepiece	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.tensorflow	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
PATCH.toktrie	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
README.md	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
requirements.txt	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
rust_cxx_bridge.bzl	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
version.bzl	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00
WORKSPACE	Initial import: LiteRT-LM 0.10.2	2026-04-20 19:52:23 -04:00

README.md

LiteRT-LM

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

🔗 Product Website

🔥 What's New: Gemma 4 support with LiteRT-LM

Deploy Gemma 4 across a broad range of hardware with stellar performance (blog).

👉 Try on Linux, macOS, Windows (WSL) or Raspberry Pi with the LiteRT-LM CLI:

litert-lm run  \
   --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
   gemma-4-E2B-it.litertlm \
   --prompt="What is the capital of France?"

🌟 Key Features

📱 Cross-Platform Support: Android, iOS, Web, Desktop, and IoT (e.g. Raspberry Pi).
🚀 Hardware Acceleration: Peak performance via GPU and NPU accelerators.
👁️ Multi-Modality: Support for vision and audio inputs.
🔧 Tool Use: Function calling support for agentic workflows.
📚 Broad Model Support: Gemma, Llama, Phi-4, Qwen, and more.

🚀 Production-Ready for Google's Products

LiteRT-LM powers on-device GenAI experiences in Chrome, Chromebook Plus, Pixel Watch, and more.

You can also try the Google AI Edge Gallery app to run models immediately on your device.

Install the app today from Google Play	Install the app today from App Store

📰 Blogs & Announcements

Link	Description
Bring state-of-the-art agentic skills to the edge with Gemma 4	Deploy Gemma 4 in-app and across a broader range of devices with stellar performance and broad reach using LiteRT-LM.
On-device GenAI in Chrome, Chromebook Plus and Pixel Watch	Deploy language models on wearables and browser-based platforms using LiteRT-LM at scale.
On-device Function Calling in Google AI Edge Gallery	Explore how to fine-tune FunctionGemma and enable function calling capabilities powered by LiteRT-LM Tool Use APIs.
Google AI Edge small language models, multimodality, and function calling	Latest insights on RAG, multimodality, and function calling for edge language models.

🏃 Quick Start

🔗 Key Links

👉 Technical Overview including performance benchmarks, model support, and more.
👉 LiteRT-LM CLI Guide including installation, getting started, and advanced usage.

⚡ Quick Try (No Code)

Try LiteRT-LM immediately from your terminal without writing a single line of code using uv:

uv tool install litert-lm

litert-lm run \
  --from-huggingface-repo=google/gemma-3n-E2B-it-litert-lm \
  gemma-3n-E2B-it-int4 \
  --prompt="What is the capital of France?"

📚 Supported Language APIs

Ready to get started? Explore our language-specific guides and setup instructions.

Language	Status	Best For...	Documentation
Kotlin	✅ Stable	Android apps & JVM	Android (Kotlin) Guide
Python	✅ Stable	Prototyping & Scripting	Python Guide
C++	✅ Stable	High-performance native	C++ Guide
Swift	🚀 In Dev	Native iOS & macOS	(Coming Soon)

🏗️ Build From Source

This guide shows how you can compile LiteRT-LM from source. If you want to build the program from source, you should checkout the stable tag.

📦 Releases

v0.10.1: Deploy Gemma 4 with stellar performance (blog) and introduce LiteRT-LM CLI.
v0.9.0: Improvements to function calling capabilities, better app performance stability.
v0.8.0: Desktop GPU support and Multi-Modality.
v0.7.0: NPU acceleration for Gemma models.

For a full list of releases, see GitHub Releases.