🦥Introducing Unsloth Studio

Run and train AI models locally with Unsloth Studio.

Today, we’re launching Unsloth Studio (Beta): an open-source, no-code web UI for training, running and exporting open models in one unified local interface.

Quickstart Features Github

Run GGUF and safetensor models locally on Mac, Windows, Linux.
Train 500+ models 2x faster with 70% less VRAM (no accuracy loss)
Run and train text, vision, TTS audio, embedding models

For all the latest updates, see our new changelog page here! ✨

March 27: Unsloth now has a desktop shortcut icon.

Inference is now 20–30% faster and is now calculated correctly.
Auto-detects older or pre-existing models downloaded from Hugging Face, LM Studio etc.
CPU RAM no longer spikes.

March 25: You can now update Unsloth Studio! Please update via: unsloth studio update

Windows CPU or GPU now works seamlessly. Please reinstall!
App shortcuts. Once installed, you can now launch in via a shortcut icon from Desktop.
Pre-compiled llama.cpp binaries - 6x faster installs!
50% reduced installation sizes (-7GB or more savings), 2x faster installs and faster resolving. 50% smaller pypi sizes.
Tool calling improved. Better llama.cpp parsing, no raw tool markup in chat, faster inference, a new Tool Outputs panel, timers.
MacOS and CPU now have Data Recipes enabled with multi-file uploading.

MacOS and CPU work for Chat GGUF inference and Data Recipes. MLX training coming soon.
No dataset needed. Auto-create datasets from PDF, CSV, JSON, DOCX, TXT files.
Export or save your model to GGUF, 16-bit safetensor etc.
Self-healing tool calling / web search + code execution
Auto inference parameter tuning and edit chat templates.

⭐ Features

Run models locally

Search and run GGUF and safetensor models with self-healing tool calling / web search, auto inference parameter tuning, code execution (Bash + Python), APIs (very soon). Upload images, docs, audio, code.

Battle models side by side. Powered by llama.cpp + Hugging Face, we support multi-GPU inference and most models.

Execute code + heal Tool calling

Unsloth Studio lets LLMs run Bash and Python, not just JavaScript. It also sandboxes programs like Claude Artifacts so models can test code, generate files, and verify answers with real computation.

E.g. Qwen3.5-4B searched 20+ websites and cited sources, with web search happening inside its thinking trace.

No-code training

Upload PDF, CSV, JSON docs, or YAML configs and start training instantly on NVIDIA. Unsloth’s kernels optimize LoRA, FP8, FFT, PT across 500+ text, vision, TTS/audio and embedding models.

Fine-tune the latest LLMs like Qwen3.5 and NVIDIA Nemotron 3. Multi-GPU works automatically, with a new version coming.

Data Recipes

Data Recipes transforms your docs into useable / synthetic datasets via graph-node workflow. Upload unstructured or structured files like PDFs, CSV and JSON. Unsloth Data Recipes, powered by NVIDIA Nemo Data Designer, auto turns documents into your desired formats.

Observability

Gain complete visibility into and control over your training runs. Track training loss, gradient norms, and GPU utilization in real time, and customize to your liking.

You can even view the training progress on other devices like your phone.

Export / Save models

Export any model, including your fine-tuned models, to safetensors, or GGUF for use with llama.cpp, vLLM, Ollama, LM Studio, and more.

Stores your training history, so you can revisit runs, export again and experiment.

Model Arena

Chat with and compare 2 different models, such as a base model and a fine-tuned one, to see how their outputs differ.

Just load your first GGUF/model, then the second, and voilà! Inference will firstly load for one model, then the second one.

Privacy first + Secure

Unsloth Studio can be used 100% offline and locally on your computer. Its token-based authentication, including encrypted password and JWT access / refresh flows keeps your data secure.

You can use pre-exisiting / old models or GGUFs that previously downloaded from HF etc. Read instructions here.

Please note this is the BETA version of Unsloth Studio. Expect many improvements, fixes, and new features in the coming days and weeks. One issue we’re actively addressing is precompiled llama.cpp binaries to significantly speed up install times.

⚡ Quickstart

Unsloth Studio works on Windows, Linux, WSL and MacOS (chat only currently).

CPU: Unsloth still works without a GPU, but only for Chat inference and Data Recipes.
Training: Works on NVIDIA: RTX 30, 40, 50, Blackwell, DGX Spark/Station etc. + Intel GPUs
Mac: Like CPU - Chat and Data Recipes only works for now. MLX training coming very soon.
AMD: Chat works. Train with Unsloth Core. Studio support is coming soon.
Coming soon: Training support for Apple MLX and AMD.
Multi-GPU: Works already, with a major upgrade on the way.

Use the same install commands below to update:

MacOS, Linux, WSL:

curl -fsSL https://unsloth.ai/install.sh | sh

Windows PowerShell:

irm https://unsloth.ai/install.ps1 | iex

Launch Unsloth

unsloth studio -H 0.0.0.0 -p 8888

Docker:

Use our official Docker image: unsloth/unsloth which currently works for Windows, WSL and Linux. MacOS support coming soon.

docker run -d -e JUPYTER_PASSWORD="mypassword" \
  -p 8888:8888 -p 8000:8000 -p 2222:22 \
  -v $(pwd)/work:/workspace/work \
  --gpus all \
  unsloth/unsloth

First install should now be 6x faster and with 50% reduced size due to precompiled llama.cpp binaries.

For more details about install and uninstallation please visit the Unsloth Studio Install section.

Installation

Google Colab notebook

We’ve created a free Google Colab notebook so you can explore all of Unsloth’s features on Colab’s T4 GPUs. You can train and run most models up to 22B parameters, and switch to a larger GPU for bigger models. Just Click 'Run all' and the UI should pop up after installation.

Google Colabcolab.research.google.com

Once installation is complete, scroll to Start Unsloth Studio and click Open Unsloth Studio in the white box shown on the left:

Scroll further down, to see the actual UI.

We now precompile llama.cpp binaries for much faster install speeds.

Sometimes the Studio link may return an error. This happens because you might be using an adblocker or Mozilla or Google Colab expects you to stay on the Colab page; if it detects inactivity, it may shut down the GPU session. Nevertheless, you can scroll down a bit

Workflow

Here is a usual workflow of Unsloth Studio to get you started:

Launch Studio from install instructions.
Load a model from local files or a supported integration.
Import training data from PDFs, CSVs, or JSONL files, or build a dataset from scratch.
Clean, refine, and expand your dataset in Data Recipes.
Start training with recommended presets or customize the config yourself.
Chat with the trained model and compare its outputs against the base model.
Save or export locally to the stack you already use.

You can read our individual deep dives into each section of Unsloth Studio:

Get Started Model Export

Data Recipes Studio Chat

Video Tutorials

Here is a video tutorial created by NVIDIA to get you started with Studio:

How to Install Unsloth Studio Video Tutorial

FAQ

Does Unsloth collect or store data? Unsloth does not collect usage telemetry. Unsloth only collects the minimal hardware information required for compatibility, such as GPU type and device (e.g. Mac). Unsloth Studio runs 100% offline and locally.

How do I use an old / exisiting model that I downloaded previously from Hugging Face? Yes, you can use pre-exisiting/old models or GGUFs that you previously downloaded from Hugging Face etc. They should be now be automatically detected by Unsloth otherwise read our instructions here.

Why is inference sometimes slower in Unsloth? Unsloth, like other local inference apps, are powered by llama.cpp, so speeds should be mostly the same. Sometimes Unsloth might be because you turned on web-search, code execution, self-healing tool-calling on. All these features may make your inference slower. If the speed difference is still slower with all features turned off, please make a GitHub issue!

Does Unsloth Studio support OpenAI-compatible APIs? Yes, for our Data Recipes it does. For inference we are working on this and hope to release support for it as soon as this week so stay tuned!

Is Unsloth now licensed under AGPL-3.0? Unsloth uses a dual-licensing model of Apache 2.0 and AGPL-3.0. The core Unsloth package remains licensed under Apache 2.0, while certain optional components, such as the Unsloth Studio UI are licensed AGPL-3.0.

This structure helps support ongoing Unsloth development while keeping the project open source and enabling the broader ecosystem to continue growing.

Does Studio only support LLMs? No. Studio supports a range of supported transformers compatible model families, including text, multimodal models, text-to-speech, audio, embeddings, and BERT-style models.

Can I use my own training config? Yes. Import a YAML config and Studio will pre-fill the relevant settings.

How can I adjust my context length? Context length adjustment is no longer necessary with llama.cpp’s smart auto context, which uses only the context you need without loading anything extra. However, soon we will still add the feature incase you want to use it.

Do you need to train models to use the UI? No, you can just download any GGUF or model without fine-tuning any model.

Future of Unsloth

We're working hard to make open-source AI as accessible as possible. Coming next for Unsloth and Unsloth Studio, we're releasing official support for: multi-GPU, Apple Silicon/MLX and AMD. Reminder this is the BETA version of Unsloth Studio so expect a lot of announcements and improvements in the coming weeks. We’re also working closely with NVIDIA on multi-GPU support to deliver the best and simplest experience possible.

Acknowledgements

A huge thank you to NVIDIA and Hugging Face for being part of our launch. Also thanks to all of our early beta testers for Unsloth Studio, we truly appreciate your time and feedback. We’d also like to thank llama.cpp, PyTorch and open model labs for providing the infrastructure that made Unsloth Studio possible.

PreviousDPO, ORPO, KTO NextGet Started

Last updated 1 day ago

Was this helpful?

hashtag⭐ Features

hashtagRun models locally

hashtagExecute code + heal Tool calling

hashtagNo-code training

hashtagData Recipes

hashtagObservability

hashtagExport / Save models

hashtagModel Arena

hashtagPrivacy first + Secure

hashtag⚡ Quickstart

hashtagMacOS, Linux, WSL:

hashtagWindows PowerShell:

hashtagLaunch Unsloth

hashtagDocker:

hashtaggoogle Google Colab notebook

hashtagseedling Workflow

hashtagvideo Video Tutorials

hashtagcomments-question FAQ

hashtagFuture of Unsloth

hashtagAcknowledgements

⭐ Features

Run models locally

Execute code + heal Tool calling

No-code training

Data Recipes

Observability

Export / Save models

Model Arena

Privacy first + Secure

⚡ Quickstart

MacOS, Linux, WSL:

Windows PowerShell:

Launch Unsloth

Docker:

Google Colab notebook

Workflow

Video Tutorials

FAQ

Future of Unsloth

Acknowledgements