
Copy
cargo run --release -p cli -- help
Copy
Usage: uzu_cli [COMMAND]
Commands:
run Run a model with the specified path
serve Start a server with the specified model path
help Print this message or the help of the given subcommand(s)
To use the CLI, you’ll need a converted model. Check the lalamo overview to see how to handle model conversion.
We’re currently working on a new version that won’t require you to manually convert models, stay tuned.
Run
Run command allows you to send messages to the model in interactive mode:Copy
cargo run --release -p cli -- run $MODEL_PATH
Serve
Serve command starts a local server with an OpenAI-like API:Copy
cargo run --release -p cli -- serve $MODEL_PATH
Copy
curl --location 'http://127.0.0.1:8000/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
"messages": [
{
"role": "system",
"content": "You are a helpful assistant"
},
{
"role": "user",
"content": "Tell about London"
}
],
"max_completion_tokens": 256
}'
Copy
{
"id": "421a6b4b-be08-478c-b443-5cc7cef46f5c",
"model": "Llama-3.2-1B-Instruct",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "London! The vibrant, historic, and cosmopolitan capital of England. London is a city that seamlessly blends tradition with modernity, offering a unique blend of culture, entertainment, and excitement. Here's a comprehensive overview:\n\n**History and Architecture**\n\nLondon has a rich history dating back to the Roman Empire. The city has been ruled by various empires, including the Romans, Anglo-Saxons, and Normans. The city's iconic landmarks, such as Buckingham Palace, the Tower of London, and Big Ben, are testaments to its storied past. The city's architecture is a mix of medieval, Gothic, and Victorian styles, with many buildings dating back to the 13th century.\n\n**Must-Visit Attractions**\n\n1. **Buckingham Palace**: The official residence of the British monarch, Buckingham Palace is a must-visit attraction. Watch the Changing of the Guard ceremony, which takes place daily at 11:30 am from April to July and on alternate days the rest of the year.\n2. **The British Museum**: One of the world's greatest museums, the British Museum houses a vast collection of artifacts from ancient civilizations, including the Rosetta Stone, the Elgin Marbles, and the mummies in the Ancient Egypt gallery.\n3"
},
"finish_reason": "length"
}
],
"stats": {
"prefill_stats": {
"duration": 0.282355625,
"suffix_length": 8,
"tokens_count": 1,
"tokens_per_second": 3.541633002707136,
"model_run": {
"count": 6,
"average_duration": 0.047014152499999996
},
"run": null
},
"generate_stats": {
"duration": 7.3130893389999985,
"suffix_length": 1,
"tokens_count": 255,
"tokens_per_second": 34.868984662898846,
"model_run": {
"count": 255,
"average_duration": 0.028573350874509806
},
"run": {
"count": 255,
"average_duration": 0.02867878172156862
}
},
"total_stats": {
"duration": 7.6461285839999995,
"tokens_count_input": 42,
"tokens_count_output": 256
}
}
}