Pre-configured Models
Overview
Jan provides various pre-configured AI models with different capabilities. Please see the following list for details.
| Model | Description | 
|---|---|
| Mistral Instruct 7B Q4 | A model designed for a comprehensive understanding through training on extensive internet data | 
| OpenHermes Neural 7B Q4 | A merged model using the TIES method. It performs well in various benchmarks | 
| Stealth 7B Q4 | This is a new experimental family designed to enhance Mathematical and Logical abilities | 
| Trinity-v1.2 7B Q4 | An experimental model merge using the Slerp method | 
| Openchat-3.5 7B Q4 | An open-source model that has a performance that surpasses that of ChatGPT-3.5 and Grok-1 across various benchmarks | 
| Wizard Coder Python 13B Q5 | A Python coding model that demonstrates high proficiency in specific domains like coding and mathematics | 
| OpenAI GPT 3.5 Turbo | The latest GPT-3.5 Turbo model with higher accuracy at responding in requested formats and a fix for a bug that caused a text encoding issue for non-English language function calls | 
| OpenAI GPT 3.5 Turbo 16k 0613 | A Snapshot model of gpt-3.5-16k-turbo from June 13th 2023 | 
| OpenAI GPT 4 | The latest GPT-4 model intended to reduce cases of “laziness” where the model doesn't complete a task | 
| TinyLlama Chat 1.1B Q4 | A tiny model with only 1.1B. It's a good model for less powerful computers | 
| Deepseek Coder 1.3B Q8 | A model that excelled in project-level code completion with advanced capabilities across multiple programming languages | 
| Phi-2 3B Q8 | a 2.7B model, excelling in common sense and logical reasoning benchmarks, trained with synthetic texts and filtered websites | 
| Llama 2 Chat 7B Q4 | A model that is specifically designed for a comprehensive understanding through training on extensive internet data | 
| CodeNinja 7B Q4 | A model that is good for coding tasks and can handle various languages, including Python, C, C++, Rust, Java, JavaScript, and more | 
| Noromaid 7B Q5 | A model designed for role-playing with human-like behavior. | 
| Starling alpha 7B Q4 | An upgrade of Openchat 3.5 using RLAIF, is good at various benchmarks, especially with GPT-4 judging its performance | 
| Yarn Mistral 7B Q4 | A language model for long context and supports a 128k token context window | 
| LlaVa 1.5 7B Q5 K | A model can bring vision understanding to Jan | 
| BakLlava 1 | A model can bring vision understanding to Jan | 
| Solar Slerp 10.7B Q4 | A model that uses the Slerp merge method from SOLAR Instruct and Pandora-v1 | 
| LlaVa 1.5 13B Q5 K | A model can bring vision understanding to Jan | 
| Deepseek Coder 33B Q5 | A model that excelled in project-level code completion with advanced capabilities across multiple programming languages | 
| Phind 34B Q5 | A multi-lingual model that is fine-tuned on 1.5B tokens of high-quality programming data, excels in various programming languages, and is designed to be steerable and user-friendly | 
| Yi 34B Q5 | A specialized chat model is known for its diverse and creative responses and excels across various NLP tasks and benchmarks | 
| Capybara 200k 34B Q5 | A long context length model that supports 200K tokens | 
| Dolphin 8x7B Q4 | An uncensored model built on Mixtral-8x7b and it is good at programming tasks | 
| Mixtral 8x7B Instruct Q4 | A pre-trained generative Sparse Mixture of Experts, which outperforms 70B models on most benchmarks | 
| Tulu 2 70B Q4 | A strong model alternative to Llama 2 70b Chat to act as helpful assistants | 
| Llama 2 Chat 70B Q4 | A model that is specifically designed for a comprehensive understanding through training on extensive internet data | 
note
OpenAI GPT models require a subscription to use them further. To learn more, click here.
Model details
| Model | Author | Model ID | Format | Size | 
|---|---|---|---|---|
| Mistral Instruct 7B Q4 | MistralAI, The Bloke | mistral-ins-7b-q4 | GGUF | 4.07GB | 
| OpenHermes Neural 7B Q4 | Intel, Jan | openhermes-neural-7b | GGUF | 4.07GB | 
| Stealth 7B Q4 | Jan | stealth-v1.2-7b | GGUF | 4.07GB | 
| Trinity-v1.2 7B Q4 | Jan | trinity-v1.2-7b | GGUF | 4.07GB | 
| Openchat-3.5 7B Q4 | Openchat | openchat-3.5-7b | GGUF | 4.07GB | 
| Wizard Coder Python 13B Q5 | WizardLM, The Bloke | wizardcoder-13b | GGUF | 7.33GB | 
| OpenAI GPT 3.5 Turbo | OpenAI | gpt-3.5-turbo | GGUF | - | 
| OpenAI GPT 3.5 Turbo 16k 0613 | OpenAI | gpt-3.5-turbo-16k-0613 | GGUF | - | 
| OpenAI GPT 4 | OpenAI | gpt-4 | GGUF | - | 
| TinyLlama Chat 1.1B Q4 | TinyLlama | tinyllama-1.1b | GGUF | 638.01MB | 
| Deepseek Coder 1.3B Q8 | Deepseek, The Bloke | deepseek-coder-1.3b | GGUF | 1.33GB | 
| Phi-2 3B Q8 | Microsoft | phi-2-3b | GGUF | 2.76GB | 
| Llama 2 Chat 7B Q4 | MetaAI, The Bloke | llama2-chat-7b-q4 | GGUF | 3.80GB | 
| CodeNinja 7B Q4 | Beowolx | codeninja-1.0-7b | GGUF | 4.07GB | 
| Noromaid 7B Q5 | NeverSleep | noromaid-7b | GGUF | 4.07GB | 
| Starling alpha 7B Q4 | Berkeley-nest, The Bloke | starling-7b | GGUF | 4.07GB | 
| Yarn Mistral 7B Q4 | NousResearch, The Bloke | yarn-mistral-7b | GGUF | 4.07GB | 
| LlaVa 1.5 7B Q5 K | Mys | llava-1.5-7b-q5 | GGUF | 5.03GB | 
| BakLlava 1 | Mys | bakllava-1 | GGUF | 5.36GB |