Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Accessible large language models via k-bit quantization for PyTorch.
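For reference, a minimal sketch of the 4-bit (NF4) loading path that bitsandbytes exposes through Transformers, as popularized by the QLoRA paper. The model ID is an arbitrary example; any causal LM works:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# NF4 quantization with double quantization and bf16 compute,
# i.e. the QLoRA-style memory configuration.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "meta-llama/Llama-2-7b-hf"  # example choice, swap in any causal LM
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # requires accelerate; spreads weights over available GPUs
)
```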
Firefly: a training toolkit for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Efficient fine-tuning of ChatGLM-6B with PEFT
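A minimal PEFT LoRA sketch in this spirit; the rank, alpha, and dropout are illustrative defaults, not the repo's actual configuration. ChatGLM fuses its attention projections into a single `query_key_value` module, hence the target below:

```python
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModel

# ChatGLM-6B ships custom modeling code, hence trust_remote_code=True.
model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                       # low-rank dimension
    lora_alpha=32,             # scaling factor
    lora_dropout=0.1,
    target_modules=["query_key_value"],  # ChatGLM's fused attention projection
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # prints trainable vs. total parameter counts
```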
ChatGLM-6B fine-tuning and Alpaca fine-tuning
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
🐋 MindChat (漫谈): a Chinese mental-health large language model, built to talk through life's journey and face its hardships with a smile
🌿 Sunsimiao (孙思邈): a safe, reliable, and accessible Chinese medical large language model
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.
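The skill automates its pipeline end to end, but a hand-rolled LoRA SFT run with TRL looks roughly like this; the model and dataset are placeholder choices for a quick smoke test:

```python
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")  # example dataset

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",  # small model for a quick run; swap as needed
    train_dataset=dataset,
    args=SFTConfig(output_dir="sft-lora-out", max_steps=100),
    peft_config=LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05),
)
trainer.train()
```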
LongQLoRA: Extend the Context Length of LLMs Efficiently
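LongQLoRA pairs QLoRA with position interpolation; the interpolation half can be approximated in plain Transformers via RoPE scaling. A sketch under that assumption, not LongQLoRA's exact method (model ID and scaling factor are arbitrary examples):

```python
from transformers import AutoModelForCausalLM

# Linear RoPE position interpolation: a model pretrained on 4096-token
# contexts reads 8192 tokens by compressing position indices by the factor.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    rope_scaling={"type": "linear", "factor": 2.0},
)
```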
Use QLoRA to tune an LLM in PyTorch Lightning with Hugging Face + MLflow
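A minimal sketch of what such a wrapper can look like, assuming a PEFT-wrapped quantized model and a dataloader are prepared elsewhere; the class and variable names are hypothetical:

```python
import lightning.pytorch as pl
import torch
from lightning.pytorch.loggers import MLFlowLogger


class QLoRAModule(pl.LightningModule):
    """Wraps a PEFT-quantized Hugging Face model for Lightning's training loop."""

    def __init__(self, model, lr=2e-4):
        super().__init__()
        self.model = model
        self.lr = lr

    def training_step(self, batch, batch_idx):
        # Hugging Face causal-LM models return a loss when labels are provided.
        out = self.model(**batch)
        self.log("train_loss", out.loss)
        return out.loss

    def configure_optimizers(self):
        # Optimize only the (non-quantized) LoRA adapter weights.
        params = [p for p in self.model.parameters() if p.requires_grad]
        return torch.optim.AdamW(params, lr=self.lr)


logger = MLFlowLogger(experiment_name="qlora-finetune")  # logs to ./mlruns by default
trainer = pl.Trainer(max_steps=100, precision="bf16-mixed", logger=logger)
# trainer.fit(QLoRAModule(peft_model), train_dataloaders=dataloader)  # both assumed
```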
From dialogue data to a closed training loop: digital twin + model distillation
Open-source framework for turning expert knowledge into PII-free synthetic conversational data and production-ready LoRA adapters.