Skip to content

Releases: ggml-org/llama.cpp

b9033

05 May 14:17

Choose a tag to compare

b9031

05 May 14:05
bf76ac7

Choose a tag to compare

b9030

05 May 10:31
a09a00e

Choose a tag to compare

b9028

05 May 06:33
d6e7b03

Choose a tag to compare

b9026

05 May 05:33
a817a22

Choose a tag to compare

b9025

04 May 21:50
eff0670

Choose a tag to compare

b9023

04 May 21:12
935a340

Choose a tag to compare

b9022

04 May 18:36
d8794ee

Choose a tag to compare

b9020

04 May 19:06
a4701c9

Choose a tag to compare

b9019

04 May 17:25
994118a

Choose a tag to compare

model: move load_hparams and load_tensors to per-model definition (#22004)

  • git-friendly migration

  • add build_graph

  • nits

  • exclude old code from build

  • wip

  • add llm_arch_model_i

  • prepare downstream functions

  • nits

  • nits

  • wip

  • wip

  • add back create_tensor_qkv

  • fix files missing include

  • enforce one llm_build per arch

  • cmake: use glob

  • missing model params

  • nits

  • wip

  • wip (2)

  • wip (3)

  • test-llama-archs is happy

  • improve switch case

  • move more stuff into llm_arch_model_i

  • fix downstream code

  • nits

  • nits (2)

  • fix order

  • llama_model_base

  • LLAMA_LOAD_LOCALS

  • small fix

  • fix build errors

  • auto

  • rm migration script and ifdef

macOS/iOS:

Linux:

Android:

Windows:

openEuler: