Tags: swordow/llama-cpp-python
fix: llama_cpp embed() method bug fixes

1. n_tokens() parentheses in batch overflow offset: self._batch.n_tokens was a bound method reference, not an int; added () to call it correctly.
2. The normalize parameter is now passed to llama_batch_decode as n_norm (n_norm=2 for L2 normalization, n_norm=0 for no normalization) instead of a hardcoded 2.
3. NONE pooling pos offset: pos += size * n_embd (was pos += size), since the pointer is float-indexed and each token occupies n_embd floats.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
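The third fix above can be illustrated with a small sketch (hypothetical names, not the library's actual code): the decode buffer is a flat float array, so token t of a sequence occupies floats [pos + t * n_embd, pos + (t + 1) * n_embd), and the per-sequence offset must advance by size * n_embd floats, not by size tokens.

```python
def collect_token_embeddings(buf, sizes, n_embd):
    """buf: flat list of floats; sizes: number of tokens per sequence.

    Returns one list of n_embd-float embeddings per sequence.
    """
    out = []
    pos = 0
    for size in sizes:
        seq = [buf[pos + t * n_embd : pos + (t + 1) * n_embd]
               for t in range(size)]
        out.append(seq)
        pos += size * n_embd  # the fix: advance in floats, not in tokens
    return out

# Two sequences (2 tokens and 1 token), n_embd = 3
buf = [float(i) for i in range(9)]
embs = collect_token_embeddings(buf, [2, 1], 3)
# embs[1][0] == [6.0, 7.0, 8.0]; with the old `pos += size` bug the
# second sequence would have been read from the wrong offset (pos = 2).
```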
mod:

1. Exports llama-embedding as a DLL, using llama_batch_decode from llama-embedding.
2. create_embedding adds a normalize parameter for embedding normalization.
3. Llama supports apply_chat_format.
4. logits_all does not need to be tied to the pooling type; it is set True for all tokens in embed(), consistent with the logic in llama-embedding.
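The normalize parameter mentioned above maps to an n_norm value (n_norm=2 for L2, n_norm=0 for none, per the fix commit). A minimal sketch of that normalization, with an assumed helper name not taken from the library:

```python
def normalize_embedding(vec, n_norm=2):
    """n_norm=2 -> L2-normalize; n_norm=0 -> return the vector unchanged."""
    if n_norm == 0:
        return list(vec)
    # General p-norm; for n_norm=2 this is sqrt(sum(x^2)).
    norm = sum(abs(x) ** n_norm for x in vec) ** (1.0 / n_norm)
    return [x / norm for x in vec] if norm > 0 else list(vec)

v = normalize_embedding([3.0, 4.0])  # L2 norm is 5.0 -> [0.6, 0.8]
raw = normalize_embedding([3.0, 4.0], n_norm=0)  # unchanged
```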
fix(ci): update macos runner image to non-deprecated version