Insights: ggerganov/llama.cpp
Overview
28 Releases published by 1 person
- b4019 published Nov 3, 2024
- b4020 published Nov 3, 2024
- b4023 published Nov 4, 2024
- b4024 published Nov 4, 2024
- b4025 published Nov 4, 2024
- b4026 published Nov 4, 2024
- b4027 published Nov 4, 2024
- b4032 published Nov 4, 2024
- b4033 published Nov 4, 2024
- b4034 published Nov 5, 2024
- b4036 published Nov 6, 2024
- b4037 published Nov 6, 2024
- b4038 published Nov 6, 2024
- b4040 published Nov 7, 2024
- b4041 published Nov 7, 2024
- b4042 published Nov 7, 2024
- b4044 published Nov 7, 2024
- b4048 published Nov 7, 2024
- b4050 published Nov 8, 2024
- b4052 published Nov 8, 2024
- b4053 published Nov 8, 2024
- b4055 published Nov 9, 2024
- b4056 published Nov 9, 2024
- b4057 published Nov 9, 2024
- b4059 published Nov 9, 2024
- b4060 published Nov 9, 2024
- b4061 published Nov 9, 2024
- b4058 published Nov 9, 2024
33 Pull requests merged by 14 people
- metal : reorder write loop in mul mat kernel + style (#10231, merged Nov 9, 2024)
- metal : fix build and some more comments (#10229, merged Nov 9, 2024)
- metal : fix F32 accumulation in FA vec kernel (#10232, merged Nov 9, 2024)
- ggml: fix zero division in `dne` calculation in CUDA COUNT_EQUAL operator when `ne` is small (#10213, merged Nov 9, 2024)
- ggml : optimize llamafile's cpu matrix multiplication for ppc64le (#10156, merged Nov 9, 2024)
- scripts: fix pattern and get n_tokens in one go (#10221, merged Nov 9, 2024)
- metal : opt-in compile flag for BF16 (#10218, merged Nov 8, 2024)
- metal : optimize FA kernels (#10171, merged Nov 8, 2024)
- swift : exclude ggml-metal-embed.metal (#10211, merged Nov 8, 2024)
- server : minor UI fix (#10207, merged Nov 7, 2024)
- server : revamp chat UI with vuejs and daisyui (#10175, merged Nov 7, 2024)
- ggml : add ggml-cpu.h to the public headers (#10204, merged Nov 7, 2024)
- Remove identical wte/etw logic for jais (#10203, merged Nov 7, 2024)
- DRY: Fixes clone functionality (#10192, merged Nov 7, 2024)
- fix q4_0_8_8 format for corrupted tokens issue (#10198, merged Nov 7, 2024)
- Optimize RWKV6 Operator Naming and Implement Multi-core CPU/SYCL Acceleration (#10133, merged Nov 7, 2024)
- metal : add BF16 support (#8439, merged Nov 6, 2024)
- server : remove hack for extra parallel slot (#10187, merged Nov 6, 2024)
- metal : fix from ptr buffer name (#10189, merged Nov 6, 2024)
- ggml : adjust is_first_call init value (#10193, merged Nov 6, 2024)
- metal : add quantized FA support (#10149, merged Nov 6, 2024)
- Add the <|tool_call|> formatting to the granite template (#10177, merged Nov 5, 2024)
- ggml : fix arch check in bf16_to_fp32 (#10164, merged Nov 4, 2024)
- Q6_K AVX improvements (#10118, merged Nov 4, 2024)
- ggml : fix gelu tables initialization (#10172, merged Nov 4, 2024)
- ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (#10167, merged Nov 4, 2024)
- server : clarify /slots endpoint, add is_processing (#10162, merged Nov 4, 2024)
- fix build break on arm64 linux (#10166, merged Nov 4, 2024)
- cuda : clear error after changing peer access (#10153, merged Nov 4, 2024)
- [CANN] Fix compile error for CANN backend as get_name has been removed from ggml_backend_buffer_i (#10158, merged Nov 4, 2024)
- ggml : move CPU backend to a separate file (#10144, merged Nov 3, 2024)
- metal : minor fixup in FA kernel (#10143, merged Nov 3, 2024)
- nix: update flake.lock (#10146, merged Nov 3, 2024)
19 Pull requests opened by 14 people
- gguf-py: Improve `GGUFReader` read-only mode performance (#10159, opened Nov 4, 2024)
- Introduce New Lookup-Table (LUT)-Based Matrix Multiplication Method (#10181, opened Nov 5, 2024)
- CUDA: always create events for split buffers (#10185, opened Nov 5, 2024)
- Introduce IQ4_NL_4_4 format and its neon implementation (#10196, opened Nov 6, 2024)
- Draft: vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2 (#10206, opened Nov 7, 2024)
- docs: add doxygen documentation (#10209, opened Nov 8, 2024)
- AVX BF16 and single scale quant optimizations (#10212, opened Nov 8, 2024)
- CANN Support Ascend310P to accelerate F32 and F16 LLM Model (#10216, opened Nov 8, 2024)
- ci: add Ascend CANN build (#10217, opened Nov 8, 2024)
- metal : use F16 math in mul_mat kernels (#10220, opened Nov 8, 2024)
- vulkan: Throttle the number of shader compiles during the build step (#10222, opened Nov 8, 2024)
- support for llguidance grammars (#10224, opened Nov 9, 2024)
- vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (#10226, opened Nov 9, 2024)
- llama : use ggml_backend_dev_get_extra_bufts (#10228, opened Nov 9, 2024)
- server : enable KV cache defrag by default (#10233, opened Nov 9, 2024)
- metal : refactor kernel args into structs (#10238, opened Nov 9, 2024)
- server: Add back samplers (#10239, opened Nov 9, 2024)
- server : (web UI) add copy button for code block, fix api key (#10242, opened Nov 9, 2024)
- nix: update flake.lock (#10243, opened Nov 10, 2024)
42 Issues closed by 11 people
- how to add an extra fixed tensor to the token embedding in gpt2 arch (#9198, closed Nov 10, 2024)
- Bug: Assertion '__n < this->size()' failed. (#9636, closed Nov 10, 2024)
- Bug: ggml_metal_init error: zero-length arrays are not permitted in C++ float4x4 lo[D16/NW4]; (#10208, closed Nov 9, 2024)
- Bug: Couldn't get number of tokens from ./llama-cli output! (#10219, closed Nov 9, 2024)
- Bug: ROCM 7900xtx output random garbage with qwen1.5/14B after recent update (#9568, closed Nov 9, 2024)
- Feature Request: OpenVINO backend support request (#9601, closed Nov 9, 2024)
- Do llama.cpp support input_embeds? (#9630, closed Nov 9, 2024)
- Bug: Metal bfloat kernel crash when using Swift package (#10205, closed Nov 8, 2024)
- Bug: Can not load llava projector when running llava-cli (#10191, closed Nov 8, 2024)
- Bug: Name Error when running Llava1.5 examples (#10190, closed Nov 8, 2024)
- Bug: prompt construction changed in commit 958367bf (#10138, closed Nov 8, 2024)
- Bug: Gemma 2 slower with FA (#9243, closed Nov 8, 2024)
- Bug: duplicate vulkan devices being detected on windows (#9516, closed Nov 8, 2024)
- Add theme Rose Pine (#9584, closed Nov 7, 2024)
- Feature Request: Support Jina V3 arch (#9585, closed Nov 7, 2024)
- Bug: passing `tfs_z` crashes the server (#9587, closed Nov 7, 2024)
- Feature Request: Word Llama (#9600, closed Nov 7, 2024)
- Bug: `llama-server` web UI resets the text selection during inference on every token update (#9608, closed Nov 7, 2024)
- Bug: llama.cpp server reports inaccurate n_ctx_per_seq? (#10186, closed Nov 6, 2024)
- Bug: Llava not working on android (#8436, closed Nov 6, 2024)
- Bug: Mac build failed using make (#9157, closed Nov 6, 2024)
- Bug: Templates are swapped for Mistral and Llama 2 in llama-server when using --chat-template (#9583, closed Nov 6, 2024)
- Bug: Unable to load a 3B model, failing to allocate buffer size (#10188, closed Nov 5, 2024)
- Feature Request: langchain with_structured_output support (#10168, closed Nov 5, 2024)
- Bug: "speculative" example is crashing? (#10174, closed Nov 5, 2024)
- Bug: Unable to enable AVX_VNNI instructions (#10116, closed Nov 5, 2024)
- Bug: __AVX2__ missing (#10154, closed Nov 4, 2024)
- Bug: Recent llama.cpp breaks q4_0_4_4 on Arm CPU (#10165, closed Nov 4, 2024)
- Bug: CUDA error: peer access has not been enabled (#10152, closed Nov 4, 2024)
- Bug: b3990 ascend cann build error (#10105, closed Nov 4, 2024)
- Refactor: decide the future of llama_tensor_get_type() (#8736, closed Nov 4, 2024)
- Feature Request: InternVL2 Support? (#8848, closed Nov 4, 2024)
- Feature Request: NPU Support (#9181, closed Nov 4, 2024)
- Bug: MinGW build fails to load models with "error loading model: PrefetchVirtualMemory unavailable" (#9311, closed Nov 4, 2024)
- Error compiling using CUDA on Jetson Orin nx (#9533, closed Nov 4, 2024)
- Bug: Build fails on i386 systems (#9545, closed Nov 4, 2024)
- Bug: KV quantization fails when using vulkan (#9551, closed Nov 4, 2024)
- Feature Request: Support GRIN-MoE by Microsoft (#9552, closed Nov 4, 2024)
- Bug: Unreadable output from android example project (#9555, closed Nov 4, 2024)
22 Issues opened by 19 people
- Bug: missing tensor blk.0.ffn_down_exps.weight when loading mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf (#10244, opened Nov 10, 2024)
- fatal error: 'hip/hip_fp16.h' file not found when building using CMake and ROCm 6.2 (#10236, opened Nov 9, 2024)
- Bug: server GET /props request return json with chat_template with last char replaced by \x00 (#10235, opened Nov 9, 2024)
- Bug: CUBLAS_STATUS_INTERNAL_ERROR when using --gpu-layers on ROCm 6.2 (#10234, opened Nov 9, 2024)
- Bug: Server Slows Down Significantly Over Time, Requires Frequent Reboots (RX 7900 XT) (#10227, opened Nov 9, 2024)
- Bug: image encoding error with malloc memory (#10225, opened Nov 9, 2024)
- bge-multilingual-gemma2: ERROR:hf-to-gguf:Model Gemma2Model is not supported (#10215, opened Nov 8, 2024)
- Bug: not support langchain v0.3 to use tools (#10214, opened Nov 8, 2024)
- Feature Request: Support Airllm (#10202, opened Nov 7, 2024)
- Bug: DLLAMA_VULKAN=1 tag is not linking vulkan (#10201, opened Nov 7, 2024)
- Bug: Nondeterministic results on AMD RDNA3 (ROCm) despite zero temperature and fixed seed (#10197, opened Nov 6, 2024)
- Bug: SYCL crash (#10184, opened Nov 5, 2024)
- ggml : move LLAMAFILE/tinyBLAS into a backend (#10183, opened Nov 5, 2024)
- ggml : refactor ggml-cpu.c into multiple C++ source files (#10180, opened Nov 5, 2024)
- Feature Request: Support BitNet.cpp quantization format (#10179, opened Nov 5, 2024)
- Bug: Failed to convert `OuteAI/OuteTTS-0.1-350M` (#10178, opened Nov 5, 2024)
- Bug: Speculative Decoding "Segmentation fault (core dumped)" (#10176, opened Nov 4, 2024)
- tts : add basic example for text-to-speech (#10173, opened Nov 4, 2024)
- Bug: CANN E89999 (#10161, opened Nov 4, 2024)
- Feature Request: [CANN] backend supports Ascend 310P (#10160, opened Nov 4, 2024)
- Bug: GGML_ASSERT(i01 >= 0 && i01 < ne01) failed (#10157, opened Nov 4, 2024)
- Bug: --log-disable also disables output from the model (#10155, opened Nov 4, 2024)
64 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- add FP8 support to gguf/llama (#10055, commented on Nov 8, 2024 • 10 new comments)
- sampling: add K-Shift sampler (#10048, commented on Nov 9, 2024 • 3 new comments)
- main : add new feature: special commands (#10145, commented on Nov 7, 2024 • 2 new comments)
- backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921, commented on Nov 8, 2024 • 1 new comment)
- Bug: Load time on rpc server with multiple machines (#9820, commented on Nov 10, 2024 • 0 new comments)
- android examples add top_p min_keep to new_context (#9828, commented on Nov 10, 2024 • 0 new comments)
- Feature Request: NEON, SVE2, int8mm optimized kernels for IQ4, K quants? (#9827, commented on Nov 10, 2024 • 0 new comments)
- Feature Request: RPC offloading using a local model copy (#10095, commented on Nov 9, 2024 • 0 new comments)
- Bug: Certain RPC Servers cause major slowdown to Host machine (#10047, commented on Nov 9, 2024 • 0 new comments)
- Bug: Ccache causing SYCL backend failed to build on Windows (#9954, commented on Nov 9, 2024 • 0 new comments)
- llama : speed-up grammar sampling (#4218, commented on Nov 9, 2024 • 0 new comments)
- Bug: [vulkan] llama.cpp not work on Raspberry Pi 5 (#9801, commented on Nov 9, 2024 • 0 new comments)
- llama.cpp Windows/ROCm builds are broken? Using shared GPU memory instead of dedicated. (#9964, commented on Nov 8, 2024 • 0 new comments)
- Bug: Failing to build using cmake on tag b3912 (#9913, commented on Nov 8, 2024 • 0 new comments)
- Bug: Model isn't loading (#9563, commented on Nov 8, 2024 • 0 new comments)
- Feature Request: Support for DeciLMForCausalLM (#10028, commented on Nov 8, 2024 • 0 new comments)
- Feature Request: Support for Qwen2-VL (#9246, commented on Nov 8, 2024 • 0 new comments)
- Bug: No improvement for NEON? (#9774, commented on Nov 8, 2024 • 0 new comments)
- Optimization of matrix-vector kernel memory accesses for NVIDIA CUDA High Bandwidth GPUs (#9817, commented on Nov 10, 2024 • 0 new comments)
- Bug: Failed to process regex error with long repeating sequences (#9715, commented on Nov 10, 2024 • 0 new comments)
- [CANN] Bug: Can't compile ggml/src/CMakeFiles/ggml.dir/ggml-cann/acl_tensor.cpp.o (#9560, commented on Nov 10, 2024 • 0 new comments)
- Bug: Slow model loading with mmap (#9244, commented on Nov 10, 2024 • 0 new comments)
- Feature Request: Support llava with different vision/LM backbones (#8574, commented on Nov 10, 2024 • 0 new comments)
- Llama cpp low level python bindings (#1660, commented on Nov 5, 2024 • 0 new comments)
- llama : initial Mamba-2 support (#9126, commented on Nov 4, 2024 • 0 new comments)
- llama: (proposal) propagating the results of `graph_compute` to the user interface (#9525, commented on Nov 10, 2024 • 0 new comments)
- [gguf-py] gguf_reader: numpy 2 newbyteorder fix (#9772, commented on Nov 5, 2024 • 0 new comments)
- fix gguf-py: Conversion error when multiple licenses are configured (#9807, commented on Nov 9, 2024 • 0 new comments)
- [SYCL] Fix build on Windows when ccache enabled (#9954) (#9976, commented on Nov 9, 2024 • 0 new comments)
- [SYCL] pass SYCL CI (#10041, commented on Nov 8, 2024 • 0 new comments)
- metal : GPU "idle-throttling" analysis (#10119, commented on Nov 3, 2024 • 0 new comments)
- Fix docker locale issue (#6267) (#10142, commented on Nov 4, 2024 • 0 new comments)
- Bug: using kv cache quantisation q4_0 seems to cause issues when a context shift is done (#9743, commented on Nov 4, 2024 • 0 new comments)
- Feature Request: [metal] implement FA kernels for quantized KV cache (#9736, commented on Nov 4, 2024 • 0 new comments)
- Bug: ggml_vulkan can only find 1 Vulkan device (#9716, commented on Nov 4, 2024 • 0 new comments)
- Feature Request: Support Codestral Mamba (#8519, commented on Nov 4, 2024 • 0 new comments)
- llama_kv_cache_seq_shift does not work with cache type q4_0 (#5652, commented on Nov 4, 2024 • 0 new comments)
- Bug: gguf pypi package corrupts environment (#9566, commented on Nov 4, 2024 • 0 new comments)
- llama : store token ids in the KV Cache (#9113, commented on Nov 4, 2024 • 0 new comments)
- Feature Request: Support Aya (#10035, commented on Nov 4, 2024 • 0 new comments)
- Bug: gguf tries to access newbyteorder, which was removed in numpy 2.0 (#10127, commented on Nov 4, 2024 • 0 new comments)
- Problem with using llava_surgery_v2.py (#9750, commented on Nov 5, 2024 • 0 new comments)
- Feature Request: Anti-slop / fine tuning of a model output in realtime / on the fly for output quality enhancement (#9748, commented on Nov 5, 2024 • 0 new comments)
- Bug: struct llama_file has two different definitions (breaks ODR) (#9770, commented on Nov 5, 2024 • 0 new comments)
- llama : tool for evaluating quantization results per layer (#2783, commented on Nov 5, 2024 • 0 new comments)
- llama : support Mamba-2 (#7727, commented on Nov 5, 2024 • 0 new comments)
- metal : compile-time kernel args and params (#4085, commented on Nov 5, 2024 • 0 new comments)
- ci : add Apple silicon (M1) macOS runners (#3469, commented on Nov 5, 2024 • 0 new comments)
- ggml : unified CMake build (#6913, commented on Nov 5, 2024 • 0 new comments)
- How can I get log probs in create_chat_completions in llama-cpp? I'm using logprobs=True as an attribute but still not getting log probabilities. (#6423, commented on Nov 5, 2024 • 0 new comments)
- Bug: No text response when "--log-disable" is set (#10002, commented on Nov 5, 2024 • 0 new comments)
- llama_model_load: error loading model: vk::PhysicalDevice::createDevice: ErrorDeviceLost (#9767, commented on Nov 6, 2024 • 0 new comments)
- Bug: Rocm extreme slow down on GFX1100 with release binary (#9765, commented on Nov 6, 2024 • 0 new comments)
- Feature Request: multimodal on android (#9738, commented on Nov 6, 2024 • 0 new comments)
- Bug: llama-quantize --help is not printed (#10122, commented on Nov 6, 2024 • 0 new comments)
- Bug: LLAMA_MAX_LAYERS must be increased to run FatLlama 1.7T (#9909, commented on Nov 6, 2024 • 0 new comments)
- Bug: Cannot edit input before the current line (#9777, commented on Nov 7, 2024 • 0 new comments)
- Feature Request: ANE utilization on Apple Silicon (#9773, commented on Nov 7, 2024 • 0 new comments)
- Potential GPU Usage During CPU Inference (ngl=0) (#9724, commented on Nov 7, 2024 • 0 new comments)
- changelog : `llama-server` REST API (#9291, commented on Nov 7, 2024 • 0 new comments)
- Typo on build.md? (#9793, commented on Nov 8, 2024 • 0 new comments)
- Bug: After update, unable to load GGUF models (#9790, commented on Nov 8, 2024 • 0 new comments)
- Feature Request: Enable overallocation for ggml-vulkan (#9785, commented on Nov 8, 2024 • 0 new comments)
- Feature Request: Support for architecture MambaByte (#9780, commented on Nov 8, 2024 • 0 new comments)