Pull requests: huggingface/text-embeddings-inference
Pull requests list
feat: ROCm flash-attn varlen, triton layer norm, and AMD Dockerfile
#860
opened Apr 9, 2026 by
Abdennacer-Badaoui
Member
Add rate-limited and aggregate logging to reduce log volume at high load
#859
opened Apr 8, 2026 by
dsingal0
Add repository cloning step for local installation
#781
opened Dec 19, 2025 by
smedegaard
1 of 5 tasks
feat: add varlen attention on cpu
#777
opened Dec 17, 2025 by
michaelfeil
Contributor
Draft
5 tasks
candle: health check by queuing on cuda
#775
opened Dec 17, 2025 by
michaelfeil
Contributor
5 tasks
Add Support for XProvence Sentence-Level Context Pruning (naver/xprovence-reranker-bgem3-v1)
#770
opened Dec 4, 2025 by
sigridjineth