-
Notifications
You must be signed in to change notification settings - Fork 170
Issues: EricLBuehler/mistral.rs
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Support loading tokenizer from New feature or request
sentencepiece
model
new feature
#407
opened Jun 8, 2024 by
EricLBuehler
Installation Error
bug
Something isn't working
triaged
This error has been reproduced or otherwise triaged.
#396
opened Jun 5, 2024 by
TimDouglas2
Cross GPU device mapping feature
backend
Backend work
models
Additions to model or architectures
new feature
New feature or request
#395
opened Jun 5, 2024 by
joshpopelka20
Support for T5 Architecture
models
Additions to model or architectures
new feature
New feature or request
#384
opened Jun 5, 2024 by
niranjanakella
Feature Req: Add Importance Matrix / RAM avail calculations to ISQ
models
Additions to model or architectures
new feature
New feature or request
#377
opened Jun 4, 2024 by
psyv282j9d
dolphin-2.9-mixtral-8x22b.Q8_0.gguf "Error: cannot find tensor info for blk.0.ffn_gate.0.weight"?
bug
Something isn't working
#352
opened May 28, 2024 by
psyv282j9d
Insitu quantization OOM for large models
bug
Something isn't working
#344
opened May 23, 2024 by
nidhoggr-nil
Garbled output on very long prompts
bug
Something isn't working
#339
opened May 21, 2024 by
LLukas22
bug: If device layers requested exceed model layers, host layers overflow
bug
Something isn't working
resolved
#329
opened May 19, 2024 by
polarathene
Running model from a GGUF file, only
new feature
New feature or request
#326
opened May 17, 2024 by
MoonRide303
mistral does not support NVIDIA V100 (compute_cap <= 800)
bug
Something isn't working
#305
opened May 14, 2024 by
thesues
Quantized Phi3: Features to add
models
Additions to model or architectures
#277
opened May 9, 2024 by
EricLBuehler
1 of 2 tasks
LoRA swapping at runtime
models
Additions to model or architectures
new feature
New feature or request
#259
opened May 1, 2024 by
BHX2
Add C api and provide shared and static libraries.
new feature
New feature or request
#258
opened May 1, 2024 by
maximus2600
Batched & chunked prefill
models
Additions to model or architectures
new feature
New feature or request
#216
opened Apr 26, 2024 by
lucasavila00
Model Wishlist
models
Additions to model or architectures
#156
opened Apr 16, 2024 by
EricLBuehler
11 tasks
Need parallel linears
backend
Backend work
paged-attention
#50
opened Apr 1, 2024 by
EricLBuehler
3 tasks
Add topk scalings, topk softmax scalings for X-LoRA
models
Additions to model or architectures
new feature
New feature or request
#48
opened Mar 30, 2024 by
EricLBuehler
ProTip!
no:milestone will show everything without a milestone.