Skip to content

Issues: ahrefs/ocannl

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Implement a BERT model, replicate ModernBERT to use ModernBERT weights explore Priority below "enhancement", non-blocking for milestones
#297 opened Dec 20, 2024 by lukstafi
Example training loop using DisTrO low-communication distributed data parallelism explore Priority below "enhancement", non-blocking for milestones
#278 opened Aug 27, 2024 by lukstafi
Anything we can learn from krnl and autograph? explore Priority below "enhancement", non-blocking for milestones
#277 opened Jul 30, 2024 by lukstafi
Replicate Andrej Karpathy's "LLM101n: Let's build a Storyteller" explore Priority below "enhancement", non-blocking for milestones
#275 opened Jul 24, 2024 by lukstafi
Support quantization for optimizers: low-bit optimizers explore Priority below "enhancement", non-blocking for milestones
#271 opened Jul 6, 2024 by lukstafi
Any lessons from Imbue for training-in-the-large? explore Priority below "enhancement", non-blocking for milestones
#270 opened Jul 4, 2024 by lukstafi
Take a look at Tiramisu Polyhedral Compiler explore Priority below "enhancement", non-blocking for milestones
#267 opened May 27, 2024 by lukstafi
Consider implementing a Cranelift backend (CPU) explore Priority below "enhancement", non-blocking for milestones
#266 opened May 25, 2024 by lukstafi
Study Candle -- a minimalistic Rust framework explore Priority below "enhancement", non-blocking for milestones
#265 opened May 25, 2024 by lukstafi
Consider implementing Lean Attention (Flash Attention + softmax-as-reduce) explore Priority below "enhancement", non-blocking for milestones
#263 opened May 21, 2024 by lukstafi
Superoptimizers for tensor programs explore Priority below "enhancement", non-blocking for milestones
#261 opened May 11, 2024 by lukstafi
ProTip! Updated in the last three days: updated:>2024-12-22.