Ljubomir Josifovski

ljupco

https://ljubomirj.github.io/

AI & ML interests

Now - ML/AI, agents, forecasting, science & engineering. Previous - systematic trading, research & development. Prior - speech recognition in noise, speech synthesis, machine learning.

Recent Activity

liked a model about 1 hour ago

nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16

liked a model about 2 hours ago

z-lab/MiniMax-M2.7-DFlash

liked a model 1 day ago

havenoammo/Qwen3.6-27B-MTP-UD-GGUF

View all activity

Organizations

None yet

upvoted 2 collections 2 days ago

Proven REAPs

Collection

Benchmarked REAP checkpoints with >=500 all-time downloads. GLM/Qwen/MiniMax/DeepSeek/Kimi/gemma. • 25 items • Updated 18 days ago • 8

Gemma-4 Assistant (MTP)

Collection

4 items • Updated 4 days ago • 17

upvoted an article 5 days ago

Article

Mixture of Experts (MoEs) in Transformers

Feb 26

•

159

upvoted a paper 12 days ago

Learning to Continually Learn via Meta-learning Agentic Memory Designs

Paper • 2602.07755 • Published Feb 8 • 8

upvoted a collection 15 days ago

DeepSeek-V4

Collection

4 items • Updated 16 days ago • 622

upvoted a collection 19 days ago

Nemotron-Cascade 2

Collection

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated about 19 hours ago • 49

upvoted a paper 22 days ago

TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification

Paper • 2604.14531 • Published 24 days ago • 7

upvoted a changelog 24 days ago

Hugging Face Changelog

Introducing Kernels

24 days ago

• 171

upvoted a collection 26 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 18 items • Updated about 19 hours ago • 285

upvoted a collection 27 days ago

Gemma 4

Collection

Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 17 days ago • 176

upvoted a paper about 1 month ago

Arcee Trinity Large Technical Report

Paper • 2602.17004 • Published Feb 19 • 20

upvoted a collection about 1 month ago

Trinity-Large-Thinking

Collection

5 items • Updated 30 days ago • 31

upvoted an article about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

Apr 2

•

890

upvoted a collection about 1 month ago

Qwen3.5-Claude-4.6-Opus-Reasoning-Distilled-v2

Collection

15 items • Updated 3 days ago • 101

upvoted a paper about 1 month ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 66

upvoted an article about 1 month ago

Article

Introducing Cohere-transcribe: state-of-the-art speech recognition

Mar 26

•

upvoted 3 papers about 2 months ago

The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus

Paper • 2603.20105 • Published Mar 20 • 37

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 196

EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test

Paper • 2503.01840 • Published Mar 3, 2025 • 9

upvoted an article 2 months ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

Feb 20

•

100

Ljubomir Josifovski

AI & ML interests

Recent Activity

Organizations

ljupco's activity

Mixture of Experts (MoEs) in Transformers

Introducing Kernels

Welcome Gemma 4: Frontier multimodal intelligence on device

Introducing Cohere-transcribe: state-of-the-art speech recognition

Train AI models with Unsloth and Hugging Face Jobs for FREE