Carlo Moro

cnmoro

AI & ML interests

None yet

Recent Activity

liked a model about 18 hours ago

jinaai/jina-clip-v2

reacted to fffiloni's post with ❤️ about 18 hours ago

I brought DALL·E mini back to life 🤖🎨

You can try it here: https://huggingface.co/spaces/fffiloni/dalle-mini-reboot

And I also built a batch version using Hugging Face Jobs (up to 50 images per prompt): https://huggingface.co/spaces/fffiloni/dalle-mini-via-jobs

The goal was to stay close to the original JAX/Flax pipeline, while integrating it with modern tooling (Gradio + Jobs). It ended up being a fun way to revisit this model: still weird, still fun 😄

reacted to robtacconelli's post with 🤯 3 days ago

🧬 Midicoth: diffusion-based lossless compression. No neural net, no GPU, no training data.

What if reverse diffusion could compress text without a neural network? Midicoth brings score-based denoising into classical compression. It treats prior smoothing as forward noise and reverses it with Tweedie's formula on a binary tree: 3 denoising steps, James-Stein shrinkage, applied after all model blending. ~2,000 lines of C, single CPU core.

Beats every dictionary compressor we tested:
enwik8 (100 MB) → 1.753 bpb (−11.9% vs xz, −15% vs Brotli, −24.5% vs bzip2)
alice29.txt → 2.119 bpb (−16.9% vs xz)
Outperforms xz, zstd, Brotli, bzip2, gzip on all inputs

PAQ/CMIX still win with hundreds of models + LSTMs. LLM compressors win with pre-trained knowledge. Midicoth closes the gap with pure statistics: no mixer, no gradient descent, just counting.

The Tweedie denoising layer adds 2.3–2.7% on every file tested, making it the most consistent component in the ablation. Adding SSE or logistic mixers made things worse. In the online setting, count-based beats gradient-based.

No external dependencies. Fully deterministic. Bit-exact encode/decode. ~60 KB/s throughput.

💻 Code: https://github.com/robtacconelli/midicoth
📄 Paper: https://huggingface.co/papers/2603.08771
⭐ Space: https://huggingface.co/spaces/robtacconelli/midicoth

If you ever wondered whether diffusion ideas belong in data compression, here's proof they do. ⭐ appreciated!

upvoted a collection 7 days ago

Qwen3.5-text-only

Qwen3.5-text-only • 4 items • Updated 8 days ago • 11

upvoted a collection 15 days ago

Tucano2

An open suite of large language models (LLMs) with 0.5-3.7 billion parameters, designed to address the gap in open-source development for Portuguese. • 33 items • Updated 9 days ago • 13

upvoted a paper 21 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published Feb 11 • 22

upvoted an article 22 days ago

Mixture of Experts (MoEs) in Transformers

Article • 23 days ago • 139

upvoted a collection 25 days ago

Qwen3 Voice Embedding

Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B). • 4 items • Updated 21 days ago • 28

upvoted an article about 1 month ago

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

Article • Feb 12 • 50

upvoted an article 3 months ago

Encoding the World's Medical Knowledge into 970K

Article • Dec 22, 2025 • 15

upvoted a collection 5 months ago

Smoothie Qwen3

For more details, please visit https://github.com/dnotitia/smoothie-qwen • 9 items • Updated Jan 26 • 7

upvoted an article 5 months ago

Visualizing How VLMs Work

Article • Oct 7, 2025 • 54

upvoted an article 9 months ago

Gemma 3n fully available in the open-source ecosystem!

Article • Jun 26, 2025 • 120

upvoted a collection about 1 year ago

Gemma 3 Release

28 items • Updated 8 days ago • 624

upvoted a paper about 1 year ago

Layered Image Vectorization via Semantic Simplification

Paper • 2406.05404 • Published Jun 8, 2024 • 3

upvoted a collection about 1 year ago

Portuguese LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 17 items • Updated 18 minutes ago • 43

upvoted a paper about 1 year ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published Jan 22, 2025 • 28

upvoted an article about 1 year ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15, 2025 • 228

upvoted a collection over 1 year ago

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 42

upvoted 3 collections almost 2 years ago

Florence

5 items • Updated 18 days ago • 173

MonoPTT5

MonoT5 rerankers for the Portuguese language • 5 items • Updated Sep 4, 2024 • 2

ptt5-v2

5 items • Updated Sep 4, 2024 • 3

upvoted a collection about 2 years ago

Recent models: last 100 repos, sorted by creation date

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 100 items • Updated 18 days ago • 576