Javed Alam PRO
AI & ML interests
Recent Activity
Organizations
- Running11k
AI Comic Factory
👩11kCreate your own AI comic with a single prompt
- RunningAgentsFeatured196
Arxiv CS RAG
🔍196Ask an LLM about Arxiv papers
- Running on CPU UpgradeAgents244
Open Portuguese LLM Leaderboard
🏆244Track, rank and evaluate open LLMs in Portuguese
- RunningAgentsFeatured586
LLM-Perf Leaderboard
🏆586Explore LLM performance across hardware configurations
- Running on ZeroAgentsFeatured163
Stable Audio Live Multiplayer
💻163Generate custom audio from text prompts
- Runtime errorAgents16
Chattts Zero
🐢16Generate audio from text with voice cloning
- Running459
Real-time Whisper WebGPU
🎤459Transcribe audio to text using your browser
- Runtime errorAgentsFeatured5.07k
MusicGen
🎵5.07kGenerate music from text descriptions and optional melodies
- Runtime errorAgents162
Hallo
👋162Generate realistic talking heads from image+audio
-
Wan-AI/Wan2.2-TI2V-5B
Text-to-Video • Updated • 6.59k • • 570 - PausedAgents33
Canary-Qwen-2.5B
🐤33Transcribe audio and generate responses based on prompts
- SleepingAgents3
MiniCPM-V-4 5-video-chat
📈3Ask questions about uploaded videos
- Running31
— Inference Api —
📟31Generate text based on your input
- Running on ZeroAgentsFeatured219
Microsoft Phi-3-Vision-128k
😻219Chat with an image using Phi-3 Vision model
- Running on ZeroAgents90
Llava Llama-3 8B
🔥90Meta Llama3 8b with Llava Multimodal capabilities
-
nanonets/Nanonets-OCR2-3B
Image-Text-to-Text • 4B • Updated • 495k • 500
- Running11k
AI Comic Factory
👩11kCreate your own AI comic with a single prompt
- RunningAgentsFeatured196
Arxiv CS RAG
🔍196Ask an LLM about Arxiv papers
- Running on CPU UpgradeAgents244
Open Portuguese LLM Leaderboard
🏆244Track, rank and evaluate open LLMs in Portuguese
- RunningAgentsFeatured586
LLM-Perf Leaderboard
🏆586Explore LLM performance across hardware configurations
- Running31
— Inference Api —
📟31Generate text based on your input
- Running on ZeroAgentsFeatured219
Microsoft Phi-3-Vision-128k
😻219Chat with an image using Phi-3 Vision model
- Running on ZeroAgents90
Llava Llama-3 8B
🔥90Meta Llama3 8b with Llava Multimodal capabilities
-
nanonets/Nanonets-OCR2-3B
Image-Text-to-Text • 4B • Updated • 495k • 500
- Running on ZeroAgentsFeatured163
Stable Audio Live Multiplayer
💻163Generate custom audio from text prompts
- Runtime errorAgents16
Chattts Zero
🐢16Generate audio from text with voice cloning
- Running459
Real-time Whisper WebGPU
🎤459Transcribe audio to text using your browser
- Runtime errorAgentsFeatured5.07k
MusicGen
🎵5.07kGenerate music from text descriptions and optional melodies
- Runtime errorAgents162
Hallo
👋162Generate realistic talking heads from image+audio
-
Wan-AI/Wan2.2-TI2V-5B
Text-to-Video • Updated • 6.59k • • 570 - PausedAgents33
Canary-Qwen-2.5B
🐤33Transcribe audio and generate responses based on prompts
- SleepingAgents3
MiniCPM-V-4 5-video-chat
📈3Ask questions about uploaded videos