-
-
-
-
-
-
Inference Providers
Active filters:
RL
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
•
15B
•
Updated
•
3.31k
•
55
nvidia/Nemotron-Cascade-8B
Text Generation
•
8B
•
Updated
•
5.49k
•
50
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
•
8B
•
Updated
•
1.61k
•
32
bartowski/nvidia_Nemotron-Cascade-14B-Thinking-GGUF
Text Generation
•
15B
•
Updated
•
5.14k
•
5
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation
•
Updated
•
10
stanfordnlp/SteamSHP-flan-t5-xl
Updated
•
43
•
43
stanfordnlp/SteamSHP-flan-t5-large
Updated
•
459
•
33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
•
2B
•
Updated
•
9
•
5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B
•
Updated
•
150
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
•
71B
•
Updated
•
73
•
•
3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
•
71B
•
Updated
•
225
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
•
71B
•
Updated
•
846
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
•
Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
•
Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
0.5B
•
Updated
•
86
•
24
Reinforcement Learning
•
Updated
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B
•
Updated
•
110
•
1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B
•
Updated
•
120
•
1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B
•
Updated
•
37
Text Generation
•
684B
•
Updated
•
131
•
1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B
•
Updated
•
139
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
•
0.5B
•
Updated
•
23
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
0.5B
•
Updated
•
47
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
•
2B
•
Updated
•
104
•
1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
2B
•
Updated
•
40
•
1
mradermacher/Zireal-0-GGUF
mradermacher/Magellanic-Qwen-25B-R999-GGUF
25B
•
Updated
•
38
•
1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF
25B
•
Updated
•
53
•
1
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1
Text Generation
•
3B
•
Updated
•
1
Teen-Different/squiral_maze
Reinforcement Learning
•
Updated