Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17
5
4
perfecxion.ai
perfecXion
Follow
webxos's profile picture
1 follower
ยท
2 following
https://perfecxion.ai
perfecxion-ai
perfecxion-ai
AI & ML interests
None yet
Recent Activity
upvoted
an
article
4 days ago
AI Coding Assistants Keep Shipping Vulnerable Code -- Here's What We're Doing About It
reacted
to
scthornton
's
post
with ๐
4 days ago
# SecureCode Dataset Family Update: 2,185 Security Examples, Framework-Specific Patterns, Clean Parquet Loading Hey y'all, Quick update on the SecureCode dataset family. We've restructured things and fixed several issues: **What changed:** - The datasets are now properly split into three repos: [unified](https://huggingface.co/datasets/scthornton/securecode) (2,185), [web](https://huggingface.co/datasets/scthornton/securecode-web) (1,378), [AI/ML](https://huggingface.co/datasets/scthornton/securecode-aiml) (750) - All repos now use Parquet format -- `load_dataset()` just works, no deprecated loading scripts - SecureCode Web now includes 219 framework-specific examples (Express, Django, Spring Boot, Flask, Rails, Laravel, ASP.NET Core, FastAPI, NestJS) - Data cards have been corrected and split sizes fixed **Why it matters:** With AI-generated code accounting for 60%+ of some codebases (Checkmarx 2025), security training data is more important than ever. Every example in SecureCode is grounded in a real CVE with 4-turn conversations that mirror actual developer-AI workflows. If you're working on code generation models, I'd love to hear how you're approaching the security angle. Are there vulnerability categories or frameworks you'd like to see covered? Paper: [arxiv.org/abs/2512.18542](https://arxiv.org/abs/2512.18542)
reacted
to
scthornton
's
post
with ๐
4 days ago
# SecureCode Dataset Family Update: 2,185 Security Examples, Framework-Specific Patterns, Clean Parquet Loading Hey y'all, Quick update on the SecureCode dataset family. We've restructured things and fixed several issues: **What changed:** - The datasets are now properly split into three repos: [unified](https://huggingface.co/datasets/scthornton/securecode) (2,185), [web](https://huggingface.co/datasets/scthornton/securecode-web) (1,378), [AI/ML](https://huggingface.co/datasets/scthornton/securecode-aiml) (750) - All repos now use Parquet format -- `load_dataset()` just works, no deprecated loading scripts - SecureCode Web now includes 219 framework-specific examples (Express, Django, Spring Boot, Flask, Rails, Laravel, ASP.NET Core, FastAPI, NestJS) - Data cards have been corrected and split sizes fixed **Why it matters:** With AI-generated code accounting for 60%+ of some codebases (Checkmarx 2025), security training data is more important than ever. Every example in SecureCode is grounded in a real CVE with 4-turn conversations that mirror actual developer-AI workflows. If you're working on code generation models, I'd love to hear how you're approaching the security angle. Are there vulnerability categories or frameworks you'd like to see covered? Paper: [arxiv.org/abs/2512.18542](https://arxiv.org/abs/2512.18542)
View all activity
Organizations
perfecXion
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
3 datasets
20 days ago
scthornton/atlas
Viewer
โข
Updated
Nov 24, 2025
โข
155
โข
25
โข
2
scthornton/securecode
Preview
โข
Updated
20 days ago
โข
115
โข
3
scthornton/securecode-aiml
Updated
20 days ago
โข
70
โข
3
liked
a dataset
about 1 month ago
scthornton/securecode-web
Viewer
โข
Updated
4 days ago
โข
1.38k
โข
3.27k
โข
9