Documentation
malagent research documentation
Research
Explore the theoretical foundations of RLVR for security research:
- Theory โ RLVR concepts and RAFT algorithm
- Reward Configuration โ Configurable reward signals
Access
Full implementation documentation is available for authorized research collaborators. See the GitHub repository for access information.