A library for making RepE control vectors
Updated Sep 24, 2025 - Jupyter Notebook
[ICLR 2025] General-purpose activation steering library
A resource repository for representation engineering in large language models
Steering vectors for transformer language models in PyTorch / Hugging Face
[🏆 CHI26 Best Paper] CoBRA: Reproducible control of LLM agent behavior via classic social science experiments
KV Cache Steering for Inducing Reasoning in Small Language Models
Lightweight representation engineering dataflow operations for agent developers.
[🔥 ICLR 2026] - Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots
CRSM (Continuous Reasoning State Model): An asynchronous "System 2" architecture that implements Hierarchical State Sovereignty within a Mamba backbone. Unlike traditional search wrappers, CRSM uses Forward-Projected Planning and Sparse-Gated Injection to steer latent manifolds in real-time, decoupling strategic reasoning from token generation.
Early baby steps towards a long-term vision regarding Mamba-2's state interpretability.
Latent or Linguistic: Autonomous Behavior in LLMs. CS120 Final Project Submission
Qwen3-0.6B activation steering: style vectors, lens contamination eval, CPRR methodology
Representation Rerouting for Agentic Safety: Defending LLM Agents against Prompt Injection via Circuit Breakers and Triplet Loss.
Evaluation framework for different methods of probing and steering LLM activations to mitigate Chain-of-Thought unfaithfulness. Research project by Giovanni M. Occhipinti (University of Bologna), Alessandro Abate and Nandi Schoots (University of Oxford).
SOM network modified in order to control the latent space
Official code for "Activation Steering for Accent Adaptation in Speech Foundation Models" (Interspeech 2026). Parameter-free accent adaptation via mean-shift steering vectors — no weight updates, consistent WER reductions across 8 accents.