Causal Analysis Framework

Multi-stage framework for identifying and validating knowledge-critical neurons in AI models.

Neuron Identification

Isolate predictive neurons using integrated gradients and knowledge probing tasks for factual correctness.

A close-up of an abstract textured surface featuring irregular patterns resembling neural networks or biological structures. The surface appears metallic with reflective qualities and intricate grooves.
A close-up of an abstract textured surface featuring irregular patterns resembling neural networks or biological structures. The surface appears metallic with reflective qualities and intricate grooves.
Causal Intervention

Selectively deactivate high-impact neurons to test factual degradation and observe knowledge drift.

Compare attribution methods for consistency in identifying knowledge-critical neurons against human-annotated rankings.

Attribution Validation
Suspended, interconnected strands of blue and white particles create an abstract, web-like pattern against a dark background, resembling cosmic or neural structures.
Suspended, interconnected strands of blue and white particles create an abstract, web-like pattern against a dark background, resembling cosmic or neural structures.
A complex network of interconnected black strings or wires forms an intricate web-like structure against a light, neutral background. The lines crisscross at various angles, creating a sense of depth and three-dimensionality.
A complex network of interconnected black strings or wires forms an intricate web-like structure against a light, neutral background. The lines crisscross at various angles, creating a sense of depth and three-dimensionality.
A network of tree branches with an artificial color scheme. The branches are predominantly in bright pink and yellow against a brown background, creating a high contrast abstract visual.
A network of tree branches with an artificial color scheme. The branches are predominantly in bright pink and yellow against a brown background, creating a high contrast abstract visual.
Neuron Ablation

Testing factual degradation by selectively deactivating high-impact neurons for insights.

A 3D illustration of interconnected white spheres resembling molecular structures set against a soft, blurred backdrop. The spheres are depicted in various sizes, connected by straight rods, creating an impression of a complex network or chemical compound.
A 3D illustration of interconnected white spheres resembling molecular structures set against a soft, blurred backdrop. The spheres are depicted in various sizes, connected by straight rods, creating an impression of a complex network or chemical compound.
Attribution Validation

Comparing attribution methods for consistency in identifying critical knowledge neurons.