🧠 DETR Interpretability Dashboard with Controls

How to use this dashboard

  • Upload an image using the left panel. The model will run object detection and list detected objects. Try imageNet
  • Use the "Confidence Threshold" slider to filter detections by score. Detections below the threshold are hidden.
  • Pick a detection from the dropdown to generate explanations for that object.
  • Choose between Grad-CAM and Grad-CAM++ (Grad-CAM++ often gives sharper, more localized maps).
  • MC Dropout Samples controls how many stochastic forward passes are used to estimate prediction uncertainty. More samples give smoother estimates but take longer.
  • Dropout Probability sets the dropout rate used during MC Dropout; higher values typically increase predicted uncertainty.

Tooltips are provided on each control (hover or focus) for quick hints.

Pick a detection for explanation
0 1
Grad-CAM Variant
1 100
0 0.9

Control tooltips (quick reference)

  • Confidence Threshold: Filter out detections with confidence below this value.
  • Grad-CAM Variant: Choose the gradient-based visualization method. Grad-CAM++ may highlight smaller regions more precisely.
  • MC Dropout Samples: Number of stochastic forward passes for uncertainty estimation. Increase for more stable results.
  • Dropout Probability: Dropout rate used during MC Dropout sampling. Higher values typically increase predictive variance.
  • Pick a detection: Select which detected object to explain. Format shown as 'index: label (score)'.

Key interpretability choices & why they matter

  • Baseline (Integrated Gradients): Defines what 'no signal' looks like. Black (zeros) is simple, but blurred or neutral baselines may give more meaningful attributions.
  • Which conv layer for Grad-CAM: Early layers give fine texture but low semantics; very late layers are coarse. A late backbone conv (default used) is a good compromise.
  • Number of MC Dropout samples: More samples = smoother, more stable uncertainty estimates, but higher compute cost.
  • Grad-CAM vs Grad-CAM++: Grad-CAM++ can be sharper and better for overlapping instances; vanilla Grad-CAM is faster and simpler.

Further reading (recommended)