Fig. 1From: Pedestrian detection with motion features via two-stream ConvNetsOverview of our method. First we pre-process video frames by coarse alignment and temporal differencing for effective motion description (a). Next, we input raw current frames and temporal differences into convolutional layers to extract spatio-temporal features (b). Finally boosted forests perform sliding-window detection over the convolutional feature map (c)Back to article page