If I understand correctly, you do the following to get rid of the audio carrier:
interpolate 2x to avoid aliasing
downconvert audio-carrier to zero
highpass to get rid of audio-carrier
downconvert video-carrier to zero
take the absolute value (get the envelope)
decimate 2x
The above steps can be done in GNU Radio with components that already
exist:
interpolator
freq_xlating_fir_filter (which you feed high-pass FIR filter taps)
multiplication (or another freq_xlating_fir_filter with low-pass filter
taps; together with the high-pass this gives you a band-pass filter)
abs
decimator
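
To check that I read the chain right, here is a rough offline sketch of the same steps in plain NumPy (not GNU Radio blocks). Everything numeric in it is my own assumption for illustration: the 8 MHz complex sample rate, the carrier offsets, the 300 kHz high-pass cutoff, the tap count, and the helper names. The audio carrier is modelled as an unmodulated tone, which is simpler than a real FM sound carrier.

```python
import numpy as np

def lowpass_taps(cutoff_hz, fs, ntaps=129):
    """Windowed-sinc low-pass FIR taps (Hamming window), unity gain at DC."""
    n = np.arange(ntaps) - (ntaps - 1) / 2
    h = np.sinc(2 * cutoff_hz / fs * n) * np.hamming(ntaps)
    return h / h.sum()

def highpass_taps(cutoff_hz, fs, ntaps=129):
    """High-pass taps by spectral inversion of the low-pass (ntaps must be odd)."""
    h = -lowpass_taps(cutoff_hz, fs, ntaps)
    h[(ntaps - 1) // 2] += 1.0
    return h

def shift(x, f_hz, fs):
    """Move the component at f_hz down to 0 Hz."""
    n = np.arange(len(x))
    return x * np.exp(-2j * np.pi * f_hz / fs * n)

def remove_audio_and_detect(x, fs, f_video, f_audio):
    # 1. interpolate 2x: zero-stuff, then low-pass away the image
    fs2 = 2 * fs
    up = np.zeros(2 * len(x), dtype=complex)
    up[::2] = x
    up = 2 * np.convolve(up, lowpass_taps(fs / 2, fs2), mode="same")
    # 2. downconvert the audio carrier to 0 Hz
    y = shift(up, f_audio, fs2)
    # 3. high-pass: removes whatever now sits at (or very near) 0 Hz
    y = np.convolve(y, highpass_taps(300e3, fs2), mode="same")
    # 4. downconvert the video carrier (now at f_video - f_audio) to 0 Hz
    y = shift(y, f_video - f_audio, fs2)
    # 5. absolute value gives the envelope
    env = np.abs(y)
    # 6. decimate 2x: low-pass, then keep every second sample
    env = np.convolve(env, lowpass_taps(0.45 * fs, fs2), mode="same")
    return env[::2]

# Made-up test signal: AM video carrier plus a plain tone as "audio carrier".
fs = 8e6
f_video, f_audio = -2.75e6, 2.75e6
t = np.arange(8000) / fs
expected = 1 + 0.5 * np.cos(2 * np.pi * 100e3 * t)
video = expected * np.exp(2j * np.pi * f_video * t)
audio = 0.3 * np.exp(2j * np.pi * f_audio * t)
env = remove_audio_and_detect(video + audio, fs, f_video, f_audio)
```

Away from the filter edge effects at both ends, env should follow the AM envelope of the video carrier. In GNU Radio, steps 2 and 3 would collapse into the single frequency-translating FIR filter mentioned above.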