In this paper an audio processing solution for video conference based aerobics is presented. The proposed solution leaves the workout music unaltered by separating it from the speech and processing each signal separately. The speech signal processing is also performed at a lower sample rate, which saves computational power. Real time evaluation of the system shows that high quality music as well as a good two-way communication is maintained during the aerobic session.