The two-channel audio input from PART1 plays through the headphones that accompany the monitor showing the performance video. Each channel is fed into a different side of the headphones. The idea behind the live audio input into PART2 is to incorporate two other readings of the original event into the third person's experience. Ideally, a person viewing PART2 will be listening to two people in a separate place simultaneously reading or singing the words from the karaoke video, which should in turn shed light on the meaning of the performance.
I am also interested in the circuitry of the two parts of the installation. It doesn't matter which part is experienced first, but ideally experiencing one part should alter how the participant reflects on their participation in the other, forming a hyper-reflective circuit.