Ready annotator_b
Video ID: 6ygujqr_JWc
Category: speech_dominant
Standard Vision ○
Misleading Vision ○
Standard Audio ○
Misleading Audio ○
As the woman in the lavender-purple robe clutches her face in despair while tears stream down her cheeks, what specific emotional declaration does she struggle to articulate before being silenced?
A. I'm afraid to be alone
B. He's gone now
C. I'm afraid to leave ✓ Correct
D. I lost the baby
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [90s-100s] Modality: vision Category: existence


While the man in the white tank top leans forward to offer support by resting his hand on her shoulder, what specific emotional declaration does the woman struggle to articulate before the scene cuts?
A. I'm afraid to be alone
B. He's gone now
C. I'm afraid to leave
D. I lost the baby
E. The visual detail in the question is incorrect ✓ Correct
F. The audio detail in the question is incorrect
Answer timestamp: [90s-100s] Modality: vision Category: existence
Misleading Information
Category: person_appearance
Description: The misleading premise shifts the visual cue from the woman (who is speaking and acting) to the man (who is listening and offering support). A careless model might associate the man's presence with the quote, or fail to track who is actually delivering the line from the visual cue provided.


Following the moment where a loud, synthetic electronic tone abruptly silences the room after her confession, what was the final phrase she managed to whisper?
A. I'm afraid to be alone
B. He's gone now
C. I'm afraid to leave ✓ Correct
D. I lost the baby
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [90s-100s] Modality: audio Category: existence


Immediately after a sharp, high-pitched beep cuts off the dialogue mid-sentence, what was the final phrase she managed to whisper?
A. I'm afraid to be alone
B. He's gone now
C. I'm afraid to leave
D. I lost the baby
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect ✓ Correct
Answer timestamp: [90s-100s] Modality: audio Category: existence
Misleading Information
Category: sound_intensity
Description: The misleading premise replaces 'loud, synthetic electronic tone' with 'sharp, high-pitched beep'. The two are similar, but the caption describes the sound at [90s-100s] as a 'loud, synthetic electronic tone', whereas the 'sharp, high-pitched beep' occurs at [110s-120s]. This tests whether the model can distinguish between specific audio descriptors tied to different timestamps.
