← Dashboard 35FF7e1-zCg
Ready annotator_b
Video ID: 35FF7e1-zCg
Category: speech_dominant
Standard Vision ○
Misleading Vision ○
Standard Audio ○
Misleading Audio ○
At the moment the suited man stands near the green horizontal blinds and gestures sharply while delivering his critique, what specific visual element on the wall directly behind him casts jagged shadows?
A.A bulletin board plastered with torn notices ✓ Correct
B.Rows of vintage television sets blinking with static
C.Shelves lined with color-coded books
D.A chalkboard covered in writing
E.The visual detail in the question is incorrect
F.The audio detail in the question is incorrect
Answer timestamp: [120s-130s]s Modality: vision Category: temporal

Annotation

When the suited man is seated behind his desk with his hands clasped tightly during the initial conversation, what specific visual element on the wall directly behind him casts jagged shadows?
A.A bulletin board plastered with torn notices
B.Rows of vintage television sets blinking with static
C.Shelves lined with color-coded books
D.A chalkboard covered in writing
E.The visual detail in the question is incorrect ✓ Correct
F.The audio detail in the question is incorrect
Answer timestamp: [120s-130s]s Modality: vision Category: temporal
Misleading Information
Category: location_detail
Description: This misleads models by swapping the character's posture and location (standing by blinds vs. seated at desk). A model relying on general scene knowledge might know there is a bulletin board, but must verify the specific timestamp where the man is STANDING by the blinds to confirm the shadow source matches that specific framing, rather than the earlier scene where he was seated.

Annotation

Just as the suited man's speech about reporters being 'plotters' is severed mid-sentence while he stands by the window, what overwhelming audio phenomenon instantly dominates the soundscape?
A.A loud, harsh, low-frequency electronic buzz ✓ Correct
B.The rhythmic clack-clack of typewriters
C.A sharp metallic scraping sound
D.The distant crackle of shortwave radios
E.The visual detail in the question is incorrect
F.The audio detail in the question is incorrect
Answer timestamp: [120s-130s]s Modality: audio Category: temporal

Annotation

Immediately after the suited man delivers the line 'I don't like you, Caulfield' while dragging his chair, what distinct audio event punctuates the silence?
A.A loud, harsh, low-frequency electronic buzz
B.The rhythmic clack-clack of typewriters
C.A sharp metallic scraping sound
D.The distant crackle of shortwave radios
E.The visual detail in the question is incorrect
F.The audio detail in the question is incorrect ✓ Correct
Answer timestamp: [120s-130s]s Modality: audio Category: temporal
Misleading Information
Category: sound_source
Description: This misleads models by referencing a very prominent sound effect from a different part of the video (the chair scrape at [80s-90s]) instead of the unique electronic buzz at [120s-130s]. Models must distinguish between two distinct 'interruption' sounds occurring in different contexts to avoid selecting the plausible but temporally incorrect distractor.

Annotation