Ready annotator_b
Video ID: Xq_9TDk-hj0
Category: scene_dominant
Standard Vision ○
Misleading Vision ○
Standard Audio ○
Misleading Audio ○
While the shirtless man in torn leathers writhes violently against the thick ropes binding his wrists, what specific physical action does he perform with his lower body in an attempt to free himself?
A. He kicks desperately at the stone floor ✓ Correct
B. He scrabbles frantically at the moss-slicked ceiling
C. He rolls over onto his back to loosen the knots
D. He pushes off the wall with his feet to gain leverage
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [20s-30s] Modality: vision Category: causal

As the hooded figure lunges forward with arms outstretched to catch a falling companion mid-leap, what specific physical action does the hooded figure perform with their clothing?
A. He kicks desperately at the stone floor
B. He scrabbles frantically at the moss-slicked ceiling
C. He rolls over onto his back to loosen the knots
D. He pushes off the wall with his feet to gain leverage
E. The visual detail in the question is incorrect ✓ Correct
F. The audio detail in the question is incorrect
Answer timestamp: [20s-30s] Modality: vision Category: causal
Misleading Information
Category: person_action
Description: The misleading premise shifts focus from the bound shirtless man (who kicks) to the hooded figure (who lunges). A model not watching the video might conflate the actions of different characters or assume the 'lunging' character performs the same type of struggle as the 'bound' character, leading to incorrect associations between character and action.

At the moment the shirtless man's strained voice cuts through the chaos shouting 'I can't hold on—someone grab my arm!', what other distinct sound effect is described as occurring simultaneously in the air?
A. The frantic scrape-scrabble of claws on stone ✓ Correct
B. The sharp click-clack of a crossbow mechanism loading
C. The rhythmic thud-thud-thud of armored boots approaching
D. The loud shattering of glass nearby
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [20s-30s] Modality: audio Category: causal

Following the desperate whisper from a hooded figure saying 'Stay calm—they'll take us alive if we give them a chance', what distinct sound effect is described as occurring immediately after this statement?
A. The frantic scrape-scrabble of claws on stone
B. The sharp click-clack of a crossbow mechanism loading
C. The rhythmic thud-thud-thud of armored boots approaching
D. The loud shattering of glass nearby
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect ✓ Correct
Answer timestamp: [20s-30s] Modality: audio Category: causal
Misleading Information
Category: speech_context
Description: The misleading premise swaps the specific dialogue line and speaker context. The correct answer relies on the specific temporal proximity of 'claws on stone' to the shirtless man's shout. The wrong premise references a different line spoken by a different character later in the sequence (or in a different segment), where the surrounding audio context differs, testing if the model tracks specific speech-to-sound correlations.
