Crowd Counting Sound with Ref

Rethinking global context in crowd counting

The input image is first split into overlapping patches. Then, those patches go through tokens reduction block and main transformer to learn features with global information. To abstract global ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

Rethinking global context in crowd counting

Trending now