Abstract:Addressing the issues of competition between detection features and Re-ID features in joint detection and embedding multi-object tracking methods, as well as difficulties in maintaining visual consistency for occluded targets in complex scenes, we propose an end-to-end hypergraph neural network matching tracking method, named HGTracker. Firstly, HGTracker introduces an enhanced Spatial Pyramid Pooling Networks (ESPPNet) module to enhance the detection capability of the target detection backbone network.This module aggregates features from different dimensions to adapt to different tasks in the tracking process, effectively alleviating the issue of competition between detection and Re-ID tasks in one-stage multi-object tracking methods. Secondly, it introduces a Short-term and Long-term Hypergraph Neural Network Matching module, which designs long-term and short-term hypergraph neural networks to associate unoccluded and occluded detection visual features. It transforms the data association problem into a hypergraph matching problem between trajectory hypergraphs and detection hypergraphs. The tracker models the relationship between trajectory segment information and the current detection frame information as a hypergraph neural network, maintaining visual trajectory consistency under severe occlusion. Experimental comparisons on the MOT17 and MOT20 datasets validate the effectiveness of the HGTracker tracking method.