Camera-Based-Table-Tennis-Posture-Analysis

Camera-Based Table Tennis Posture Analysis

Demo

demo

The Problem

Table tennis players need analyses of their opponents’ postures to optimize their game strategies, but it is too laborious and time-consuming to calculate a player’s postures by hands. Besides, most existing models are sensor-based.

Our Solution

We built a system to classify players’ postures (forehand and backhand) automatically based on their past game and practice videos, and we calculate ratios of players’ postures automatically based on the prediction from those classifiers.

Methods

Semantic Segmentation for Balls and Tables

Data Labeling

Screen Shot 2021-06-28 at 10 31 07 AM

Data Augmentation

Screen Shot 2021-06-28 at 10 31 35 AM

EfficientNet

EfficientNet is proposed by Google AI in 2019 and it uses a simple but highly effective compound coefficient to uniformly scales all dimensions of width, depth, and resolution.
Unlike other models that arbitrary scale a single dimension of the network, the compound scaling method uniformly scales up all dimensions in a principled way.

Screen Shot 2021-06-28 at 10 33 30 AM

U-Net Architecture

This architecture allows us to use a pre-trained model that has been used for a classification task - on a dataset such as ImageNet - as our encoder. Here, we use EfficientNet as the U-Net’s encoder.

Screen Shot 2021-06-28 at 10 34 53 AM

Evaluation

Screen Shot 2021-06-28 at 10 35 49 AM

Results

Screen Shot 2021-06-28 at 10 36 02 AM

Video

We output videos with the results of the two above mentioned methods.

Optimization

White points in backgrounds may be detected as balls. To deal with the problem, we recover pixels that be detected as balls at 70% of all the frames in a video.

Screen Shot 2021-06-28 at 10 37 03 AM

Conclusion

References

[1] R. Voeikov, N.Falaleev, R. Baikulov. TTNet: Real-time temporal and spatial video analysis of table tennis. CVPR. 2020.
[2] C. B. Lin, Z. Dong, W. K. Kuan, Y. F. Huang. A Framework for Fall Detection Based on OpenPose Skeleton and LSTM/GRU Models. In Applied Science. 2020.
[3] Z. Cao, G. Hidalgo, T. Simon, S. E. Wei, Y. Sheikh. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.43, No.1, pp. 172-186, Jan. 1 2021.
[4] C. Sawant. Human activity recognition with openpose and Long Short-Term Memory on real time images. IEEE 5th International Conference for Convergence in Technology (I2CT). 2020.