Introduction
Over the last few years, monitoring technology has taken a leap forward and freed wearable sensors such as GPS trackers, accelerometers, and gyroscopes from many of the physical constraints they once imposed. This has paved the way for academic research on tracking and classifying data from numerous human activities by means of wearable smartwatches. The aim of this research project is to explore in depth the possibilities of context-aware applications in the field of strength training. The objective is pursued by collecting, preprocessing, and analyzing raw gyroscope and accelerometer data recorded from a wristband sensor during workout sessions.
Numerous works in the past have focused on tracking movements and analyzing user feedback during exercise. However, these systems do not fully replace the tasks fulfilled by personal trainers. Weight training, like cardiovascular aerobic exercise, is an essential aspect of health and fitness, yet few wearable devices track strength training exercises. At the moment, only one wearable device identifies the exercise being performed and tracks the repetitions in a set.
As context-aware applications advance further, they should eventually lead to digital personal trainers driven by artificial intelligence. First and foremost, a personal trainer must possess adequate knowledge of human anatomy and the basics of training and nutrition. Moreover, the trainer should be able to design a personalized training program for the client. Lastly, the trainer should assess form and have a method to track progress.
Moreover, by offering real-time feedback and tailored recommendations based on an individual's biomechanics, tracked fitness level, and goals, artificial intelligence algorithms incorporated into wearable devices have the potential to revolutionize the weightlifting industry. Having what amounts to a virtual personal trainer available whenever needed can help improve performance, customize workout plans, and enhance the overall quality of exercise. This would maximize training efficiency and reduce the risk of injury.
Integrating machine learning algorithms into wearable devices such as MetaMotion sensors and Apple Watches opens up possibilities for real-time analysis of biomechanical metrics during strength training sessions. Leveraging the gathered accelerometer and gyroscope measurements can provide in-depth insight into factors such as exercise form, joint angles, and the amount of force exerted. Such detailed feedback can help people engaging in strength training to fix their form, refine their repetitions, and reduce the risk of injury by identifying in advance areas vulnerable to strain or imbalance.
In addition to improving individual workouts, applications built with machine learning algorithms can also encourage broader engagement with the fitness community, where participants strive to become the best version of themselves and actively share knowledge with each other. Such a collaborative ecosystem not only fosters accountability and motivation but also accelerates the collective learning curve as participants benefit from each other's experiences. Finally, the quantification of individual fitness metrics will improve individual support, leading to more effective training practices that involve people of all skill levels and backgrounds.



Questions to be Answered
- Who will benefit most from this research project?
- What were the primary objectives of investigating context-aware applications in strength training, and what motivated this research?
- Was the data collection method viable for further extension of this study? What type of sensors were utilized?
- What was the optimal number of clusters? How were the different exercises classified?
- What was the primary goal of the participants in this project?
- People seeking to optimize their weightlifting training program and trying to get into science-based lifting.
- The objective of this project was to investigate the possibilities of context-aware applications in the domain of strength training. The impetus for the project was the lack of focus on context-aware applications in the field of activity-tracker devices.
- Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact. Accelerometer and gyroscope sensors were utilized.
- After performing several clustering tests, the optimal number of clusters was determined to be 4 by analyzing the elbow plot and the silhouette score. Cluster 1 covers almost all of the bench press and overhead press data, which could be attributed to the fact that both exercises involve similar movement patterns. The squat is captured in cluster 2, while the deadlift and row are captured in cluster 3. Cluster 4 captures the remainder of the data but fails to classify it accurately.
- To further optimize their training and workout sessions in pursuit of a higher level of fitness.
- Which was the best model for classification of different compound movements?
- How does quantifying and tracking one's fitness correlate with one's overall fitness level?
- Are there specific types of weight training exercises that appear to be similar?
- Why are different exercises classified as the same in some instances?
- Assessing the correlation between psychological factors and the accelerometer and gyroscope measurements: how do psychological factors affect training?
- Decision trees and random forests appear to be the best models, with an accuracy score of ~98%.
- Quantifying and tracking fitness through context-aware applications enables personalized monitoring, leading to enhanced overall fitness by offering insights into exercise patterns, form, and progress.
- Bench press and overhead press were classified as similar in a number of instances. A similar situation occurred with the deadlift and row.
- Bench press and overhead press involve similar movement patterns during execution. The same appears to be true for deadlifts and rows.
- No firm conclusion was reached regarding this question.
Snippet 1

Data Preparation
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact. Snippet 1 next to the text gives a brief glimpse of the raw data files that were gathered. Snippet 2 shows one of the files that measured the bench press movement of participant A. Snippet 3, the lower rightmost image, shows the final dataset the project will be using. A minimal sketch of this preparation pipeline follows the snippets.
Snippet 2

Snippet 3
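Below is a minimal sketch of how this preparation pipeline might look, assuming one exported CSV per participant, exercise, and sensor with an "epoch (ms)" column; the directory layout and column names are illustrative assumptions, not the project's exact files.

```python
import glob
import pandas as pd

# Assumed layout: one CSV per participant/exercise/sensor export under data/raw/.
frames = []
for path in glob.glob("data/raw/*.csv"):
    df = pd.read_csv(path)
    # Assumed column name: "epoch (ms)"; convert the epoch timestamps to datetimes.
    df["timestamp"] = pd.to_datetime(df["epoch (ms)"], unit="ms")
    frames.append(df.set_index("timestamp"))

merged = pd.concat(frames)
# Pickle preserves the datetime index and dtypes; CSV would flatten them back to text.
merged.to_pickle("data/interim/merged_sensor_data.pkl")
```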

Exploratory Data Analysis
Since data cleaning and preprocessing took a long time, only minor data exploration was done, centered on how the accelerometer and gyroscope measurements along different axes vary with the type of exercise (label). A small plotting sketch follows the figures below.
Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

Figure 6

Figure 7

Figure 8

Figure 9

Figure 10
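As an illustration of the kind of exploration described above, the sketch below compares accelerometer readings across exercises; the pickle path and column names (acc_x, acc_y, acc_z, label) are assumptions reused from the preparation sketch.

```python
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")  # assumed path

# One box plot per exercise label for each accelerometer axis.
df.boxplot(column=["acc_x", "acc_y", "acc_z"], by="label", figsize=(12, 4))
plt.suptitle("Accelerometer readings grouped by exercise")
plt.show()
```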



Clustering
Overview
Clustering is an unsupervised learning technique that groups similar unlabeled data points together. It is implemented to identify similar patterns or structures within a given dataset. Hierarchical clustering is a type of clustering algorithm that organizes data points into a hierarchy of clusters and forms a dendrogram. It can be either agglomerative, where clusters are merged based on proximity, or divisive, where clusters are split in a recursive manner. The choice of distance metric, such as Euclidean distance or Gower distance, also impacts the result of the clustering algorithm.
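A minimal sketch of the agglomerative (bottom-up) variant on a 50-row sample is shown below; the pickle path and feature names are assumptions carried over from the data preparation sketch.

```python
from scipy.cluster.hierarchy import dendrogram, linkage
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names
sample = df[features].sample(50, random_state=0)

# Ward linkage requires Euclidean distance; other metrics (e.g. Gower) would
# need a precomputed distance matrix instead.
Z = linkage(sample, method="ward", metric="euclidean")
dendrogram(Z)
plt.ylabel("Cluster distance")
plt.show()
```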
Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

After Transformation

Results and Conclusion
Elbow Plot

Silhouette Plot

Clustering Scatter Plot

Dendrogram - Hierarchical Clustering

As we can see, clustering is a suitable approach for this dataset. However, Principal Component Analysis along with outlier detection should be performed before clustering, as the scatter plot above shows that outliers are affecting the clustering solution. A clear elbow point is hard to determine from the elbow plot because the rate of decrease is fairly linear; the plot nonetheless suggests that 3-5 clusters are reasonable. The silhouette score peaks at 4 clusters, meaning that a 4-cluster solution gives the greatest cohesion within clusters and the greatest separation between clusters. Thus, from the elbow and silhouette plots we conclude that a 4-cluster solution is ideal for this particular dataset. In addition, 50 data points were sampled from the large dataset to obtain legible labels for the dendrogram. The dendrogram reflects agglomerative rather than divisive clustering, and 4 clusters can be recognized. The vertical axis represents the distance between clusters.
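A minimal sketch of the elbow and silhouette analysis with k-means is given below; the pickle path and feature subset are assumptions, and the exact clustering setup used in the project may differ.

```python
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")  # assumed path
X = df[["acc_x", "acc_y", "acc_z"]]                         # assumed feature subset

ks = range(2, 10)
inertias, silhouettes = [], []
for k in ks:
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias.append(km.inertia_)                        # within-cluster sum of squares
    silhouettes.append(silhouette_score(X, km.labels_))

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.plot(list(ks), inertias, marker="o")
ax1.set_title("Elbow plot")
ax2.plot(list(ks), silhouettes, marker="o")
ax2.set_title("Silhouette score")
plt.show()
```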
Principal Component Analysis
Overview
Principal Component Analysis (PCA) is a method for reducing the dimensionality of a complex dataset while preserving as much of its explanatory power as possible. The method transforms the original features into principal components, ordered by the variance they explain. PCA maximizes data variance along these components and aids visualization and data compression for machine learning purposes. Specifically, PCA projects the original features onto orthogonal directions (the principal components) along which the data variance is maximal. The next step of the project is to perform clustering and PCA again, but only after performing outlier detection.


Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

After Transformation

It takes 4 principal components to retain 90% of the explanatory power (shown in the code). On the other hand, 2 principal components can only explain ~51% of the dataset's information. A conclusion can only be drawn after performing outlier detection and then implementing the PCA method again. The score plot shows how the data points are distributed along 2 principal components, categorized by the type of exercise such as bench press, deadlift, overhead press, and rows.
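A minimal sketch of how the 90% threshold can be checked with the cumulative explained variance is shown below; the pickle path and feature names are assumptions.

```python
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler
import numpy as np
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names
X = StandardScaler().fit_transform(df[features])  # scale so no single axis dominates

pca = PCA().fit(X)
cumulative = np.cumsum(pca.explained_variance_ratio_)
n_components = np.argmax(cumulative >= 0.90) + 1  # smallest count reaching 90%
print(cumulative)
print(f"{n_components} components retain at least 90% of the variance")
```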
Naive Bayes
Overview
Among probabilistic classifiers, Naïve Bayes algorithms such as Multinomial NB and Bernoulli NB are fundamental, especially for text classification purposes. Under the assumption of feature independence, the multinomial NB model is a probabilistic classifier built on Bayes' theorem. In essence, it multiplies the probability of each feature occurring in a particular class and normalizes the result by the total likelihood of the features, which yields the probability of a given class for an observation consisting of numerous features. Text classification tasks are a natural fit for the model because the features typically represent word frequencies or word presence in documents; sentiment analysis is one such example. An essential aspect of NB model training is smoothing, which prevents the model from assigning zero probability to unseen data during the prediction phase. It is essential for handling scenarios where specific feature-value combinations have zero probability in the training data.
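As a toy illustration of the text-classification case described above (not the project's sensor data), the sketch below fits a multinomial NB classifier on word counts with Laplace smoothing.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Made-up documents and labels for demonstration only.
docs = ["great workout today", "terrible session", "great session", "terrible form today"]
labels = ["pos", "neg", "pos", "neg"]

vec = CountVectorizer()
X = vec.fit_transform(docs)        # word-count features
clf = MultinomialNB(alpha=1.0)     # alpha=1.0 is Laplace smoothing
clf.fit(X, labels)

# Smoothing prevents zero probabilities for word/class combinations that never
# occur in the training data (e.g. "form" never appears in a "pos" document).
print(clf.predict(vec.transform(["great form"])))
```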


Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

Cleaned Data

Trained Data

We judge that the Naive Bayes classifier is not a suitable fit for our dataset, since our goal is to assess how quantifying and tracking one's fitness correlates with one's overall fitness level. Moreover, after analyzing the confusion matrix and an accuracy of 0.59, we conclude that it is not ideal.
Decision Trees
Overview
Decision trees provide a straightforward yet efficient method for solving regression and classification problems. In order to generate segments that optimize homogeneity within sub-segments, the algorithm recursively chooses optimal features to split the data based on a predetermined criterion during the training phase. During this phase, a number of splitting criteria are evaluated, such as information gain, entropy, and Gini impurity. Gini impurity measures how often a randomly selected element from the data set would be incorrectly labeled, while entropy measures the amount of disorder in a data set. Once the decision tree is built, predicting future instances requires navigating the tree on the basis of the input features, going from the root node to the appropriate leaf node. The algorithm assesses the feature linked to each internal node and proceeds along the relevant branch according to the feature value.
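A minimal sketch of fitting such a classifier on the sensor features is shown below; the pickle path, feature names, and hyperparameters are assumptions rather than the project's exact configuration.

```python
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names
X_train, X_test, y_train, y_test = train_test_split(
    df[features], df["label"], test_size=0.25, random_state=0
)

# criterion="gini" uses Gini impurity; "entropy" would use information gain instead.
tree = DecisionTreeClassifier(criterion="gini", max_depth=10, random_state=0)
tree.fit(X_train, y_train)
print("Test accuracy:", accuracy_score(y_test, tree.predict(X_test)))
```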


Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

Cleaned Data

Trained Data

Decision Tree

Feature Importance

Confusion Matrix

We conclude that a decision tree is a suitable fit for our dataset, since our goal is to assess how quantifying and tracking one's fitness correlates with one's overall fitness level, and the feature importance plot gives us insight into the significance of each feature. Moreover, after analyzing the confusion matrix and an accuracy of 0.98, we conclude that this model is relevant. The decision tree plot shows how the classifier makes decisions based on the features in the dataset: nodes represent decision points, edges represent the decision paths taken by the classifier, and leaves represent the final decision outcomes.
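A minimal sketch of how the feature importance and tree plots referenced above might be produced is given below, here with a random forest; the path, feature names, and model settings are assumptions.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import plot_tree
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(df[features], df["label"])

# Feature-importance bar chart.
pd.Series(forest.feature_importances_, index=features).sort_values().plot.barh()
plt.title("Feature importance")
plt.show()

# First few levels of a single tree from the forest (nodes, edges, leaves).
plot_tree(forest.estimators_[0], feature_names=features, max_depth=2, filled=True)
plt.show()
```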


Support Vector Machines
Overview
A support vector machine (SVM) is a supervised machine learning algorithm originally designed for two-group classification problems; it uses labeled data to train the model. A key component in improving SVMs' capacity for generalization is Structural Risk Minimization (SRM). Fundamentally, SRM seeks to balance the reduction of model complexity, to avoid overfitting, with the minimization of empirical risk, i.e. the training error. Thanks to the incorporation of SRM principles, the algorithm prioritizes finding a decision boundary that fits the training data well and also generalizes well to new data. This emphasis on generalization is essential for SVMs to function well in real-world applications where adaptability to new data is critical. By optimizing the trade-off between bias and variance through SRM, SVMs are able to produce more accurate and dependable predictions. Moreover, SVMs are based on the maximum margin classifier concept, which offers a strong foundation for binary classification tasks. The fundamental goal of the maximum margin classifier is to find the hyperplane that maximizes the margin, that is, the distance between the hyperplane and the closest data points from both classes, the support vectors. Mathematically, this amounts to solving an optimization problem for the hyperplane parameters that maximize the margin while separating the data. By maximizing the margin, the classifier becomes more robust against noise and anomalies in the data, which improves its generalization performance.
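A minimal sketch of how the kernel and C combinations evaluated below can be trained is shown here; the pickle path and feature names are assumptions.

```python
from sklearn.svm import SVC
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names
X_train, X_test, y_train, y_test = train_test_split(
    df[features], df["label"], test_size=0.25, random_state=0
)

for kernel in ["linear", "poly", "rbf"]:
    for C in [0.1, 1, 10]:
        # Larger C penalizes training errors more heavily (narrower margin,
        # weaker regularization); smaller C favors a wider, simpler margin.
        model = make_pipeline(StandardScaler(), SVC(kernel=kernel, C=C))
        model.fit(X_train, y_train)
        acc = accuracy_score(y_test, model.predict(X_test))
        print(f"{kernel:>6} kernel, C={C:<4}: accuracy {acc:.2f}")
```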
Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

Cleaned Data

Trained Data

Results and Conclusion
Confusion Matrix 1

Confusion Matrix 2

Confusion Matrix 3

Confusion Matrix 4

Confusion Matrix 5

Confusion Matrix 6

Confusion Matrix 7

Confusion Matrix 8

Confusion Matrix 9

Visualization

The accuracy score of each model was computed from its confusion matrix by summing the diagonal elements (correctly classified instances) and dividing by the sum of all elements.
The accuracy scores we obtained are:
- Accuracy for linear kernel with C=0.1: 0.76
- Accuracy for linear kernel with C=1: 0.82
- Accuracy for linear kernel with C=10: 0.83
- Accuracy for poly kernel with C=0.1: 0.22
- Accuracy for poly kernel with C=1: 0.32
- Accuracy for poly kernel with C=10: 0.49
- Accuracy for rbf kernel with C=0.1: 0.50
- Accuracy for rbf kernel with C=1: 0.67
- Accuracy for rbf kernel with C=10: 0.79
However, even after implementing the support vector machine algorithm, the decision boundary could not be clearly determined because the data points lie close to each other. It can be concluded that a linear kernel with a C value of 10 produced the best model, with an accuracy score of 0.83.
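The accuracy computation described above can be reproduced directly from a confusion matrix; the labels below are made up purely to illustrate the arithmetic.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Toy labels standing in for the test labels and predictions of one model.
y_true = ["bench", "ohp", "squat", "bench", "row", "dead"]
y_pred = ["bench", "bench", "squat", "bench", "row", "dead"]

cm = confusion_matrix(y_true, y_pred)
# Diagonal entries are correctly classified instances; their sum (the trace)
# divided by the total number of instances gives the accuracy.
accuracy = np.trace(cm) / cm.sum()
print(f"Accuracy: {accuracy:.2f}")  # 5 of 6 correct -> 0.83
```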
Neural Networks
Overview
Inspired by the structure and functions of the human brain, neural networks are computational models that seek to identify patterns and relationships in data. A neural network comprises interconnected nodes, or neurons, that process information via forward propagation, in which inputs lead to output generation through a sequence of weighted connections and activation functions. Neural networks learn to perform tasks such as classification, regression, and pattern recognition by adjusting internal parameters to minimize the difference between expected and actual outputs through iterative training methods like backpropagation.
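A minimal sketch of a small feed-forward network on the sensor features is shown below; the architecture, pickle path, and feature names are assumptions rather than the project's exact setup.

```python
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import train_test_split
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")          # assumed path
features = ["acc_x", "acc_y", "acc_z", "gyr_x", "gyr_y", "gyr_z"]   # assumed names
X_train, X_test, y_train, y_test = train_test_split(
    df[features], df["label"], test_size=0.25, random_state=0
)

# Two hidden layers; scaling the inputs is important for gradient-based training.
mlp = make_pipeline(
    StandardScaler(),
    MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500, random_state=0),
)
mlp.fit(X_train, y_train)
print("Test accuracy:", mlp.score(X_test, y_test))
```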

Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

Cleaned Data

Trained Data

Test Data

The model generated from the neural network implementation is not learning effectively from the data. It appears to predict only one class for all instances, which leads to high accuracy for that particular class but poor performance overall. The model also underfits, as its architecture is not complex enough to capture all the relevant relationships.

Linear Regression
Overview
Linear regression is a statistical method used to model the relationship between a dependent variable and one or more independent variables by fitting a linear equation to the observed data. It works by estimating, from the independent variables, the coefficients of the linear equation that best account for the variability in the dependent variable. Although linear regression is simple and easy to understand, its limitations include the prior assumption of linearity between variables, sensitivity to outliers, and an inability to capture complex nonlinear relationships. Moreover, it assumes constant variance and independence of observations, which is not always the case when raw real-world data is collected and preprocessed.
Data Prep
Data was collected with MetaMotion wearables, devices that offer real-time, continuous monitoring of motion and environmental sensor data. The raw data consists of accelerometer and gyroscope measurements recorded for every participant during their workout sessions. The dataset was created by iterating over numerous raw data files, cleaning and preprocessing them, and finally merging them into a single pickle dataset. The dataset was exported as a pickle rather than a CSV because it contains numerous epoch timestamps, which pickle preserves with their types intact.
Before Transformation

Cleaned Data

Modified Data

Results and Conclusion
Linear Regression Fit

Logistic Regression Fit

The linear equation obtained for the linear regression model is acc_y = -1.10*acc_x + 0.11. The negative slope coefficient indicates an inverse relationship between the independent variable 'acc_x' and the dependent variable 'acc_y', where acc_x and acc_y are accelerometer readings captured with the MetaMotion device. Logistic regression was also performed with the categorical variable 'label', which is the type of exercise, as the target. The logistic regression model computes the probability of each observation belonging to each class (e.g., bench, ohp, squat, row).
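A minimal sketch of both fits described above is given below; the pickle path is an assumption, while acc_x, acc_y, and label are the column names referred to in the text.

```python
from sklearn.linear_model import LinearRegression, LogisticRegression
import pandas as pd

df = pd.read_pickle("data/interim/merged_sensor_data.pkl")  # assumed path

# Linear regression: acc_y as a function of acc_x.
lin = LinearRegression().fit(df[["acc_x"]], df["acc_y"])
print(f"acc_y = {lin.coef_[0]:.2f} * acc_x + {lin.intercept_:.2f}")

# Logistic regression: probability of each exercise label from the accelerometer axes.
log_reg = LogisticRegression(max_iter=1000).fit(df[["acc_x", "acc_y", "acc_z"]], df["label"])
print(log_reg.classes_)                                               # the exercise classes
print(log_reg.predict_proba(df[["acc_x", "acc_y", "acc_z"]].head()))  # class probabilities per row
```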
Conclusion
The objective of this project was to investigate the possibilities of context-aware applications in the domain of strength training. The impetus for the project was the lack of focus on context-aware applications in the field of activity-tracker devices.
During the strength training sessions of five participants, data from wristband accelerometer and gyroscope sensors was gathered while the participants performed the five fundamental compound barbell lifts with medium to heavy weights. To ensure participants trained to failure, the 1RM (one-rep max) technique was employed. The most effective exercise classification model was obtained by applying Decision Tree and Random Forest classifiers to the preprocessed data as part of the quantified-self cycle. The model was able to classify unseen cases with an overall accuracy of ~98%. However, it could not achieve a perfect accuracy score, which can be attributed to the fact that it misclassified some bench press instances as overhead press movements and vice versa; the same holds true for deadlifts and rows. A likely reason is the similarity of the movements in these exercise pairs.
After performing several clustering tests, the optimal number of clusters was determined to be 4 by analyzing the elbow plot and the silhouette score. Cluster 1 covers almost all of the bench press and overhead press data, which could be attributed to the fact that both exercises involve similar movement patterns. The squat is captured in cluster 2, while the deadlift and row are captured in cluster 3. Cluster 4 captures the remainder of the data but fails to classify it accurately. Additionally, hierarchical clustering was performed after sampling 50 data points from the dataset to plot the dendrogram.
Principal Component Analysis and Naive Bayes were also implemented. However, they were judged unsuitable: it takes 4 principal components to retain at least 90% of the explanatory power of the dataset, while 2 principal components barely retain 51%. Perhaps a Fourier transformation should be applied before implementing Principal Component Analysis. As for the Naive Bayes algorithm, an accuracy score of 59% is far too poor to be worth optimizing.
Support Vector Machines were deployed with three different C values (0.1, 1, 10) and three different kernels (linear, poly, rbf). The accuracy score of each model was computed from its confusion matrix by summing the diagonal elements (correctly classified instances) and dividing by the sum of all elements. However, the decision boundary, or hyperplane, could not be clearly determined because of the proximity of the data points. The optimal combination was determined to be C = 10 with a linear kernel, with an accuracy score of 83%.
Finally, neural networks and regression were implemented. The model generated from the neural network implementation is not learning effectively from the data: it appears to predict only one class for all instances, which leads to high accuracy for that class but poor performance overall. The model also underfits, as its architecture is not complex enough to capture all the relevant relationships. The linear equation obtained for the linear regression model is acc_y = -1.10*acc_x + 0.11. The negative slope coefficient indicates an inverse relationship between the independent variable 'acc_x' and the dependent variable 'acc_y', where acc_x and acc_y are accelerometer readings captured with the MetaMotion device. Logistic regression was also performed with the categorical variable 'label', the type of exercise, as the target; the model computes the probability of each observation belonging to each class (e.g., bench, ohp, squat, row).