Thanks for the answer!
For the sake of prototyping I want to do the following:
I want to have the player input a button at a 4/4 rhythm. Then by comparing their input to the bpm of a track playing in the background, we slow down or speed up the track depending on user inputs. So in order to generalize this to multiple tracks, I want to be able to extract the data from a track, like bar length and stuff.
I already have an RTCP set up to increase and decrease the speed, but now it's about using the input of the user and comparing it to the bpm of a track.