If you find that after annotating audio data, the visible audio wave doesn’t match the timestamps and the sound, try converting the audio to a different format. For example, if you are annotating mp3 files, try converting them to wav files.
ffmpeg -y -i audio.mp3 -ar 8k -ac 1 audio.wav