You mention that the video should have this specs:
Video file (.mp4): Monocular video, 81 frames, 832×480 resolution, 16fps
But is it possible to have a video input file of Full-HD or UHD resolution using frames rates such als 23.986, 24, 25, 30 or 60p?
Could you built an AI for automatically creating the txt-files?