Goes from raw decoded floating point data from a sound file to a final plan for copying data to a target_duration_spec length
See Source File