Audio doesn't have layers. As soon as its compliled into an MP3 or WAVE, it loses the layers that may have been used when creating.
On the other hand, you can have audio programs that do cut out singing. It tends to work by cutting out a certain band of frequency and loudness that would probably be singing.
For this reason, most programs don't do it efficently, but some programs work okay.
Do a search in