Probably it has more to do with the complexity of simultaneously implementing two different drag and drop subsystems, one for folders and one for tokens. Separating them in that way lets them work independently, as the cost of a bit fo cognitive dissonance in the use.