Queue parent - global vs specific interface

Is there any difference (e.g. in CPU/core utilization) when using a specific interface as the queue parent instead of the catch-all global?

Looking at the flow diagrams alone, there shouldn’t be any. But since global is essentially an artificial interface, one might expect some kind of overhead. Unfortunately, after some digging on the forum, I didn’t manage to find anything on this topic.