12 options scored across complexity × context × quality × latency × privacy × tool-use × budget. Top 3 highlighted.