The top documents tagged [reward preference q]