joelchan's working notes


inter-rater reliability


Referenced in

July 1st, 2021

All functional measures relied on a functional categorization: two researchers independently assigned one or more functional categories (from a bottom-up generated list of 16 categories) to each solution. NOTE: singular solutions (functional categories that only one person generated) were lumped together into a single "other" category, comprising 3% of solutions. Inter-rater reliability was high, at Cohen's κ = 0.87.
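
For reference, a minimal sketch of how Cohen's kappa can be computed for two raters. It simplifies the setup above by assuming each rater assigns exactly one category per solution; the note does not say how multi-category assignments were handled, so this is not necessarily the procedure used here. The function name `cohens_kappa` and the toy labels are made up for illustration.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who each give one label per item."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("Both raters must label the same non-empty set of items.")
    n = len(rater_a)

    # Observed agreement: share of items where the two labels match.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: for each category, the product of the two raters'
    # marginal proportions, summed over categories.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical category; kappa is undefined.
        raise ValueError("Kappa is undefined when chance agreement is 1.")
    return (observed - expected) / (1 - expected)


# Toy data (hypothetical): 10 solutions, 3 categories, one disagreement.
rater_a = ["A", "A", "B", "B", "C", "C", "A", "B", "C", "A"]
rater_b = ["A", "A", "B", "B", "C", "C", "A", "B", "C", "B"]
kappa = cohens_kappa(rater_a, rater_b)
print(round(kappa, 2))
```

In the toy data the raters agree on 9 of 10 items (observed = 0.90) and chance agreement is 0.33, so kappa = (0.90 - 0.33) / (1 - 0.33), roughly 0.85.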
