How do interval scales help us with better understanding IR evaluation measures?