Bounded scales:  a new type of scale for educational practice and research

Paul Vossen; Suraj Ajit

doi:10.21125/edulearn.2023.1818

Bounded scales: a new type of scale for educational practice and research

Paul Vossen^*, Suraj Ajit

^*Corresponding author for this work

Technology

Research output: Contribution to Book/Report › Conference Contribution › peer-review

Abstract

In Higher Education, achievement assessment of students (aka summative evaluation) is nearly always done by qualified humans, called assessors, by means of counting or scoring on bounded scales related to quality criteria that are specific to the assignment or test. We do not want to criticize or change this age-old tradition. On the contrary: we accept this practice as it is, not the least because a viable alternative is not yet in sight (for theoretical as well as practical reasons, we consider psychometric tests and learning analytics, though available, as non-viable). Moreover, assessment by well-qualified and well-trained assessors appears to be reasonably effective and robust, whereas it is often not possible to delegate it to some testing instrument or software (exceptions, of course, are the simplest forms of assessment like multiple-choice tests and short essays). Thus, human assessment as such, in its double connotation of “assessment of humans by humans”, is not a big issue in educational assessment practice and research. The real issue though is the deceptively simple-looking phrase “bounded scale.” Unfortunately, there is not yet a well-developed theory or methodology of measuring on bounded scales that is flexible and practical enough for recurrent or incidental use in educational assessment. Therefore, we set out to develop such a theory of measurement for bounded scales satisfying several technical and practical requirements:
1. It should enable assessors to add, multiply, and average bounded scores measured for any number of quality criteria.
2. It should be possible to convert scores from one bounded scale to another bounded scale, with the lower bound, upper bound, and neutral score as parameters.
3. There should be a systematic way of standardizing bounded scales, so that scores can be interpreted independently from the original measurement scale.
4. It should be easy to aggregate, or average, scores from a given bounded scale to get a single bounded score that represents the original bounded scores analogous to the way that the arithmetic mean represents a set of boundary-free decimal numbers.

The above list is not complete, but it highlights the key features of the concept/construct of bounded scale. It appears that the resulting scale type has all the characteristics of an algebraic module, with scores as module elements, and addition and scalar multiplication of scores as module operations. Moreover, using the two basic operations, it is rather straightforward to define the neutral score, the inverse of a given score, the subtraction operation, the absolute value of a score, the distance between two scores, and finally an n-place operation called quasi-arithmetic mean of scores. Thus, bounded scales conceived of as modules have a rich structure of their own. As modules are a generalisation of ordinary linear spaces, we may still talk about scores and their manifold manipulations as if they were decimal numbers with associated numeric operations, but when it comes to actual calculations, we must of course apply the new operations defined for bounded scales as genuine modules.
We will demonstrate the above construction for the percentage scale, i.e., the standard bounded scale of scores between 0 (0%) and 1 (100%) with a variable neutral score. This scale type is well-known and often used in educational assessment for its unique properties and relatively simple rules of calculation. For neutral scores of 0%, 50%, or 100%, there are simple (but different) formulae for score addition, multiplication, and averaging. However, for arbitrary neutral scores, we must introduce a so-called generating function with two auxiliary parameters (both dependent on the neutral score) to get the correct (generalized) definitions of the score operations. Furthermore, we will show how to get a complete characterisation of the signed percentage scale with signed scores between ‒1 (‒100%) and +1 (+100%), again with a variable neutral score, using the same approach as for the standard percentage scale, but slightly adapted formulae and auxiliary parameters.

Original language	English
Title of host publication	EDULEARN23 Proceedings
Publisher	International Academy of Technology, Education and Development (IATED)
Pages	6912-6922
Number of pages	11
ISBN (Electronic)	978-84-09-52151-7
DOIs	https://doi.org/10.21125/edulearn.2023.1818
Publication status	Published - 3 Jul 2023
Event	15th Annual International Conference on Education and New Learning Technologies, EDULEARN 23 - Palma de Mallorca, Spain Duration: 3 Jul 2023 → 5 Jul 2023 https://iated.org/edulearn/

Publication series

Name	EDULEARN Conference Proceedings
Publisher	IATED
ISSN (Electronic)	2340-1117

Conference

Conference	15th Annual International Conference on Education and New Learning Technologies, EDULEARN 23
Country/Territory	Spain
City	Palma de Mallorca
Period	3/07/23 → 5/07/23
Internet address	https://iated.org/edulearn/

Keywords

measurement
assessment
grading
marking
scoring
rating
bounded scale
standard percentage scale
signed percentage scale
quasi-arithmetic mean

Access to Document

10.21125/edulearn.2023.1818

Cite this

@inproceedings{b289f9dfb5154902a83e5131348a5ca3,

title = "Bounded scales: a new type of scale for educational practice and research ",

abstract = "In Higher Education, achievement assessment of students (aka summative evaluation) is nearly always done by qualified humans, called assessors, by means of counting or scoring on bounded scales related to quality criteria that are specific to the assignment or test. We do not want to criticize or change this age-old tradition. On the contrary: we accept this practice as it is, not the least because a viable alternative is not yet in sight (for theoretical as well as practical reasons, we consider psychometric tests and learning analytics, though available, as non-viable). Moreover, assessment by well-qualified and well-trained assessors appears to be reasonably effective and robust, whereas it is often not possible to delegate it to some testing instrument or software (exceptions, of course, are the simplest forms of assessment like multiple-choice tests and short essays). Thus, human assessment as such, in its double connotation of “assessment of humans by humans”, is not a big issue in educational assessment practice and research. The real issue though is the deceptively simple-looking phrase “bounded scale.” Unfortunately, there is not yet a well-developed theory or methodology of measuring on bounded scales that is flexible and practical enough for recurrent or incidental use in educational assessment. Therefore, we set out to develop such a theory of measurement for bounded scales satisfying several technical and practical requirements:1. It should enable assessors to add, multiply, and average bounded scores measured for any number of quality criteria.2. It should be possible to convert scores from one bounded scale to another bounded scale, with the lower bound, upper bound, and neutral score as parameters.3. There should be a systematic way of standardizing bounded scales, so that scores can be interpreted independently from the original measurement scale.4. It should be easy to aggregate, or average, scores from a given bounded scale to get a single bounded score that represents the original bounded scores analogous to the way that the arithmetic mean represents a set of boundary-free decimal numbers.The above list is not complete, but it highlights the key features of the concept/construct of bounded scale. It appears that the resulting scale type has all the characteristics of an algebraic module, with scores as module elements, and addition and scalar multiplication of scores as module operations. Moreover, using the two basic operations, it is rather straightforward to define the neutral score, the inverse of a given score, the subtraction operation, the absolute value of a score, the distance between two scores, and finally an n-place operation called quasi-arithmetic mean of scores. Thus, bounded scales conceived of as modules have a rich structure of their own. As modules are a generalisation of ordinary linear spaces, we may still talk about scores and their manifold manipulations as if they were decimal numbers with associated numeric operations, but when it comes to actual calculations, we must of course apply the new operations defined for bounded scales as genuine modules.We will demonstrate the above construction for the percentage scale, i.e., the standard bounded scale of scores between 0 (0%) and 1 (100%) with a variable neutral score. This scale type is well-known and often used in educational assessment for its unique properties and relatively simple rules of calculation. For neutral scores of 0%, 50%, or 100%, there are simple (but different) formulae for score addition, multiplication, and averaging. However, for arbitrary neutral scores, we must introduce a so-called generating function with two auxiliary parameters (both dependent on the neutral score) to get the correct (generalized) definitions of the score operations. Furthermore, we will show how to get a complete characterisation of the signed percentage scale with signed scores between ‒1 (‒100%) and +1 (+100%), again with a variable neutral score, using the same approach as for the standard percentage scale, but slightly adapted formulae and auxiliary parameters.",

keywords = "measurement, assessment, grading, marking, scoring, rating, bounded scale, standard percentage scale, signed percentage scale, quasi-arithmetic mean",

author = "Paul Vossen and Suraj Ajit",

year = "2023",

month = jul,

day = "3",

doi = "10.21125/edulearn.2023.1818",

language = "English",

series = "EDULEARN Conference Proceedings",

publisher = "International Academy of Technology, Education and Development (IATED)",

pages = "6912--6922",

booktitle = "EDULEARN23 Proceedings",

note = "15th Annual International Conference on Education and New Learning Technologies, EDULEARN 23 ; Conference date: 03-07-2023 Through 05-07-2023",

url = "https://iated.org/edulearn/",

}

Vossen, P & Ajit, S 2023, Bounded scales: a new type of scale for educational practice and research . in EDULEARN23 Proceedings. EDULEARN Conference Proceedings, International Academy of Technology, Education and Development (IATED), pp. 6912-6922, 15th Annual International Conference on Education and New Learning Technologies, EDULEARN 23, Palma de Mallorca, Spain, 3/07/23. https://doi.org/10.21125/edulearn.2023.1818

TY - GEN

T1 - Bounded scales

T2 - 15th Annual International Conference on Education and New Learning Technologies, EDULEARN 23

AU - Vossen, Paul

AU - Ajit, Suraj

PY - 2023/7/3

Y1 - 2023/7/3

N2 - In Higher Education, achievement assessment of students (aka summative evaluation) is nearly always done by qualified humans, called assessors, by means of counting or scoring on bounded scales related to quality criteria that are specific to the assignment or test. We do not want to criticize or change this age-old tradition. On the contrary: we accept this practice as it is, not the least because a viable alternative is not yet in sight (for theoretical as well as practical reasons, we consider psychometric tests and learning analytics, though available, as non-viable). Moreover, assessment by well-qualified and well-trained assessors appears to be reasonably effective and robust, whereas it is often not possible to delegate it to some testing instrument or software (exceptions, of course, are the simplest forms of assessment like multiple-choice tests and short essays). Thus, human assessment as such, in its double connotation of “assessment of humans by humans”, is not a big issue in educational assessment practice and research. The real issue though is the deceptively simple-looking phrase “bounded scale.” Unfortunately, there is not yet a well-developed theory or methodology of measuring on bounded scales that is flexible and practical enough for recurrent or incidental use in educational assessment. Therefore, we set out to develop such a theory of measurement for bounded scales satisfying several technical and practical requirements:1. It should enable assessors to add, multiply, and average bounded scores measured for any number of quality criteria.2. It should be possible to convert scores from one bounded scale to another bounded scale, with the lower bound, upper bound, and neutral score as parameters.3. There should be a systematic way of standardizing bounded scales, so that scores can be interpreted independently from the original measurement scale.4. It should be easy to aggregate, or average, scores from a given bounded scale to get a single bounded score that represents the original bounded scores analogous to the way that the arithmetic mean represents a set of boundary-free decimal numbers.The above list is not complete, but it highlights the key features of the concept/construct of bounded scale. It appears that the resulting scale type has all the characteristics of an algebraic module, with scores as module elements, and addition and scalar multiplication of scores as module operations. Moreover, using the two basic operations, it is rather straightforward to define the neutral score, the inverse of a given score, the subtraction operation, the absolute value of a score, the distance between two scores, and finally an n-place operation called quasi-arithmetic mean of scores. Thus, bounded scales conceived of as modules have a rich structure of their own. As modules are a generalisation of ordinary linear spaces, we may still talk about scores and their manifold manipulations as if they were decimal numbers with associated numeric operations, but when it comes to actual calculations, we must of course apply the new operations defined for bounded scales as genuine modules.We will demonstrate the above construction for the percentage scale, i.e., the standard bounded scale of scores between 0 (0%) and 1 (100%) with a variable neutral score. This scale type is well-known and often used in educational assessment for its unique properties and relatively simple rules of calculation. For neutral scores of 0%, 50%, or 100%, there are simple (but different) formulae for score addition, multiplication, and averaging. However, for arbitrary neutral scores, we must introduce a so-called generating function with two auxiliary parameters (both dependent on the neutral score) to get the correct (generalized) definitions of the score operations. Furthermore, we will show how to get a complete characterisation of the signed percentage scale with signed scores between ‒1 (‒100%) and +1 (+100%), again with a variable neutral score, using the same approach as for the standard percentage scale, but slightly adapted formulae and auxiliary parameters.

AB - In Higher Education, achievement assessment of students (aka summative evaluation) is nearly always done by qualified humans, called assessors, by means of counting or scoring on bounded scales related to quality criteria that are specific to the assignment or test. We do not want to criticize or change this age-old tradition. On the contrary: we accept this practice as it is, not the least because a viable alternative is not yet in sight (for theoretical as well as practical reasons, we consider psychometric tests and learning analytics, though available, as non-viable). Moreover, assessment by well-qualified and well-trained assessors appears to be reasonably effective and robust, whereas it is often not possible to delegate it to some testing instrument or software (exceptions, of course, are the simplest forms of assessment like multiple-choice tests and short essays). Thus, human assessment as such, in its double connotation of “assessment of humans by humans”, is not a big issue in educational assessment practice and research. The real issue though is the deceptively simple-looking phrase “bounded scale.” Unfortunately, there is not yet a well-developed theory or methodology of measuring on bounded scales that is flexible and practical enough for recurrent or incidental use in educational assessment. Therefore, we set out to develop such a theory of measurement for bounded scales satisfying several technical and practical requirements:1. It should enable assessors to add, multiply, and average bounded scores measured for any number of quality criteria.2. It should be possible to convert scores from one bounded scale to another bounded scale, with the lower bound, upper bound, and neutral score as parameters.3. There should be a systematic way of standardizing bounded scales, so that scores can be interpreted independently from the original measurement scale.4. It should be easy to aggregate, or average, scores from a given bounded scale to get a single bounded score that represents the original bounded scores analogous to the way that the arithmetic mean represents a set of boundary-free decimal numbers.The above list is not complete, but it highlights the key features of the concept/construct of bounded scale. It appears that the resulting scale type has all the characteristics of an algebraic module, with scores as module elements, and addition and scalar multiplication of scores as module operations. Moreover, using the two basic operations, it is rather straightforward to define the neutral score, the inverse of a given score, the subtraction operation, the absolute value of a score, the distance between two scores, and finally an n-place operation called quasi-arithmetic mean of scores. Thus, bounded scales conceived of as modules have a rich structure of their own. As modules are a generalisation of ordinary linear spaces, we may still talk about scores and their manifold manipulations as if they were decimal numbers with associated numeric operations, but when it comes to actual calculations, we must of course apply the new operations defined for bounded scales as genuine modules.We will demonstrate the above construction for the percentage scale, i.e., the standard bounded scale of scores between 0 (0%) and 1 (100%) with a variable neutral score. This scale type is well-known and often used in educational assessment for its unique properties and relatively simple rules of calculation. For neutral scores of 0%, 50%, or 100%, there are simple (but different) formulae for score addition, multiplication, and averaging. However, for arbitrary neutral scores, we must introduce a so-called generating function with two auxiliary parameters (both dependent on the neutral score) to get the correct (generalized) definitions of the score operations. Furthermore, we will show how to get a complete characterisation of the signed percentage scale with signed scores between ‒1 (‒100%) and +1 (+100%), again with a variable neutral score, using the same approach as for the standard percentage scale, but slightly adapted formulae and auxiliary parameters.

KW - measurement

KW - assessment

KW - grading

KW - marking

KW - scoring

KW - rating

KW - bounded scale

KW - standard percentage scale

KW - signed percentage scale

KW - quasi-arithmetic mean

U2 - 10.21125/edulearn.2023.1818

DO - 10.21125/edulearn.2023.1818

M3 - Conference Contribution

T3 - EDULEARN Conference Proceedings

SP - 6912

EP - 6922

BT - EDULEARN23 Proceedings

PB - International Academy of Technology, Education and Development (IATED)

Y2 - 3 July 2023 through 5 July 2023

ER -