Interpretation for scales of measurement linking with abstract algebra

Sawamura, Jitsuki; Morishita, Shigeru; Ishigooka, Jun

doi:10.1186/2043-9113-4-9

Research
Open access
Published: 10 June 2014

Interpretation for scales of measurement linking with abstract algebra

Jitsuki Sawamura¹,
Shigeru Morishita² &
Jun Ishigooka¹

Journal of Clinical Bioinformatics volume 4, Article number: 9 (2014) Cite this article

7083 Accesses
3 Citations
1 Altmetric
Metrics details

Abstract

The Stevens classification of levels of measurement involves four types of scale: “Nominal”, “Ordinal”, “Interval” and “Ratio”. This classification has been used widely in medical fields and has accomplished an important role in composition and interpretation of scale. With this classification, levels of measurements appear organized and validated. However, a group theory-like systematization beckons as an alternative because of its logical consistency and unexceptional applicability in the natural sciences but which may offer great advantages in clinical medicine. According to this viewpoint, the Stevens classification is reformulated within an abstract algebra-like scheme; ‘Abelian modulo additive group’ for “Ordinal scale” accompanied with ‘zero’, ‘Abelian additive group’ for “Interval scale”, and ‘field’ for “Ratio scale”. Furthermore, a vector-like display arranges a mixture of schemes describing the assessment of patient states. With this vector-like notation, data-mining and data-set combination is possible on a higher abstract structure level based upon a hierarchical-cluster form. Using simple examples, we show that operations acting on the corresponding mixed schemes of this display allow for a sophisticated means of classifying, updating, monitoring, and prognosis, where better data mining/data usage and efficacy is expected.

Background

In 1946, S. S. Stevens devised his classification of “levels of measurement” [1], which subsequently has been used widely and has accomplished an important role in composition and interpretation of scales in medical fields. The systematics of levels of measurement seems to have been organized and validated by virtue of this classification. Nevertheless, we believe that an abstract algebra-like interpretation/systematization awaits introduction because of its logical consistency and unexceptional applicability in describing patterns and processes. We conjecture that it offers benefits in clinical medicine, especially, with respect to scales of measurement [2, 3].

Thus, in the following, we re-interpret Stevens classification, and endeavour to give it meaning in some abstract algebra-like modelling. There, the most preferred construct is a vector-like structure of various sets of scores based on individual scales and operators that permit changes of score within the set. Additionally, classical datasets that are classified in terms of the Stevens scales of measurement can be mined and combined on a higher abstract structure level based upon a hierarchical-cluster form. To explore this possibility, we provide simple examples to help readers understand this modelling tool.

§1. Application of group/field of abstract algebra to the various types of scales

Stevens classified the scales of measurement into four scale types [1]; І) “Nominal scale” that uses only labels or numbers (e.g., numbering of football players, blood type, nationality); II) “Ordinal scale” that introduces equality, rank-ordering (e.g., hardness of minerals, grading for efficacy of clinical treatment); III) “Interval scale” that is based on equally quantitative intervals (e.g., temperature as read in centigrade, duration, frequency); and ІV) “Ratio scale” that assumes a ‘zero’ as an origin, equality, rank-order, equality of intervals, and equality of ratios (e.g., absolute temperature, speed of vehicles, and most physical values) that then admit manipulations using the four arithmetic operations.

For І), the “Nominal scale”, there seems to be little room where group theoretical operations apply because within that scale only a labelling scheme is permissible. Although some non-cyclic group might be definable, it seems that little meaning can be attached to operations for this sort of scale.

For II), the “Ordinal scale”, a ranking is realised by introducing a set with an N-graded scoring like ‘1, 2, 3,…, N – 1, N’ (N: positive integer) for a score deficient in (or with no absolute need for) a quantitative character, but not requiring a ‘0’ score according to the Stevens classification. Historically, the “graphic rating scale”, a grading from І to V, was proposed by Hayes and Patterson in 1921 [4], and Freyd in 1923 [5]. However, here, we envisage either operations that decrease the score by ‘1’ in an N-graded graphic scale necessitating a ‘0’, so that {0, 1, 2, 3,…, N – 2, N – 1} establishes the scoring scale, or simply adding the score ‘0’ as in {0, 1, 2, 3,…, N – 2, N – 1, N}. We focus on the former type. Then, for an arbitrary non-negative integer X, the operation giving the remainder of X after division by N, written X (mod N), defines the cyclic group Z_N = {0, 1, 2, 3,…, N – 2, N – 1}, where modulo N addition is postulated. With this assumption, given two elements ‘X_j’ and ‘X_k’ (X_j, X_k ∈ Z_N) corresponding for example to the severity of a clinical symptom and/or finding, then composition (denoted by ‘*’) is taken to be modulo N addition; ‘X_j*X_(j→k) = X_k’ (with X_(j→k) ∈ Z_N). Here ‘X_(j→k)’ is an operator that produces the change in score, ‘X_j → X_k’ (formally we have ‘X_(j→k) = X_j^-1*X_k = X_k – X_j’). Then, all scores ‘X_j’s and operators ‘X_(j→k)’ are composable within a single Abelian modulo additive group ‘Z_N’, where ‘X_j*X_k = X_k*X_j’ holds, at least, in terms of operation ‘*’. Thus a patient’s state corresponding to a certain illness or disease can be changed through the application of a single operation determined by the two elements belonging to ‘Z_N’ [6, 7] representing the previous and current state of the patient. A simple example is presented in Appendix A.

If a state of maximum severity is present, then the antithesis for any given disease Y is the ideal healthy state E_Y = [0|0|0|0|0|…], the combination of all scores being ‘0’ and represented by the identity element for group Y = {Z_N^×n, *}. Here, Y is the n-fold Cartesian product of ‘Z_N’ (n: the number of components) that comprises all possible assessments related to each state of a given disease, for instance, ‘hypertension’, ‘hyperglycaemia’, ‘diabetes mellitus’, ‘acute pancreatitis’, ‘systemic lupus erythematosus’, and ‘cerebral artery stroke’. If in addition composition is given by modulo ‘N’ arithmetic, prime numbers (e.g., N = 7) are preferable [8, 9] and considerable parts of components could be overlapping among individual diseases as was mentioned in our previous reports [6, 7]. Note that, in practice, equal increments within a grading scheme are not always postulated. Nevertheless, the scale represented by this Abelian modulo additive group ‘Z_N^×n’ will be called a “modular scale”. However, it may be an atypical case (partially weakened example) of a “Ratio scale” (type ІV) without the strict requirement for equal calibration. Indeed, there are such scales because, like the ‘TNM classification (with a ‘T0’ entry) for malignant tumours’ [7, 10], grades for scoring are determined for example according to histological characteristics, selection of treatment, and prognosis, having no strict linearity in scale, but which might be regarded as an “modular scale”. Based upon these results, for instance, the following are considered composable; Abelian modulo additive group Y₁ = {Z₇, *} for ‘hypertension’, Y₂ = {Z₇, *} for ‘hyperglycaemia’, Y₃ = {Z₇, *} for ‘diabetes mellitus’, Y₄ for ‘acute pancreatitis’, Y₅ for ‘systemic lupus erythematosus’, Y₆ for ‘cerebral artery stroke’, Y_all = {Z₇ × Z₇ × Z₇ × …, *} = {Z₇^×n, *} (n: the number of components) for an entire body, and Y₇ = {Z₈ × Z₄ × Z₂ × Z₂, *} for the ‘TNM classification (with a ‘T0’ entry) for malignant tumours’ [7, 10]. Additionally, these are treatable without exception within the abstract algebraic theory. For this case, an equal calibration for severity may have unbeneficial outcomes if used in clinical treatments. However, for ‘delirium’, ‘chronic liver dysfunction’, ‘acute pancreatitis’, and ‘diabetes mellitus’, for example, total scores based on equal calibration are desirable to assess disease severity.

For III), the “Interval scale”, differences in quantities are allowed. An example is ‘periods of time’ or ‘duration’, which, although can be measured with ratio scales, enables one period to be double another when compared. The same is true of ‘temperature’. If parameters ‘X_j’ and ‘X_l’ ∈ R (the continuous real number line) have ranges

- \infty < X < + \infty

(i)

we can consider an operator ‘X_k’ that causes changes from ‘X_j’ to ‘X_l’, and introduce a binary operation, denoted ‘◦’, where ordinal addition and its inverse, subtraction, are assumed;

X_{j} \circ X_{k} = X_{j} + X_{k} = X_{l} (j, k, l; session numbers)

(ii)

In this regard, as for ‘X_j’, it can also be expressed as a sum of an integer part and a decimal part,

X_{j} = 1 m_{j} + c_{j}

(iii)

(m_j = [X_j], c_j = X_j - [X_j], ‘0 ≤ c_j < 1’; ‘[X]’ is the floor function meaning the highest integer below ‘X’). Similarly,

X_{k} = 1 m_{k} + c_{k} (m_{k} = [X_{k}], c_{k} = X_{k} - [X_{k}], ‘ 0 \leq c_{k} < 1 ’)

(iv)

X_{l} = 1 m_{l} + c_{l} (m_{l} = [X_{l}], c_{l} = X_{l} - [X_{l}], ‘ 0 \leq c_{l} < 1 ’)

(v)

‘1’ is a ‘unit length’ of the respective values. Thus, (iii) - (v) can be redefined using the unit length ‘1’ as an interval scale,

\begin{array}{l} X_{j} \circ X_{k} = (1 m_{j} + c_{j}) + (1 m_{k} + c_{k}) \\ = 1 (m_{j} + m_{k}) + (c_{j} + c_{k}) = 1 m_{l} + c_{l} \\ = X_{l} \end{array}

(vi)

There exists an identity element ‘X₀’ (=0) that satisfies ‘X_j ◦X₀ = X₀ ◦X_j (=X_j + X₀ = X_j + 0) = X_j’. Additionally, the inverse element is ‘X_j^-1 = -X_j’ satisfying ‘X_j^-1◦X_j = X_j ◦X_j^-1 = X_j + X_j^-1 = X_j - X_j = X₀ (=0)’.

Naturally, commutativity and associativity are satisfied. Let U be the set that comprises all ‘X_j’s, i.e., U ≡ {X_j | X_j ∈ R}. Because ‘X_j , X_k, X_j ∈ set U, the closure law holds. Therefore, this operation defines a group U = {X_j, ◦} [2, 3]. “Body temperature readings”, “clock time for the onset of sleep within a day” and “clock time for the onset of drip infusion within a day” are definable in this scale. Examples of the first two are provided in Appendix B. By making use of this procedure, the differences between quantitative values and operators are eliminated, and both can be regarded as elements belonging to a single group U. Moreover, a collection of additive Abelian groups U₁ ≡ {X_1j | X_1j ∈ R (deg C)} based on an individual’s clinical values can be described as, as for example U₁ = {X_1j, ◦} for “body temperature readings”, and U₂ ≡ {X_2j | X_2j ∈ R (/24 hrs)} and U_2j = {X_2j, ◦} for “clock time for the onset of sleep within a day”, U₃ ≡ {X_3j | X_3j ∈ R (/24 hrs)} and U_3j = {X_3j, ◦} for “clock time for the onset of drip infusion within a day”,…, U_N = {X_Nj, ◦},…, (N: natural number). Those are considered readily treatable and recordable within an abstract algebraic context.

For IV), the “Ratio scale”, the ‘administration of medicine (with strict dosage regimes)’ and ‘International Statistical Classification and Health Related Problems’ [11] were given as examples in our previous report [6, 7]. Essentially, for this scale, because the four arithmetic operations are possible, ‘rings’ and ‘fields’ in abstract algebra are applicable so long as composition is given by modulo ‘N’ arithmetic with ‘N’ a prime. Although there could be scope where the four modulo arithmetic operations (denoted by ‘†’ in ‘X_j†X_k = X_l’) are applicable in assessment scoring in clinical medicine, it might be preferable at this stage to confine the application of ratio scales to just modulo N addition ‘*’ collectively for ‘†’, similar in manner as established in Appendix A. For the example given in Appendix A, the difference in interpretation is the presence/absence of an equal calibration.

Whereas the scale of ‘TNM classification for malignant tumours’ [10] was regarded as an example of an “Ordinal scale”, some of the scales defined as “Ratio scales” at initial glance should be regarded as “Ordinal scales” accompanied with ‘0’. It might be contentious whether clinical assessments performed using superficial scales based on the four arithmetic operations could have sufficient validity in clinical treatments or clinical research.

Nevertheless, other clinical scales range over a semi-open continuous interval like ‘0 ≤ X < +∞’ (X: real number), such as ‘blood concentration of white blood cells: [WBC] (/mm³)’, and ‘administration of a certain drug like lithium carbonate: [Li⁺] (mEq/l), sodium: [Na⁺] (mEq/l), calcium: [Ca⁺⁺] (mg/dl), chloride: [Cl^-] (mEq/l) and bicarbonate: [HCO₃^-] (mEq/l)’. Also, there are clinical scales whose ranges are the open interval like ‘-∞ < X < +∞’ (X: real number); ‘Anion gap [AG] = [Na⁺] - ([Cl^-] + [HCO₃^-]) (reference range for blood tests: 12 ± 2 mEq/l)’ and ‘Base excess [BE] (reference range for blood tests: 0 ± 2 mmol/l)’. However, both can be treated using the notion of ‘field’ because those values are real numbers where all four arithmetic operations are included, with the exception of division by zero. Thus, the above clinical values could be definable over a ‘field’. In this regard, we assume a rule that each unit like ‘mEq/l’ accompanies the value automatically with the results of operations regardless of types of operation among the four arithmetic operations (Note that there are cases when units vanish as when ratios are taken ‘mEq/mEq (unitless)’ or displayed in reciprocal form like ‘l/mEq’). Examples for ‘[WBC] (/mm³)’, ‘[Na⁺] (mEq/l)’ are presented in Appendix C.

In this case, we consider a set V and assume that ‘#’ means one of ‘addition, subtraction, multiplication, and division’ collectively; thus, ‘X_j # X_k = X_l (∈V), where ordinal arithmetic calculations are performed excluding of course division by zero.

For set V, addition is commutative: X_j + X_k = X_k + X_j, and associative: (X_j + X_k) + X_l = X_j + (X_k + X_l). As for multiplication, set V meets the conditions of a ‘monoid’ [2, 3]. Associativity: (X_j × X_k) × X_l = X_j × (X_k × X_l), with Left and Right Distributivity: X_j × (X_k + X_l) = X_j × X_k + X_j × X_l, (X_j + X_k) × X_l = X_j × X_l + X_k × X_l. A nonzero Identity X₀ (=1) for multiplication exists. The Inverse ‘X_j^-1 = 1/X_j’ satisfies ‘X_j × X_j^-1 = X_j^-1 × X_j = X₀ (=1)’. For division, ‘X_j/X_k = X_j × X_k^-1 = 1’ is definable except for division by zero. Therefore, we can confirm that set V is a ‘field’. It can be expressed as V = {X_j, #} or V = {X_j | X_j ∈ R}.

Furthermore, different fields based on different sets of clinical values can be described as follows: field V₁ ≡ {X_1j | X_1j ∈ R (/mm³)} and V₁ = {X_1j, #} for “blood concentration of white blood cells: [WBC] (/mm³)”, field V₂ ≡ {X_2j | X_2j ∈ R (mEq/l)} and V₂ = {X_2j, #} for “administration of a certain drug like lithium carbonate: [Li⁺] (mEq/l)”, field V₃ ≡ {X_3j | X_3j ∈ R (mEq/l)} and V₃ = {X_3j, #} for “sodium: [Na⁺] (mEq/l)”, field V₄ ≡ {X_4j, #} for calcium: [Ca⁺⁺] (mg/dl), field V₅ for chloride: [Cl^-] (mEq/l), field V₆ for ‘Anion gap [AG] (mEq/l)’, field V₇ for ‘Base excess [BE] (mmol/l)’,…, V_N,…, (N: natural number). For each, an independent abstract algebraic treatment is possible as for ordinal abstract algebra.

§2. A vector-like notation using group/field operations belonging to a single set

By making use of all types of scales of measurement, we propose a vector-like expression of a patient’s state (denoted ‘R_j’, j = 1, 2, 3,…: number of sessions), where the mixed expression and its totality of operations that could be performed belong to a single set R. Because of the possible variety of operation rules, the genuine use of this set may be unwieldy at this stage.

Partially based upon our previous description [6, 7], let us define ‘R_j’ to be a vector of five clinical values,

R_j = [severity for depression (within modulo 7 arithmetic) | clock time for the onset of sleep (/24 hrs) | blood concentration of white blood cell [WBC] (/mm³) | blood concentration of [Na⁺] (mEq/l)| a certain value (a certain operational unit)],

\begin{array}{l} = [X_{(j) 1} (mod 7) | X_{(j) 2} (/ 24 hrs) | X_{(j) 3} (/ {mm}^{3}) \\ | X_{(j) 4} (mEq / l) | X_{(j) 5} (\dots)] \end{array}

(vii)

Next, suppose the patient’s state ‘R_j’ changes to ‘R_j+1’ effected by operator ‘R_(j→j+1)’; we denote by ‘◊’ the binary composition composed of the product of compositions for each component. Three possible states are:

\begin{array}{l} R_{1} = [(X_{(1) 1} =) 2 (mod 7) | (X_{(1) 2} =) 21 (/ 24 hrs) | (X_{(1) 3} =) 5000 \\ (/ {mm}^{3}) | (X_{(1) 4} =) 145 (mEq / l) | X_{(1) 5} (\dots)] \end{array}

\begin{array}{l} R_{2} = [(X_{(2) 1} =) 5 (mod 7) | (X_{(2) 2} =) 19.5 (/ 24 hrs) | (X_{(2) 3} =) 18000 \\ (/ {mm}^{3}) | (X_{(2) 4} =) 128 (mEq / l) | X_{(2) 5} (\dots)], \end{array}

\begin{array}{l} R_{3} = [(X_{(3) 1} =) 3 (mod 7) | (X_{(3) 2} =) 22 (/ 24 hrs) | (X_{(3) 3} =) 7000 \\ (/ {mm}^{3}) | (X_{(3) 4} =) 158 (mEq / l) | X_{(3) 5} (\dots)] . \end{array}

For the 1st component, ‘X₍₁₎₁’,‘X₍₂₎₁’, and ‘X₍₃₎₁’, modulo 7 arithmetic (addition) is used. For the 2nd components, ‘X₍₁₎₂, X₍₂₎₂, X₍₃₎₂’, operations of Abelian addition are used. For the 3rd component, ‘X₍₁₎₃, X₍₂₎₃, X₍₃₎₃’, 4th ‘X₍₁₎₄, X₍₂₎₄, X₍₃₎₄’, the four arithmetic operators (those operations denoted by ‘#’) are required, and for the 5^th, ‘X₍₁₎₅, X₍₂₎₅, X₍₃₎₅’, a certain operational unit is postulated. In the following examples, only addition/subtraction is presented; naturally, multiplication/division is also considered permissible.

Then, using results in Appendix D, ‘R_(1→2)’ and ‘R_(2→3)’ from the three states given above are as follows:

\begin{matrix} R_{(1 \to 2)} = [3 (mod 7) | - 1.5 (/ 24 hrs) | 13000 (/ {mm}^{3}) | \\ - 17 (mEq / l) [| X_{(1 \to 2) 5} (\dots)] \end{matrix}

\begin{matrix} R_{(2 \to 3)} = [5 (mod 7) | 2.5 (/ 24 hrs) | - 11000 (/ {mm}^{3}) | \\ 30 (mEq / l) | X_{(2 \to 3) 5} (\dots)] \end{matrix}

Thus, we confirm the relation

R_{1} ◊ R_{(1 \to 2)} ◊ R_{(2 \to 3)} = R_{3}

(viii)

Details are illustrated in Appendix E.

Note that, in general, there exists an identity ‘E (=R₀) = [0 (mod 7)| 0 (/24 hrs)| 0 (/mm³)| 0 (mEq/l) | X₀ (…)]’ such that ‘R_j◊E = E◊R_j = R_j’. Additionally, there exists an inverse for any ‘R_j’, ‘R_j^- 1 = [X_(j)1^- 1(mod 7) | X_(j)2^- 1(/24 hrs)| X_(j)3^- 1(/mm³)| X_(j)4^- 1(mEq/l)|X_(j)5^- 1(…)] = [7–X_(j)1(mod 7) | 24 - X_(j)2(/24 hrs)| - X_(j)3(/mm³)| - X_(j)4(mEq/l)|X_(j)5^- 1(…)]’ that satisfies ‘R_j^-1◊R_j = R_j◊R_j^-1 = E’. However, commutativity, ‘R_j◊R_k = R_k◊R_j’ and associativity, ‘(R_j◊R_k)◊R_l = R_j◊(R_k◊R_l)’ are not satisfied. Here, we assume that operators acting on ‘R_j’s should be performed from left to right, that is, from R₁ to R_m (m; number of session for assessment). They should not be applied between ‘R_j’s. For any assortment of ‘R_j’s with scales of measurement among types I)–IV), a single set R = {R_j| X_(j)1 × X_(j)2 × X_(j)3 × X_(j)4 × X_(j)5} (‘×’ means products among groups and fields) using a vector-like notation for the scoring of patient states can be structured where all possible assessments and/or clinical findings of the patient and treatment are included. The general form is the n-fold product; set R = {R_j| X_(j)1 × X_(j)2 × X_(j)3 × X_(j)4 × …×X_(j)(n-2) × X_(j)(n-1) × X_(j)n} (n; the number of components).

As for the possible application to better data mining or data usage from the viewpoint of our reinterpretation, we provide a simple example that may help readers to follow an outline of the argument. Consider an example of 17 states “R₁, R₂, …, R₁₇” (∈set R) each with four component (‘n = 5’) and arrows (only symbols) that indicate the possible changes among the ‘R_j’s, as displayed in Figure 1. The scheme covers the notation of our model, and also that of existing methods where (possible) results of data, ‘R_j’s, are not combined directly with each other in the sense of operations. Then, the arrows could be re-displayed according to our concepts as operators ‘R_(j→k) that can be regarded as elements ‘R_j’ belonging to a set R as in Figure 2. In ordinal data sets, the ‘R_j’s are merely a collection of values and the arrows in Figure 1 are only marks. However, in our interpretation, all ‘R_j’s and ‘R_(j→k)’s are elements of a single set R subject to axioms of an abstract algebra as indicated using composition symbol ‘◊’ in Figure 3. There, the changes ‘from R_j to R_k’ can be traced at each session. Displayed in this way, Figure 3 represents an “operational tree” that could offer potential for better data mining/data usage through a more generalized/concise treatment (e.g., withdrawing/recording correspond to schemes in Figure 3) that might be permissible. Practical improvements for efficacy, however, will need future investigations.

Here, consider the scenario of Figure 1 where from an initial value ‘R₁’ there are four outcomes ‘R₆’, ‘R₉’, ‘R₁₆’, and ‘R₁₇’ containing nodes at ‘R₂’ ‘R₄’ ‘R₁₀’ ‘R₁₂’ and ‘R₁₃’. By making use of our previous examples ‘R₁ - R₃’, the next simplest examples with ‘n (component number) = 5’ can be confirmed easily:

\begin{matrix} R_{10} = [0 (mod 7) | 17 (/ 24 hrs) | 9000 (/ {mm}^{3}) \\ | 130 (mEq / l) [| X_{(10) 5} (\dots) \end{matrix}

\begin{matrix} R_{11} = [6 (mod 7) | 20 (/ 24 hrs) | 20000 (/ {mm}^{3}) \\ | 149 (mEq / l) [| X_{(11) 5} (\dots) \end{matrix}

\begin{matrix} R_{12} = [4 (mod 7) | 23 (/ 24 hrs) | 6000 (/ {mm}^{3}) \\ | 140 (mEq / l) | X_{(12) 5} (\dots)] \end{matrix}

\begin{matrix} R_{13} = [1 (mod 7) | 18 (/ 24 hrs) | 5000 (/ {mm}^{3}) \\ | 135 (mEq / l) | X_{(13) 5} (\dots)] \end{matrix}

\begin{matrix} R_{17} = [2 (mod 7) | 23.5 (/ 24 hrs) | 3000 (/ {mm}^{3}) \\ | 150 (mEq / l) | X_{(17) 5} (\dots)] \end{matrix}

Following these results, the next relations, according to the tree in Figure 3, can be obtained for instance:

R_{1} ◊ R_{(1 \to 2)} ◊ R_{(2 \to 10)} ◊ R_{(10 \to 11)} ◊ R_{(11 \to 12)} = R_{12}

R_{1} ◊ R_{(1 \to 2)} ◊ R_{(2 \to 10)} ◊ R_{(10 \to 13)} ◊ R_{(13 \to 17)} = R_{17}

The operator expressions are evaluated in Appendix F.

Similarly, the next sequences are definable in principle,

R_{1} ◊ R_{(1 \to 2)} ◊ R_{(2 \to 3)} ◊ R_{(3 \to 4)} ◊ R_{(4 \to 5)} ◊ R_{(5 \to 6)} = R_{6}

R_{1} ◊ R_{(1 \to 7)} ◊ R_{(7 \to 8)} ◊ R_{(8 \to 9)} = R_{9}

R_{12} ◊ R_{(12 \to 4)} = R_{4}

R_{1} ◊ R_{(1 \to 2)} ◊ R_{(2 \to 10)} ◊ R_{(10 \to 13)} ◊ R_{(13 \to 14)} ◊ R_{(14 \to 15)} ◊ R_{(15 \to 16)} = R_{16}

In general, we denote a node divergence ‘R_a to R_b (=R_a◊R_(a→b) = R_b)’ and ‘R_a to R_c (=R_a◊R_(a→c) = R_c)’ as ‘R_a[(◊R_(a→b))(◊R_(a→c))]’ (a,b,c: non-negative integers); here ‘( )( )( )…’ meaning simple juxtaposition. All paths belonging to the operational tree of Figure 3 can then be described/recorded, for instance, as the sequence

\begin{array}{l} R_{1} ◊ R_{(1 \to 2)} [(◊ R_{(2 \to 3)} ◊ R_{(3 \to 4)} ◊ R_{(4 \to 5)} ◊ R_{(5 \to 6)}) (◊ R_{(2 \to 7)} ◊ R_{(7 \to 8)} \\ ◊ R_{(8 \to 9)}) (◊ R_{(2 \to 10)}[(◊ R_{(10 \to 11)} ◊ R_{(11 \to 12)} ◊ R_{(12 \to 4)}) \\ (◊ R_{(10 \to 13)} [(◊ R_{(13 \to 14)} ◊ R_{(14 \to 15)} ◊ R_{(15 \to 16)}) (◊ R_{(13 \to 17)})])])] \end{array}

(ix)

To display for easy recognition, for example, end states like ‘R₆, R₉, R₁₆, and R₁₇’ and divergence point ‘R₄’ a notation ‘(=R₆), (=R₉), (=R₁₆) and (=R₁₇)’ might be considered. Hence,

\begin{array}{l} R_{1} ◊ R_{(1 \to 2)} [(◊ R_{(2 \to 3)} ◊ R_{(3 \to 4)} ◊ R_{(4 \to 5)} ◊ R_{(5 \to 6)} (= R_{6})) (◊ R_{(2 \to 7)} \\ ◊ R_{(7 \to 8)} ◊ R_{(8 \to 9)} (= R_{9})) (◊ R_{(2 \to 10)}[(◊ R_{(10 \to 11)} ◊ R_{(11 \to 12)} \\ ◊ R_{(12 \to 4)} (= R_{4})) (◊ R_{(10 \to 13)} [(◊ R_{(13 \to 14)} ◊ R_{(14 \to 15)} \\ ◊ R_{(15 \to 16)} (= R_{16})) (◊ R_{(13 \to 17)} (= R_{17}))])])] \end{array}

(x)

Moreover, composition with an operator as in operating on ‘R_{(3 → 4)}[(◊R_{(4 → 5)} ◊ R_{(5 → 6)})(◊R_{(4 → 8)} ◊ R_{(8 → 9)} ◊ R_{(9 → 10)})(◊R_{(4 → 15)} ◊ R_{(15 → 16)}) …] from the left-hand side by ‘R₃’. The subsequent result can be expressed in accordance with the single scheme presented in Figure 3,

\begin{array}{l} R_{3} ◊ R_{(3 \to 4)} [(◊ R_{(4 \to 5)} ◊ R_{(5 \to 6)}) (◊ R_{(4 \to 8)} ◊ R_{(8 \to 9)} ◊ R_{(9 \to 10)}) \\ (◊ R_{(4 \to 15)} ◊ R_{(15 \to 16)}) \dots] = R_{6}, R_{10}, R_{16},\dots \end{array}

(xi)

Note that the above descriptions (ix)–(xi) express one-to-many functionality. However, we think that these formulae are the algebra equivalent to the single operational tree as exemplified by Figure 3. These play the algebraic role in composite record-keeping in applied fields such as medicine. In this formalism, any possible result ‘R_j’ (∈set R) is obtained and traceable from any state ‘R_k’ under operations involving a plurality of elements belonging to a single set R.

Additionally, we can include data mining in a more symbolic/abstract way as follows. For an arbitrary j (j = 1, 2, 3,…, m), a hierarchical-cluster-like expression can be defined [12]. For instance, if a partition of R_j is a set of subsets H = {₁R_j, ₂R_j, ₃R_j,…, _rR_j} such that (1) R_j ∈ H; (2) for all single sets _sR_j in R_j, _sR_j ∈ H; and (3) ‘_sR_j ∩ _tR_j ∈ {ϕ, _sR_j, _tR_j}’ for all s ≠ t = 1, 2,…, r. That is, condition (3) means that either any two clusters ‘_sR_j and _tR_j’ are disjoint, or one cluster is contained entirely inside the other, and every individual R_j is contained in at least one cluster larger than itself. Note that if ‘_sR_j ∩ _tR_j = ϕ’ for all s ≠ t, then the hierarchy becomes a partitioning. Henceforth, reference to a hierarchy implies that ‘_sR_j ∩ _tR_j = ϕ’ for at least one set of (s, t) values. In the previous example (vii), ‘R_j’ could be expressed in hierarchical-cluster notation where there are eight clusters (and relabeling within clusters) as shown in Figure 4. If R_j comprises ‘₁R_j¹ and ₂R_j¹’, the first level of hierarchy, ‘R_j = ₁R_j¹ ∪ ₂R_j¹’ holds. At the second level, ‘₁R_j¹ = ₁₁R_j² ∪ ₁₂R_j²’ = [X_(j)1 (mod 7) | X_(j)2 (/24 hrs)], ‘₂R_j¹ = ₂₁R_j² ∪ ₂₂R_j²’ = [X_(j)3 (/mm³) | X_(j)4 (mEq/l)|X_(j)₅ (…)], whereas at the third level, ‘₂₂R_j² = ₂₂₁R_j³ ∪ ₂₂₂R_j³’ = [X_(j)4 (mEq/l)|X_(j)₅ (…)], ₂₂₁R_j³ = [X_(j)4 (mEq/l)], ₂₂₂R_j³ = [X_(j)5 (…)] (Figure 4). Hence we obtain the complete set R_j = {X_(j)1, X_(j)2, X_(j)3, X_(j)4, X_(j)5} = [X_(j)1 (mod 7) | X_(j)2 (/24 hrs) | X_(j)3 (/mm³) | X_(j)4 (mEq/l) | X_(j)5 (…)]. A hierarchy has additional levels as necessary to reach single units at its base [12]. The top level is the entire dataset ‘R_j’ and that is always composable using base units. That is, arbitrary ‘R_k’ and ‘R₁’ can be combined into a single dataset as with ‘R_k = [X_(k)1| X_(k)2 |…| X_(k)a]’ and ‘R₁ = [X₍₁₎₁| X₍₁₎₂ |…| X_(1)b]’, ‘{R_k, R₁} =R_j [X_(j)1| X_(j)2 |…| X_(j)a | X_(j)a+1| X_(j)a+2 |…| X_(j)a+b] ’ (a,b; positive integers). In this way, classical datasets that are classified in the Stevens scales of measurement could be mined and combined on a higher abstract structure level. To help better understand the concept, a sequence of schemes illustrating the principles of our model is presented in Figure 5.

Subject to future improvements, we envisage that this compact description is versatile to provide better data mining/data usage than from existing methods, although a final version is far from complete at this early stage.

§3. Supplementary suggestions and limitations

If the four arithmetic operations are appropriate in handling the values from clinical assessments, representation by “Ratio scales” (in some cases, the “modular scale” with suitable modulo number previously mentioned) might be effective in describing the clinical treatments or studies. The “Numerical rating scale (NRS)” with range ‘0–10’ [13, 14] illustrates the point where the modulo 11 additive group ‘Z₁₁’ arises as a natural modular scale. In contrast, similar approaches might be difficult for a “visual analogue scale” [15, 16] where values could take any real number.

Whereas rating scales systemized as abstract algebra-like form may enable a more generalized/sophisticated understanding, establishing a link between fields of clinical medicine and abstract algebra, and mixed states and operators in vector-like notation as in (vii)–(xi), does not always assure more concise manipulations. A mixed treatment as exemplified in (vii)–(xi) might not always yield optimal results at present. In general, combining group and field-like structures within ‘R_j’ may cause some confusion in handling the ‘R_j’s although benefits accrue through operational compliance and convenience in dealing with the abstract algebra. For description and records, a vector-like definition ‘R_j’ may not always be advantageous in which only the four types I)–IV) are used (particularly for ‘I)’, the “nominal scale”, where systematization of operation seems to be impossible). Nevertheless, we infer that in the handling of operations in mixed-notation like ‘R_j’, the classification and synthesis of scales of measurement in some group/field-like form may be devised in a more rigorous methodology in future improvements.

That apart, similar, redundant, and obscure components may have been incorporated into the ‘R_j’s description without discretion. The ‘R_j’ in such instances loses validity and versatility in terms of a concise composition of scales. This is considered to result from the fact that a total state of a certain disease or a condition of a patient is not always composable or describable via the combination of partial components. This implies that a larger number of components is not always desirable for assessment or rating scales.

Unfortunately, almost all current assessment scales in medicine are handled as if they were ratio scales although almost all are just ordinal scales. That might introduce considerable futility and/or waste of scientific resources. As previously indicated, some clinical scales (e.g., TNM classification) should be represented as an ordinal scale accompanied by ‘0’ with no absolute need for a quantitative calibration (modular scale). Although a combination composed of entirely ratio scales seems to be difficult or impossible, we believe at least that appropriate operational structures (e.g., group, field) should always be selected that satisfied the conditions in instances like composition of scale, analysis, and interpretation of the results. These structures must be recognized clearly by users per each assessment to avoid misestimation, overconfidence, and complacency in scales.

Conclusions

The Stevens classification of scales of measurement can be re-interpreted and modelled as some abstract algebra-like systematization. Moreover, a vector-like notation using mixed types of operations and a hierarchical structure-like systematization are possible enabling a sophisticated means to classify, update, monitor, and forecast patient treatments. Better data mining/data usage and efficacy is expected and will be considered in future studies.

Appendix

Appendix A

Using ‘N = 5’ for the scale of a certain symptom or clinical finding with set Z₅ ≡{0, 1, 2, 3, 4}, we suppose ‘X₁ = 1’ (∈ set Z₅) for the initial state and ‘X₂ = 4’ (∈ set Z₅) for the final state. Expressed as ‘X₁*X_(1→2) = X₂’, the change can be determined as ‘X_(1→2) = X₂ – X₁ (mod 5) = 4 – 1 (mod 5) = 3 (mod 5) (∈ Z₅)’.

Appendix B

Suppose ‘the body-temperature thermometer’ (deg C; degree Celsius) changes from ‘T₁ = 36.7 (deg C)’ to ‘T₂ = 35.1 (deg C)’. Because ‘T₁ ◦T_(1→2) = T₂’, an operator part is calculated as ‘T_(1→2) = T₂ - T₁ = 35.1 - 36.7 (deg C) = - 1.6 (deg C)’. For an another example, when there are two clock times for the onset of sleep ‘t₁ = 21 (/24 hrs)’ and ‘t₂ = 19.5 (/24 hrs)’, the operator part is determined as ‘t_(1→2) (/24 hrs) = t₂ - t₁ (/24 hrs) = 19.5 - 21 (/24 hrs) = -1.5 (/24 hrs) = 24 -1.5 (/24 hrs) = 22.5 (/24 hrs)’.

Appendix C

Provided [WBC] changes in the following manner: ‘5000 (/mm³) (= W₁) →18000 (/mm³) (=W₂). Because ‘W₁ # W_(1→2) = W₂’, the operator denoted by ‘W_(1→2)’ for addition is derived from ‘W_(1→2) = W₂ - W₁ = 18000 - 5000 = 13000 (/mm³)’. Collectively, the operator is determined by division: ‘W_(1→2) = W₂/W₁ =18000/5000 (= 3.6) (/mm³)’,

For an another example, if ‘[Na]₁ = 145 (mEq/l)’ changes into ‘[Na]₂ = 128 (mEq/l)’, because ‘[Na]₁ # [Na]_(1→2) = [Na]₂’, the operator for addition is obtain from ‘[Na]_(1→2) = [Na]₂ - [Na]₁ = 128 - 145 = - 17 (mEq/l)’. Collectively, the operator for division is ‘[Na]_(1→2) = [Na]₂/[Na]₁ = 128/145 (mEq/l)’.

Appendix D

R_(1→2) = R₂ - R₁

= [5 (mod 7) | 19.5 (/24 hrs) | 18000 (/mm³) | 128 (mEq/l) | X₍₂₎₅ (…)] - [2 (mod 7) | 21 (/24 hrs) | 5000 (/mm³) | 145 (mEq/l) | X₍₁₎₅ (…)],

= [5 - 2 (mod 7) | 19.5 - 21 (/24 hrs) | 18000 - 5000 (/mm³) | 128 - 145 (mEq/l) | X_(1→2)5 (…)],

= [3 (mod 7) | - 1.5 (/24 hrs) | 13000 (/mm³) | - 17 (mEq/l) | X_(1→2)5 (…)].

R_(2→3) = R₃ - R₂

= [3 (mod 7) | 22 (/24 hrs) | 7000 (/mm³) | 158 (mEq/l)] | X₍₃₎₅ (…)] - [5 (mod 7) | 19.5 (/24 hrs) | 18000 (/mm³) | 128 (mEq/l) | X₍₂₎₅ (…)],

= [3 - 5 (mod 7) | 22 - 19.5 (/24 hrs) | 7000 - 18000 (/mm³) | 158 - 128 (mEq/l) | X_(2→3)5 (…)],

= [- 2 (mod 7) | 2.5 (/24 hrs) | - 11000 (/mm³) | 30 (mEq/l) | X_(2→3)5 (…)],

= [5 (mod 7) | 2.5 (/24 hrs) | - 11000 (/mm³) | 30 (mEq/l) | X_(2→3)5 (…)].

Appendix E

R₁◊R_(1→2)◊R_(2→3) = [2 (mod 7) | 21 (/24 hrs) | 5000 (/mm³) | 145 (mEq/l) | X₍₁₎₅ (…)]◊[3 (mod 7) | - 1.5 (/24 hrs) | 13000 (/mm³) | - 17 (mEq/l) | X_(1→2)5 (…)]◊[5 (mod 7) | 2.5 (/24 hrs) | - 11000 (/mm³) | 30 (mEq/l) | X_(2→3)5 (…)],

= [2 + 3 + 5 (mod 7) | 21 - 1.5 + 2.5 (/24 hrs) | 5000 + 13000 - 11000 (/mm³) | 145 - 17 +30 (mEq/l) | X₍₃₎₅ (…)],

= [10 (mod 7) | 22 (/24 hrs) | 7000 (/mm³) | 158 (mEq/l) | X₍₃₎₅ (…)],

= [3 (mod 7) | 22 (/24 hrs) | 7000 (/mm³) | 158 (mEq/l) | X₍₃₎₅ (…)].

Appendix F

For the 3rd and 4th components, only addition/subtraction is demonstrated collectively for ease in comprehension.

R_(2→10) = R₁₀ - R₂ = [0 - 5 (mod 7) | 17 - 19.5 (/24 hrs) | 9000 - 18000 (/mm³) | 130 - 128 (mEq/l) | X_(2→10)5 (…)] = [- 5 (mod 7) | - 2.5 (/24 hrs) | - 9000 (/mm³) | 2 (mEq/l) | X_(2→10)5 (…)],

R_(10→11) = R₁₁ - R₁₀ = [6 - 0 (mod 7) | 20 - 17 (/24 hrs) | 20000 - 9000 (/mm³) | 149 - 130 (mEq/l) | X_(10→11)5 (…)] = [6 (mod 7) | 3 (/24 hrs) | 11000 (/mm³) | 19 (mEq/l) | X_(10→11)5 (…)],

R_(11→12) = R₁₂ - R₁₁ = [4 - 6 (mod 7) | 23 - 20 (/24 hrs) | 6000 - 20000 (/mm³) | 140 - 149 (mEq/l) | X_(11→12)5 (…)] = [- 2 (mod 7) | 3 (/24 hrs) | - 14000 (/mm³) | - 9 (mEq/l) | X_(11→12)5 (…)],

R_(10→13) = R₁₃ - R₁₀ = [1 - 0 (mod 7) | 18 - 17 (/24 hrs) | 5000 - 9000 (/mm³) | 135 - 130 (mEq/l) | X_(10→13)5 (…)] = [1 (mod 7) | 1 (/24 hrs) | - 4000 (/mm³) | 5 (mEq/l) | X_(10→13)5 (…)],

R_(13→17) = R₁₇ - R₁₃ = [2 - 1 (mod 7) | 23.5 - 18 (/24 hrs) | 3000 - 5000 (/mm³) | 150 - 135 (mEq/l) | X_(13→17)5 (…)] = [1 (mod 7) | 5.5 (/24 hrs) | - 2000 (/mm³) | 15 (mEq/l) | X_(13→17)5 (…)].

References

Stevens SS: On the theory of scales of measurement. Science. 1946, 103 (2684): 677-680. 10.1126/science.103.2684.677.
Article Google Scholar
Judson TW: Abstract Algebra: Theory and Applications. 1997, Virginia: PWS Publishing Company
Google Scholar
Hungerford TW: Abstract Algebra, An Introduction. 1997, Philadelphia: Saunders College Publishing, 2
Google Scholar
Hayes MHS, Patterson DG: Experimental development of the graphic rating method. Psychol Bull. 1921, 18: 98-99.
Google Scholar
Freyd M: The graphic rating scale. J Educ Psychol. 1923, 14 (2): 83-102.
Article Google Scholar
Sawamura J, Morishita S, Ishigooka J: A group-theoretical notation for disease states: an example using the psychiatric rating scale. Theor Biol Med Model. 2012, 9: 28-10.1186/1742-4682-9-28. July 9
Article PubMed Central PubMed Google Scholar
Sawamura J, Morishita S, Ishigooka J: Further suggestions on the group-theoretical approach using clinical values. Theor Biol Med Model. 2012, 9: 54-10.1186/1742-4682-9-54. Dec 19
Article PubMed Central PubMed Google Scholar
Tate J, Oort F: Group schemes of prime order. Ann Scient Éc Norm Sup. 1970, 3 (1): 1-21. 4e série, t.3
Google Scholar
Jullien GA: Implementation of multiplication, modulo a prime number, with applications to number theoretic transforms. IEEE Transac Comput. 1980, C-29: 899-905.
Article Google Scholar
Sobin LH, Gospodarowicz MK, Wittekind C: International Union Against Cancer (UICC), TNM classification of malignant tumours. 2010, New York: Wiley-Liss, 7
Google Scholar
WHO: International Statistical Classification of Diseases and Related Health Problems. 10th Revision. 1992, Geneva, Switzerland: World Health Organization
Google Scholar
Billard L, Diday E: Symbolic data analysis, in 'Conceptual statistics and data mining'. 2006, England: Wiley & Sons Ltd
Book Google Scholar
Turk DC, Rudy TE, Sorkin BA: Neglected topics in chronic pain treatment outcome studies: determination of success. Pain. 1993, 53 (1): 3-16. 10.1016/0304-3959(93)90049-U.
Article CAS PubMed Google Scholar
Farrar JT, Young JP, LaMoreaux L, Werth JL, Poole RM: Clinical importance of changes in chronic pain intensity measured on an 11-point numerical pain rating scale. Pain. 2001, 94 (2): 149-158. 10.1016/S0304-3959(01)00349-9.
Article CAS PubMed Google Scholar
Crichton N: Information point: visual analogue scale (VAS). J Clin Nurs. 2001, 10 (5): 697-706. 10.1046/j.1365-2702.2001.00525.x.
Article Google Scholar
Langley GB, Sheppard H: The visual analogue scale: Its use in pain measurement. Rheumatol Int. 1985, 5 (4): 145-148. 10.1007/BF00541514.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors wish to acknowledge Katsuji Nishimura, Kaoru Sakamoto, Takashi Oshimo, and Keiko Kojo for providing us with very useful advice.

Author information

Authors and Affiliations

Department of Psychiatry, Tokyo Women’s Medical University, Tokyo, Japan
Jitsuki Sawamura & Jun Ishigooka
Depression Prevention Medical Center, Inariyama Takeda Hospital, Kyoto, Japan
Shigeru Morishita

Authors

Jitsuki Sawamura
View author publications
You can also search for this author in PubMed Google Scholar
Shigeru Morishita
View author publications
You can also search for this author in PubMed Google Scholar
Jun Ishigooka
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jitsuki Sawamura.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

JS conceived the main concept of this article and wrote the manuscript. SM revised the manuscript. JI gave advice on the potential validity from the viewpoint of clinical research and treatment. Additionally, all authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Sawamura, J., Morishita, S. & Ishigooka, J. Interpretation for scales of measurement linking with abstract algebra. J Clin Bioinform 4, 9 (2014). https://doi.org/10.1186/2043-9113-4-9

Download citation

Received: 28 January 2014
Accepted: 02 June 2014
Published: 10 June 2014
DOI: https://doi.org/10.1186/2043-9113-4-9

Interpretation for scales of measurement linking with abstract algebra

Abstract

Background

§1. Application of group/field of abstract algebra to the various types of scales

§2. A vector-like notation using group/field operations belonging to a single set

§3. Supplementary suggestions and limitations

Conclusions

Appendix

Appendix A

Appendix B

Appendix C

Appendix D

Appendix E

Appendix F

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Clinical Bioinformatics