Wednesday 21 December 2016

Equalised Mean-based Normalised Proportion Cited

The Equalised Mean-based Normalised Proportion Cited (EMNPC) is an indicator to assess the proportion of documents cited (i.e., with a non-zero citation count) to see whether it is above or below the world average or the values for another group. It is designed for web indicators for which a high proportion is uncited so that average citation indicators, such as MNLCS, are not very accurate. It is particularly suitable for metrics designed by Kayvan Kousha, such as Wikipedia citation counts, and syllabus mentions.

EMNPC has a simple formula. For each field f in which group g publishes, calculate the proportion of articles with at least one citation, pgf. Now for the same set of fields, calculate the proportion of articles published by the world with at least one citation, pwf. Now sum the first set and divide by the sum of the second set to get EMNPC for the group.
If this has a value greater than 1 then a higher proportion of articles by group g is cited than average for the world, so g's research has above average impact in terms of the proportion cited. Note that this calculation is unfair if the group publishes small numbers of articles in some fields because all fields have equal weight in the above formula. Small fields should therefore be removed. 

This calculation also gives a higher implicit weight to fields with a relatively high proportion cited because these can dominate the numerator and denominator of the formula, but this is necessary to get narrower confidence intervals (in contrast to the similar MNPC indicator).

Here is a worked example in Excel, showing the formulae used (click to expand).

Confidence intervals

Confidence intervals can be calculated with the formula below.

Here is a worked example in Excel (click to expand).

Spreadsheet and original publication

Download a spreadsheet containing the EMNPC calculations. This also has MNPC calculations, and a second worksheet repeating the calculations with a continuity correction for greater accuracy.

Additional details can be found in this article.
Thelwall, M. (in press). Three practical field normalised alternative indicator formulae for research evaluation Journal of Informetrics. 10.1016/j.joi.2016.12.002


Tuesday 20 December 2016

Mean Normalised Log-transformed Citation Score (MNLCS): A field normalised average citation impact indicator for sets of articles from multiple fields and years


The Mean Normalised Log-transformed Citation Score (MNLCS) is a variant of the Mean Normalised Citation Score (MNLCS) to assess the average citation impact of a set of articles. This formula compares the average citation impact of articles within a group to the average citation impact of all articles in the fields and year of the group’s articles. A score of above 1 indicates that the group’s articles have a higher average citation impact than normal for the fields and years in which they were published. The MNLCS uses a log transformation to citation counts before processing them because sets of citation counts are typically highly skewed and this transformation prevents individual articles from having too much influence on the results. The MNLCS calculation is as follows.
  1.  Log transformation: For each article A in the group to be assessed, replace its citation count c by a log-transformed version, ln(1+c).
  2.  Field normalisation: Divide the log transformed citation count ln(1+c) of each article A in the group by the average (arithmetic mean) log transformed citation count ln(1+x) for all articles x in the same field and year as A.
  3. Calculation: MNLCS is the arithmetic mean of all the field-normalised, log-transformed citation counts of articles in the group.

The MNLCS calculations are illustrated below in Excel for a group publishing articles A,C, H and J in a single field containing articles A to K.

 And here is an example of MNLCS calculations for a group that is split between two fields, X and Y, with articles A,C, H and J in field X and O, S, U in field Y.

Confidence intervals can be calculated for MNLCS to assess whether two different MNLCS values are statistically significantly different from each other. The formulae use the means and standard errors of the log transformed citation counts (note: not the field normalised versions) for the articles in the group and the world’s articles in the same field and year as the group. The formula is more complicated if the group publishes in multiple fields and years – this post just discusses the simplest case, with formula below.

The confidence interval calculations are illustrated below in Excel. In this case the group MNLCS is 0.885 but its 95% confidence interval is (0.119, 2.081) so the group average is not statistically significantly different from the world MNLCS value, which is always 1.

All of the above calculations (including confidence intervals) are also built into the free software Webometric Analyst for sets of citation counts from Scopus, the Web of Science as well as for web indicators.
More details are given in this paper:
Thelwall, M. (in press). Three practical field normalised alternative indicator formulae for research evaluation Journal of Informetrics. 10.1016/j.joi.2016.12.002