L`exercice britannique 2008 d`évaluation de la recherche, RAE 2008

Transcription

L`exercice britannique 2008 d`évaluation de la recherche, RAE 2008
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
L’exercice britannique 2008 d’évaluation de la
recherche, RAE 2008
Ralph Kenna 1 and BB 2
1
Coventry University, England
Nancy Université, France
2
R Kenna & BB, arXiv:1004.3155
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Outline
1
L’enjeu
2
Le “RAE”
3
Le modèle empirique
4
Conclusions
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Concentration ou dispersion des moyens ?
The head of the Russell Group “said that the country’s most successful
institutions should get 90% of the $ billion in research funding that is
allocated to higher education every year. They currently receive around 80%.”
[Telegraph, 23 Oct 2009]
The UK must abandon the idea that excellent research should be funded
wherever it is found and concentrate resources on just 25 to 30 universities,
the chair of the Russell Group of large research-intensive institutions has
argued.
[Times Higher Education, Oct 2009]
But the Director of University Alliance said “It is a principle of ... funding
research, based on excellence, that has driven the improvement in the UK
research base and will continue to do so - not an explicit policy to concentrate
research in particular institutions... It is only in ... a meritocratic system that
we can continue to drive up the efficiency and quality of UK research.”
[Press release, Nov 2009]
Groups ?
So which is best : concentration or competition
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
The Research Assessment Exercice
Subject areas were scrutinised to assess the proportion of research work
submitted which fell into categories
4* : Quality that is world-leading in terms of originality, significance and
rigour
3* : Quality that is internationally excellent in terms of originality,
significance and rigour but which nonetheless falls short of the highest
standards of excellence
2* : Quality that is recognised internationally in terms of originality,
significance and rigour
1* : Quality that is recognised nationally in terms of originality,
significance and rigour
Unclassified : Quality that falls below the standard of nationally
recognised work
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
The Research Assessment Exercice
An example of a UOA quality profile is as follows :
Quality level
pn∗ = % of research
activity
4*
3*
2*
1*
u/c
p4∗ = 15
p3∗ = 25
p2∗ = 30
p1∗ = 20
p0∗ = 10
Based upon quality profiles, a formula is used to determine how research
funding is distributed. In England in 2009/2010 the funding formula is
s = p4∗ +
1
3
p3∗ + p2∗ .
7
7
The funding allocated to a groups of size N is then proportional to S = sN.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Limites de l’analyse (qui va suivre) et précautions
There are obvious limits to what our statistical analysis can achieve. We
assume RAE scores are reliable and robust
cannot account for links with industry
cannot account for managerial tactics
cannot account for individual calibre
These factors amount to (sometimes considerable) noise. It is remarkable
that despite them we can make quantitative statements.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
L’évaluation des groupes de recherche
100
applied mathematics
s
75
Cambridge
Oxford
Liverpool
50
25
Warwick
QMUL
Coventry
Brighton
UWE
Plymouth
0
0
50
institution
clearly : s = constant ± noise.
Quality is randomly distributed about a mean, with older, prestigious
universities tending to be above and newer ones below.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
L’évaluation des groupes de recherche
One may expect that, on average, a group of size N = 10 is twice as strong
as a group of size N = 5 : The strength of the group g is the sum of the
strengths ag i of its N individuals :
Sg =
N
X
ag i = N āg
i=1
so that the quality of the group is the average strength per head of its
individuals sg = āg .
This is also the philosophy behind RAE rankings.
Ex : if S = strength of a group of size N, one may expect S ∝ N then we
“explain” that
s = constant ± noise.
But this is not the full picture....
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Corrélation taille-qualité
100
applied mathematics
s
75
Warwick Oxford
Liverpool
50
Cambridge
QMUL
25
Coventry
Brighton
UWE
0
0
10 20 30 40 50 60 70 80 90 100
N
Quality s is clearly correlated with groups size N : s = s(N).
What is the nature of this correlation ?
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Les relations entre individus comme moteur de la
qualité
Instead, we have to treat (research) groups as collaborating systems. With
ai
=
strength of individual i,
bi,j
=
strength of interaction between individuals i
and j
the strength of the (small) group is
S = N ā +
N(N − 1)
b̄.
2
However, two-way communication cannot be carried out between every pair
of nodes if N is too large. If N > Nc , N/Nc subgroups form and interact with
strength β,
„
«
N
b̄
β N
−1 .
S = āN + NNc +
Nc
|
{z2
} |2 Nc {z
}
intra-subgroup
inter-subgroup interaction
I.e., we have a different linear dependency of quality s on size N.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Les relations entre individus. . .
We therefore expect
s=
(
a1 + b1 N
a2 + b2 N
if N ≤ Nc
if N ≥ Nc
b1 ≃ 21 b
b1 ≃ 2Nβ 2 .
c
If N > Nc , we call the group large.
If N < Nc , we call the group small or medium.
Since b2 ∼ 1/Nc2 , the slope to the right is small for large Nc .
100
applied mathematics
We have now explained
s
75
Warwick Oxford
Liverpool
50
Cambridge
QMUL
25
Coventry
Brighton
UWE
0
0
10 20 30 40 50 60 70 80 90 100
N
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Natural Sciences
We start the analysis with the natural sciences
100
100
physics
geography and environment
75
s
s
75
50
25
50
25
0
0
0
50
100
150
0
30
60
N
N
We measure Nk = 13 ± 3
(with R 2 = 53%)
and Nk = 15 ± 2
(with R 2 = 66%).
Compare to the result for applied mathematics where we find
Nk = 6 ± 1
(with R 2 = 74%)
Groups
90
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Pure mathematics
100
pure mathematics
75
s
In pure mathematics, no crossover
point was found :
50
25
0
0
20
40
60
N
A linear fit has relatively small slope and high intercept implying all groups
are large.
The smallest group submitted had N = 4. So we may estimate Nc
Nk
<
∼
<
∼
2.
I.e., pure mathematicians work alone or in pairs (size does not matter !).
Groups
4 or
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Other subject areas
100
100
French, German, Dutch
and Scandinavian languages
English
75
s
s
75
50
25
50
25
0
0
0
15
30
45
0
30
60
N
N
Nk = 3 ± 1
Nk = 16 ± 2
Groups
90
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Stability
This year (2010) the HEFCE funding formula will change to s = p4∗ +
3
9
p3∗ +
1
9
p3∗ .
The effect on the above analysis is
100
100
applied mathematics
(2009 funding formula)
applied mathematics
(2010 funding formula)
75
s
s
75
50
25
50
25
0
0
0
25
50
75
100
0
25
50
75
100
N
N
Nk = 6.2 ± 0.9 (R 2 = 74.3)
Nk = 6.3 ± 1.0 (R 2 = 73.5)
Similarly for physics
Nk = 12.7 ± 2.4 (R 2 = 53.0) → Nk = 12.6 ± 2.3 (R 2 = 54.4)
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
University ranking
For applied mathematics, we plot s − s̄ and s − hsi against N, highlighting the
various university groupings
50
50
applied mathematics
applied mathematics
s - s
|
s - <s>
Warwick
QMUL
25
0
-25
Coventry
-50
25
Russell Group
1994 Group
Million+
University Alliance
Unaffiliated
Warwick
QMUL
0
Coventry
-25
-50
0
10
20
30
40
institution
50
0
10
20
30
40
50
institution
Left plot represents deviation of data from global average. St. dev. = 12.6,
range = 52.9
Right plot represents deviation of data from local average. St. dev. = 6.4,
range = 28.8
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
University ranking
Same plots for physics :
50
50
physics
s - s
Warwick
|
0
QMUL
-25
-50
institution
s - <s>
physics
25
Russell Group
1994 Group
Million+
University Alliance
Unaffiliated
25
0
QMUL
Warwick
-25
-50
institution
Left plot (global average) : St. dev. = 7.8, range = 32.1
Right plot (local average) : St. dev. = 5.3, range = 25.1
Left line separates groupings ; right line goes through groupings
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
The French System (AERES)
Cumulative effect :
80
Paris 7
Exact Sciences
Sciences of Life
Strsbg 1
60
Paris 6
Paris 13
Strsbg 1
Paris 12
Nancy 1
40
Paris 6
Metz
Paris 7
Paris 10
Nancy 1
Paris 12
20
Mulhouse
Paris 10
Paris 8
Metz
0
0
1000
500
Groups
1500
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
The French System (AERES)
We plot the standardised data for the hard and life sciences in the UK (red)
and France (13 universities of 2008 assessement, blue), where
σ=
ν=
2
1
1
0
0
-1
-1
-2
-2
-3
-1.5
0
N − N̄
σN
3 (b) life sciences
(a) hard sciences
2
σ
σ
3
s − s̄
σs
ν
1.5
3
-3
-1.5
0
Groups
ν
1.5
3
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Saturation of Russell Group
Times Higher Education (THE), [29 Nov 2009]
“Analysis questions call to concentrate funding on Russell Group
universities”
100
applied mathematics
75
s
“Research produced by the bulk
of the elite Russell Group of ... is
no better than that generated by
[the 1994 Group], according to a
new analysis. Thomson Reuters ...
compared the percentages of all cited research papers published by
the two groups between 2002 and
2006...”
50
25
0
0
10 20 30 40 50 60 70 80 90 100
N
Clearly, both the Russell Group and the 1994 Group are in fact large and
have hit saturation point.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Saturation of Russell Group
The same plot for physics
100
Russell Group
1994 Group
Million+
University Alliance
Unaffiliated
physics
s
75
50
25
0
0
30
60
90
120
N
Saturation beyond Nc ≈ 25 explains the Thomson Reuters findings.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Critical Mass
Critical mass is the minimum size a research team must achieve for it to be
viable in the longer term.
The total strength of a group is
S(N) = ai N + bi N 2
where i = 1 or 2 if N < Nc or N > Nc .
If M new nodes become available where should they be allocated ?
S(N + M) = ai (N + M) + bi (N + M)2 .
The average increase in societal strength is
depends on whether N < Nc or N > Nc .
∆S
M
= ai + bi (2N + M) and
Next we take the limit M → 0. (i.e., we simply maximize the gradient of S(N)).
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Critical Mass
It turns out it is best to support the small/medium group if N > Nk where
Nk =
Nc
.
2
If N < Nk , we call the group small.
If Nk < N < Nc , we call the group medium.
It is best to support the medium group (N > Nk ). Then to support the large
group (N > Nc ) and then the small one.
I.e.,
Nk = critical mass.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Critical Mass
750
applied mathematics
S
500
250
0
0
5
10
15
20
25
N
Without the crossover point, the steepest gradient condition would lead to
supporting only the biggest among the large groups so that only one group
remains in the end.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Critical masses
The estimates for the critical masses of 24 (combined) subjects are
Subject
Applied mathematics
Physics
Earth sciences
Biology‡
Chemistry
Agriculture, vet. etc‡
Law†
Architecture & planning
Non-English languages
English
Pure mathematics∗
Medical sciences
Nk = Nc /2
Subject
Nk = Nc /2
6±1
13 ± 3
15 ± 2
10 ± 2
18 ± 7
5±2
15 ± 2
7±2
3±1
16 ± 2
≤2
20 ± 4
Nursing, etc.
Computer science♯
Archaeology♯
Economics
Business
Politics
Sociology
Education
History‡
Philosophy & theology
Art & design‡
Other arts
9±3
25 ± 5
8±2
5±2
24 ± 4
13 ± 3
7±2
15 ± 3
12 ± 3
9±2
12 ± 4
4±1
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Bibliometry vs peer review
Two line fit of Evidence data gives half value for Nc and worse R 2 . One
should prefer peers review than bibliometry.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Conclusions
Community is greater than sum of it parts ...
... an effect which saturates beyond Nc
It is unwise to judge a group solely on quality profile : small/medium
groups should not be expected to yield same strength as large ones
It is unwise to judge individuals in a group on the basis of the group
strength as this neglects interactions
Need to take size into account to determine which groups are punching
above or below their weight
other factors (e.g. individual excellence) seem to contribute
subdominantly (i.e. noise here !) compared to individual interactions
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Conclusions
Strength is primarily ascribed to two-way communication links.
Universities should facilitate this
Hotdesking bad !
Distance working is bad !
Collaboration is good !
Medium-sized groups should be supported
BUT small groups must endeavour to achieve critical mass
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Natural Sciences
100
100
biology
chemistry
75
s
s
75
50
25
50
25
0
0
0
50
100
150
200
250
0
20
40
60
N
N
We measure Nk = 10 ± 2
(with R 2 = 54%)
and Nk = 18 ± 7
(with R 2 = 62%).
80
Note the large errors for chemistry. The smallest chemistry group had N = 10
(cf N = 1 for physics and N = 3 for biology). So chemistry in the UK is
dominated by large/medium groups, the small ones having petered out prior
to 2008.
Groups
L’enjeu
Le “RAE”
Le modèle empirique
Conclusions
Other subject areas
100
100
philosophy and theology
history
75
s
s
75
50
25
50
25
0
0
0
20
40
60
80
100
0
30
60
90
N
N
Nk = 9 ± 2
Nk = 12 ± 2
Groups
120

Documents pareils