Title: | Sequential Rank Agreement |
---|---|
Description: | Tools for analysing the agreement of two or more rankings of the same items. Examples are importance rankings of predictor variables and risk predictions of subjects. Benchmarks for agreement are computed based on random permutation and bootstrap. See Ekstrøm CT, Gerds TA, Jensen, AK (2018). "Sequential rank agreement methods for comparison of ranked lists." _Biostatistics_, *20*(4), 582-598 <doi:10.1093/biostatistics/kxy017> for more information. |
Authors: | Claus Thorn Ekstrøm [aut, cre], Thomas Alexander Gerds [aut] |
Maintainer: | Claus Thorn Ekstrøm <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.2.1 |
Built: | 2024-12-30 05:51:29 UTC |
Source: | https://github.com/cran/SuperRanker |
Compute the average overlap
average_overlap(obj)
average_overlap(obj)
obj |
Either a vector or matrix |
A vector of the average overlap
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) average_overlap(mlist)
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) average_overlap(mlist)
Computes the overlap (number of items present in all k lists divided by the current rank) for each rank in the k lists
overlap(rankMat)
overlap(rankMat)
rankMat |
A matrix with k columns corresponding to the k ranked lists. Elements of each column are integers between 1 and the length of the lists |
A vector of the same length as the rows in rankMat containing the overlap between the lists for each rank
Claus Ekstrøm <[email protected]>
Plot the agreement between lists as a function of the list depth
## S3 method for class 'sra' plot( x, xlim, ylim, xlab = "List depth", ylab = "Sequential rank agreement", add = FALSE, ... )
## S3 method for class 'sra' plot( x, xlim, ylim, xlab = "List depth", ylab = "Sequential rank agreement", add = FALSE, ... )
x |
Agreement object |
xlim |
x-axis limits |
ylim |
y-axis limits |
xlab |
x-axis lab |
ylab |
y-axis lab |
add |
Logical. If |
... |
Processed by function |
Graph
Thomas A. Gerds <[email protected]>
R1=c(1,2,3,4,5,7,6,8,9,10,11,12,13) R2=c(5,11,4,7,8,3,12,13,6,10,9,2,1) a <- sra(list(R1,R2)) plot(a) arand = colMeans(do.call("rbind",lapply(1:20,function(b){ sra(list(sample(R1),sample(R1))) }))) lines(1:length(R1),arand,col=2,lwd=3) l <- c(1,2,3,4,5,7,6,8,9,10,11,12,13) l <- 1:100 aa <- sapply(1:20,function(i){ sra(list(sample(l),sample(l),sample(l)))[i] }) c(mean(aa),sd(aa))
R1=c(1,2,3,4,5,7,6,8,9,10,11,12,13) R2=c(5,11,4,7,8,3,12,13,6,10,9,2,1) a <- sra(list(R1,R2)) plot(a) arand = colMeans(do.call("rbind",lapply(1:20,function(b){ sra(list(sample(R1),sample(R1))) }))) lines(1:length(R1),arand,col=2,lwd=3) l <- c(1,2,3,4,5,7,6,8,9,10,11,12,13) l <- 1:100 aa <- sapply(1:20,function(i){ sra(list(sample(l),sample(l),sample(l)))[i] }) c(mean(aa),sd(aa))
Simulate sequential rank agreement from completely uninformative lists (ie., raw permutations of items) and compute the corresponding sequential rank agreement curves. The following attributes are copied from the input object: number of lists, number of items and amount of censoring.
random_list_sra( object, B = 1, n = 1, na.strings = NULL, nitems = NULL, type = c("sd", "mad"), epsilon = 0 )
random_list_sra( object, B = 1, n = 1, na.strings = NULL, nitems = NULL, type = c("sd", "mad"), epsilon = 0 )
object |
A matrix of numbers or list of vectors representing ranked lists. |
B |
An integer giving the number of randomizations to sample over in the case of censored observations |
n |
Integer: the number of permutation runs. For each permutation run we permute each of the lists in object and compute corresponding the sequential rank agreement curves |
na.strings |
A vector of character values that represent censored observations |
nitems |
The total number of items in the original lists if we only have partial lists available. Will be derived from the unique elements of the object if set to |
type |
The type of measure to use. Either sd (standard deviation - the default) or mad (median absolute deviance) |
epsilon |
A non-negative numeric vector that contains the minimum limit in proportion of lists that must show the item. Defaults to 0. If a single number is provided then the value will be recycles to the number of items. Should usually be low. |
A matrix with n columns and the same number of rows as for the input object. Each column contains one simulated sequential rank agreement curve from one permutation run.
Claus Ekstrøm <[email protected]>
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute sequential rank agreement of lists sra(mlist) # compute sequential rank agreement of 5 random permutations random_list_sra(mlist, n=5)
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute sequential rank agreement of lists sra(mlist) # compute sequential rank agreement of 5 random permutations random_list_sra(mlist, n=5)
Smooth quantiles of a matrix of sequential ranked agreements.
smooth_sra(object, confidence = 0.95)
smooth_sra(object, confidence = 0.95)
object |
A matrix |
confidence |
the limits to compute |
A list containing two vectors for the smoothed lower and upper limits
Claus Ekstrøm <[email protected]>
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute rank agreement of 5 random permutations null=random_list_sra(mlist,n=15) # now extract point-wise quantiles according to confidence level smooth_sra(null)
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute rank agreement of 5 random permutations null=random_list_sra(mlist,n=15) # now extract point-wise quantiles according to confidence level smooth_sra(null)
Compute the sequential rank agreement
sra(object, B, na.strings, nitems, type, epsilon = 0, ...) ## Default S3 method: sra(object, B, na.strings, nitems, type, epsilon = 0, ...) ## S3 method for class 'matrix' sra( object, B = 1, na.strings = NULL, nitems = nrow(object), type = c("sd", "mad"), epsilon = 0, ... ) ## S3 method for class 'list' sra( object, B = 1, na.strings = NULL, nitems = max(sapply(object, length)), type = c("sd", "mad"), epsilon = 0, ... )
sra(object, B, na.strings, nitems, type, epsilon = 0, ...) ## Default S3 method: sra(object, B, na.strings, nitems, type, epsilon = 0, ...) ## S3 method for class 'matrix' sra( object, B = 1, na.strings = NULL, nitems = nrow(object), type = c("sd", "mad"), epsilon = 0, ... ) ## S3 method for class 'list' sra( object, B = 1, na.strings = NULL, nitems = max(sapply(object, length)), type = c("sd", "mad"), epsilon = 0, ... )
object |
Either matrix where each column is a ranked list of items or a list of ranked lists of items. Elements are integers between 1 and the length of the lists. The lists should have the same length but censoring can be used by setting the list to zero from a point onwards. See details for more information. |
B |
An integer giving the number of randomization to sample over in the case of censored observations |
na.strings |
A vector of strings/values that represent missing values in addition to NA. Defaults to NULL which means only NA are censored values. |
nitems |
The total number of items in the original lists if we only have partial lists available. |
type |
The type of measure to use. Either sd (standard deviation - the default) or mad (median absolute deviance around the median) |
epsilon |
A non-negative numeric vector that contains the minimum limit in proportion of lists that must show the item. Defaults to 0. If a single number is provided then the value will be recycles to the number of items. |
... |
Arguments passed to methods. |
A vector of the sequential rank agreement
Claus Ekstrøm <[email protected]> and Thomas A Gerds <[email protected]>
mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) sra(mlist) mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) sra(mlist, nitems=20, B=10) alist <- list(a=1:8,b=sample(1:8),c=sample(1:8)) sra(alist) blist <- list(x1=letters,x2=sample(letters),x3=sample(letters)) sra(blist) ## censored lists are either too short clist <- list(x1=c("a","b","c","d","e","f","g","h"), x2=c("h","c","f","g","b"), x3=c("d","e","a")) set.seed(17) sra(clist,na.strings="z",B=10) ## or use a special code for missing elements Clist <- list(x1=c("a","b","c","d","e","f","g","h"), x2=c("h","c","f","g","b","z","z","z"), x3=c("d","e","a","z","z","z","z","z")) set.seed(17) sra(Clist,na.strings="z",B=10)
mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) sra(mlist) mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) sra(mlist, nitems=20, B=10) alist <- list(a=1:8,b=sample(1:8),c=sample(1:8)) sra(alist) blist <- list(x1=letters,x2=sample(letters),x3=sample(letters)) sra(blist) ## censored lists are either too short clist <- list(x1=c("a","b","c","d","e","f","g","h"), x2=c("h","c","f","g","b"), x3=c("d","e","a")) set.seed(17) sra(clist,na.strings="z",B=10) ## or use a special code for missing elements Clist <- list(x1=c("a","b","c","d","e","f","g","h"), x2=c("h","c","f","g","b","z","z","z"), x3=c("d","e","a","z","z","z","z","z")) set.seed(17) sra(Clist,na.strings="z",B=10)
Computes the sequential rank agreement (number of items present in all k lists divided by the current rank) for each rank in the k lists
sracpp(rankMat, maxlength, B, cens, type = 0L, epsilon = as.numeric(c(0)))
sracpp(rankMat, maxlength, B, cens, type = 0L, epsilon = as.numeric(c(0)))
rankMat |
A matrix with k columns corresponding to the k ranked lists. Elements of each column are integers between 1 and the length of the lists |
maxlength |
The maximum depth that are needed XXX |
B |
The number of resamples to use in the presence of censored lists |
cens |
A vector of integer values that |
type |
The type of distance measure to use: 0 (the default) is the variance while 1 is MAD (median absolute deviation) |
epsilon |
A non-negative numeric vector that contains the minimum limit in proportion of lists that must show the item. Defaults to 0. If a single number is provided then the value will be recycles to the number of items. |
A vector of the same length as the rows in rankMat containing the squared (!) sequential rank agreement between the lists for each depth. If the MAD type was chosen then the sequential MAD values are returned
Claus Ekstrøm <[email protected]>
Computes the sequential rank agreement (number of items present in all k lists divided by the current rank) for each rank in the k lists
sracppfull(rankMat, type = 0L, epsilon = as.numeric(c(0)))
sracppfull(rankMat, type = 0L, epsilon = as.numeric(c(0)))
rankMat |
A matrix with k columns corresponding to the k ranked lists. Elements of each column are integers between 1 and the length of the lists |
type |
The type of distance measure to use: 0 (the default) is the variance while 1 is MAD (mean absolute deviation) |
epsilon |
A non-negative numeric vector that contains the minimum limit in proportion of lists that must show the item. Defaults to 0. If a single number is provided then the value will be recycles to the number of items. |
A vector of the same length as the rows in rankMat containing the sequential rank agreement between the lists for each depth (squared for type=0)
Claus Ekstrøm <[email protected]>
SuperRanker allows you to estimate the agreement between two or more rankings of the same items.
Compute a Kolmogorov-Smirnoff-like test for Smooth quantiles of a matrix of sequential rank agreements
test_sra(object, nullobject, weights = 1)
test_sra(object, nullobject, weights = 1)
object |
An object created with |
nullobject |
An object created with |
weights |
Either a single value or a vector of the same length as the number of item with the weight that should be given to specific depths. |
A single value corresponding to the p-value
Claus Ekstrøm <[email protected]>
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute sequential rank agreements x=sra(mlist) # compute rank agreement of 5 random permutations null=random_list_sra(mlist,n=15) # now extract point-wise quantiles according to confidence level test_sra(x,null) # compare to when we use the result of the first permutation run test_sra(null[,1],null[,-1])
# setting with 3 lists mlist <- matrix(cbind(1:8,c(1,2,3,5,6,7,4,8),c(1,5,3,4,2,8,7,6)),ncol=3) # compute sequential rank agreements x=sra(mlist) # compute rank agreement of 5 random permutations null=random_list_sra(mlist,n=15) # now extract point-wise quantiles according to confidence level test_sra(x,null) # compare to when we use the result of the first permutation run test_sra(null[,1],null[,-1])