Package: OptHoldoutSize 0.1.0.0

OptHoldoutSize: Estimation of Optimal Size for a Holdout Set for Updating a Predictive Score

Predictive scores must be updated with care, because actions taken on the basis of existing risk scores causes bias in risk estimates from the updated score. A holdout set is a straightforward way to manage this problem: a proportion of the population is 'held-out' from computation of the previous risk score. This package provides tools to estimate a size for this holdout set and associated errors. Comprehensive vignettes are included. Please see: Haidar-Wehbe S, Emerson SR, Aslett LJM, Liley J (2022) <arxiv:2202.06374> for details of methods.

Authors:Sami Haidar-Wehbe [aut], Sam Emerson [aut], Louis Aslett [aut], James Liley [cre, aut]

OptHoldoutSize_0.1.0.0.tar.gz
OptHoldoutSize_0.1.0.0.zip(r-4.5)OptHoldoutSize_0.1.0.0.zip(r-4.4)OptHoldoutSize_0.1.0.0.zip(r-4.3)
OptHoldoutSize_0.1.0.0.tgz(r-4.5-any)OptHoldoutSize_0.1.0.0.tgz(r-4.4-any)OptHoldoutSize_0.1.0.0.tgz(r-4.3-any)
OptHoldoutSize_0.1.0.0.tar.gz(r-4.5-noble)OptHoldoutSize_0.1.0.0.tar.gz(r-4.4-noble)
OptHoldoutSize_0.1.0.0.tgz(r-4.4-emscripten)OptHoldoutSize_0.1.0.0.tgz(r-4.3-emscripten)
OptHoldoutSize.pdf |OptHoldoutSize.html✨
OptHoldoutSize/json (API)
NEWS

# Install 'OptHoldoutSize' in R:

install.packages('OptHoldoutSize', repos = c('https://jamesliley.r-universe.dev', 'https://cloud.r-project.org'))

Datasets:

aspre_emulation - Emulation-based OHS estimation for ASPRE
aspre_parametric - Parametric-based OHS estimation for ASPRE
ci_cover_a_yn - Data for example on asymptotic confidence interval for OHS.
ci_cover_cost_a_yn - Data for example on asymptotic confidence interval for min cost.
ci_cover_cost_e_yn - Data for example on empirical confidence interval for min cost.
ci_cover_e_yn - Data for example on empirical confidence interval for OHS.
data_example_simulation - Data for vignette showing general example
data_nextpoint_em - Data for 'next point' demonstration vignette on algorithm comparison using emulation algorithm
data_nextpoint_par - Data for 'next point' demonstration vignette on algorithm comparison using parametric algorithm
ohs_array - Data for vignette on algorithm comparison
ohs_resample - Data for vignette on algorithm comparison
params_aspre - Parameters of reported ASPRE dataset

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.18 score 10 scripts 173 downloads 30 exports 9 dependencies

Last updated 3 years agofrom:923c183550. Checks:3 OK, 6 NOTE. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 13 2025
R-4.5-win	NOTE	Mar 13 2025
R-4.5-mac	NOTE	Mar 13 2025
R-4.5-linux	NOTE	Mar 13 2025
R-4.4-win	NOTE	Mar 13 2025
R-4.4-mac	NOTE	Mar 13 2025
R-4.4-linux	NOTE	Mar 13 2025
R-4.3-win	OK	Mar 13 2025
R-4.3-mac	OK	Mar 13 2025

Exports:add_aspre_interactions aspre aspre_k2 ci_mincost ci_ohs cov_fn error_ohs_emulation exp_imp_fn gen_base_coefs gen_preds gen_resp grad_mincost_powerlaw grad_nstar_powerlaw logistic logit model_predict model_train mu_fn next_n optimal_holdout_size optimal_holdout_size_emulation oracle_pred powerlaw powersolve powersolve_general powersolve_se psi_fn sens10 sim_random_aspre split_data

Dependencies:lattice Matrix matrixStats mle.tools mnormt mvtnorm ranger Rcpp RcppEigen

ASPRE example

Sami Haidar-Wehbe, Sam Emerson, Louis Aslett, James Liley

Rendered fromASPRE_example.Rmdusingknitr::knitron Mar 13 2025.

Last update: 2022-02-09
Started: 2022-02-09

Comparison of algorithms

Sami Haidar-Wehbe, Sam Emerson, Louis Aslett, James Liley

Rendered fromcomparison_of_algorithms.Rmdusingknitr::knitron Mar 13 2025.

Last update: 2022-02-18
Started: 2022-02-09

Simulated example

Sami Haidar-Wehbe, Sam Emerson, Louis Aslett, James Liley

Rendered fromsimulated_example.Rmdusingknitr::knitron Mar 13 2025.

Last update: 2022-02-09
Started: 2022-02-09

Help page	Topics
Add interaction terms corresponding to ASPRE model	add_aspre_interactions
Computes ASPRE score	aspre
Emulation-based OHS estimation for ASPRE	aspre_emulation
Cost estimating function in ASPRE simulation	aspre_k2
Parametric-based OHS estimation for ASPRE	aspre_parametric
Data for example on asymptotic confidence interval for OHS.	ci_cover_a_yn
Data for example on asymptotic confidence interval for min cost.	ci_cover_cost_a_yn
Data for example on empirical confidence interval for min cost.	ci_cover_cost_e_yn
Data for example on empirical confidence interval for OHS.	ci_cover_e_yn
Confidence interval for minimum total cost, when estimated using parametric method	ci_mincost
Confidence interval for optimal holdout size, when estimated using parametric method	ci_ohs
Covariance function for Gaussian process	cov_fn
Data for vignette showing general example	data_example_simulation
Data for 'next point' demonstration vignette on algorithm comparison using emulation algorithm	data_nextpoint_em
Data for 'next point' demonstration vignette on algorithm comparison using parametric algorithm	data_nextpoint_par
Measure of error for emulation-based OHS emulation	error_ohs_emulation
Expected improvement	exp_imp_fn
Coefficients for imperfect risk score	gen_base_coefs
Generate matrix of random observations	gen_preds
Generate response	gen_resp
Gradient of minimum cost (power law)	grad_mincost_powerlaw
Gradient of optimal holdout size (power law)	grad_nstar_powerlaw
Logistic	logistic
Logit	logit
Make predictions	model_predict
Train model (wrapper)	model_train
Updating function for mean.	mu_fn
Finds best value of n to sample next	next_n
Data for vignette on algorithm comparison	ohs_array
Data for vignette on algorithm comparison	ohs_resample
Estimate optimal holdout size under parametric assumptions	optimal_holdout_size
Estimate optimal holdout size under semi-parametric assumptions	optimal_holdout_size_emulation
Generate responses	oracle_pred
Parameters of reported ASPRE dataset	params_aspre
Plot estimated cost function	plot.optholdoutsize
Plot estimated cost function using emulation (semiparametric)	plot.optholdoutsize_emul
Power law function	powerlaw
Fit power law curve	powersolve
General solver for power law curve	powersolve_general
Standard error matrix for learning curve parameters (power law)	powersolve_se
Updating function for variance.	psi_fn
Sensitivity at theshold quantile 10%	sens10
Simulate random dataset similar to ASPRE training data	sim_random_aspre
Split data	split_data

Package: OptHoldoutSize 0.1.0.0

OptHoldoutSize: Estimation of Optimal Size for a Holdout Set for Updating a Predictive Score

ASPRE example

Comparison of algorithms

Simulated example

Citation

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)