Releases: tidymodels/infer
Releases · tidymodels/infer
Standardized proportion z statistic, improvements to various helpers
-
rep_sample_n()no longer errors when supplied aprobargument (#279) - Added
rep_slice_sample(), a light wrapper aroundrep_sample_n(), that more closely resemblesdplyr::slice_sample()(the function that supersedesdplyr::sample_n()) (#325) - Added a
success,correct, andzargument toprop_test()(#343, #347, #353) - Implemented observed statistic calculation for the standardized proportion
$z$ statistic (#351, #353) - Various bug fixes and improvements to documentation and errors.
Bias-corrected confidence intervals
get_confidence_interval()can now produce bias-corrected confidence intervals
by settingtype = "bias-corrected". Thanks to @davidbaniadam for the
initial implementation (#237, #318)!get_confidence_interval()now uses column names ('lower_ci' and 'upper_ci')
in output that are consistent with other infer functionality (#317).- Fix CRAN check failures related to long double errors.
New test statistics and vignettes, improved warnings/errors
- Warn the user when a p-value of 0 is reported (#257, #273)
- Added new vignettes:
chi_squaredandanova(#268) - Updates to documentation and existing vignettes (#268)
- Add alias for
hypothesize()(hypothesise()) (#271) - Subtraction order no longer required for difference-based tests--a warning will be raised in the case that the user doesn't supply an
orderargument (#275, #281) - Add new messages for common errors (#277)
- Increase coverage of theoretical methods in documentation (#278, #280)
- Drop missing values and reduce size of
gssdataset used in examples (#282) - Add
stat = "ratio of props"andstat = "odds ratio"tocalculate(#285) - Add
prop_test(), a tidy interface toprop.test()(#284, #287) - Updates to
visualize()for compatibility withggplot2v3.3.0 (#289) - Fix error when bootstrapping with small samples and raise warnings/errors
when appropriate (#239, #244, #291) - Fix unit test failures resulting from breaking changes in
dplyrv1.0.0 - Fix error in
generate()when response variable is namedx(#299) - Add
two-sidedandtwo sidedas aliases fortwo_sidedfor the
directionargument inget_p_value()andshade_p_value()(#302) - Fix
t_test()andt_stat()ignoring theorderargument (#310)
Documentation and other tweaks
0.5.1 Update NEWS.md
Update chi squared tests
infer 0.5.0
Breaking changes
shade_confidence_interval()now plots vertical lines starting from zero (previously - from the bottom of a plot) (#234).shade_p_value()now uses "area under the curve" approach to shading (#229).
Other
- Updated
chisq_test()to take arguments in a response/explanatory format, perform goodness of fit tests, and default to the approximation approach (#241). - Updated
chisq_stat()to do goodness of fit (#241). - Make interface to
hypothesize()clearer by adding the options for the point null parameters to the function signature (#242). - Manage
inferclass more systematically (#219). - Use
vdiffrfor plot testing (#221).
change p-val computation; add visualization layers
infer 0.4.0
Breaking changes
- Changed method of computing two-sided p-value to a more conventional one. It also makes
get_pvalue()andvisualize()more aligned (#205).
Deprecation changes
- Deprecated
p_value()(useget_p_value()instead) (#180). - Deprecated
conf_int()(useget_confidence_interval()instead) (#180). - Deprecated (via warnings) plotting p-value and confidence interval in
visualize()(use new functionsshade_p_value()andshade_confidence_interval()instead) (#178).
New functions
shade_p_value()- {ggplot2}-like layer function to add information about p-value region tovisualize()output. Has aliasshade_pvalue().shade_confidence_interval()- {ggplot2}-like layer function to add information about confidence interval region tovisualize()output. Has aliasshade_ci().
Other
- Account for
NULLvalue in left hand side of formula inspecify()(#156) andtypeingenerate()(#157). - Update documentation code to follow tidyverse style guide (#159).
- Remove help page for internal
set_params()(#165). - Fully use {tibble} (#166).
- Fix
calculate()to not depend on order ofpfortype = "simulate"(#122). - Reduce code duplication (#173).
- Make transparancy in
visualize()to not depend on method and data volume. - Make
visualize()work for "One sample t" theoretical type withmethod = "both". - Add
stat = "sum"andstat = "count"options tocalculate()(#50).
Bug fixes and switch to {glue}
- Stop using package {assertive} in favor of custom type checks (#149)
- Fixed
t_stat()to use...sovar.equalworks - With the help of @echasnovski, fixed
var.equal = TRUEforspecify() %>% calculate(stat = "t") - Use custom functions for error, warning, message, and
paste()handling (#155)
Add p_value and conf_int functions and observed stat shortcut with specify() %>% calculate()
- Added
conf_intlogical argument andconf_levelargument tot_test() - Switched
shade_colorargument invisualize()to bepvalue_fillinstead
since fill color for confidence intervals is also added now - Shading for Confidence Intervals in
visualize()- Green is default color for CI and red for p-values
-
direction = "between"to get the green shading - Currently working only for simulation-based methods
- Implemented
conf_int()function for computing confidence interval provided a simulation-based method with astatvariable-
get_ci()andget_confidence_interval()are aliases forconf_int() - Converted longer confidence interval calculation code in vignettes to use
get_ci()instead
-
- Implemented
p_value()function for computing p-value provided a simulation-based method with astatvariable-
get_pvalue()is an alias forp_value() - Converted longer p-value calculation code in vignettes to use
get_pvalue()instead
-
- Implemented Chi-square Goodness of Fit observed stat depending on
paramsbeing set inhypothesizewithspecify() %>% calculate()shortcut - Removed "standardized" slope
$t$ since its formula is different than "standardized" correlation and there is no way currently to give one over the other - Implemented correlation with bootstrap CI and permutation hypothesis test
- Filled the
typeargument automatically ingenerate()based
onspecify()andhypothesize()- Added message if
typeis given differently than expected
- Added message if
- Implemented
specify() %>% calculate()for getting observed
statistics.-
visualize()works with either a 1x1 data frame or a vector
for itsobs_statargument - Got
stat = "t"working
-
- Refactored
calculate()into smaller functions to reduce complexity - Produced error if
muis given inhypothesize()butstat = "median"
is provided incalculate()and other similar mis-specifications - Tweaked
chisq_stat()andt_stat()to match withspecify() %>% calculate()framework- Both work in the one sample and two sample cases by providing
formula - Added
orderargument tot_stat()
- Both work in the one sample and two sample cases by providing
- Added implementation of one sample
t_test()by passing in themuargument tot.test
fromhypothesize() - Tweaked
pkgdownpage to include ToDo's using {dplyr} example
Theoretical distributions to visualize and a few wrapper functions
- Switched to
!!instead ofUQ()sinceUQ()is deprecated in
{rlang} 0.2.0 - Added many new files:
CONDUCT.md,CONTRIBUTING.md, andTO-DO.md - Updated README file with more development information
- Added wrapper functions
t_test()andchisq_test()that use a
formula interface and provide an intuitive wrapper tot.test()and
chisq.test() - Created
stat = "z"andstat = "t"options - Added many new arguments to
visualize()to prescribe colors to shade and
use for observed statistics and theoretical density curves - Added check so that a bar graph created with
visualize()if number of
unique values for generated statistics is small - Added shading for
method = "theoretical" - Implemented shading for simulation methods w/o a traditional distribution
- Use percentiles to determine two-tailed shading
- Changed
method = "randomization"tomethod = "simulation" - Added warning when theoretical distribution is used that
assumptions should be checked - Added theoretical distributions to
visualize()alone and as overlay with
current implementations being- Two sample t
- ANOVA F
- One proportion z
- Two proportion z
- Chi-square test of independence
- Chi-square Goodness of Fit test
- Standardized slope (t)
Initial CRAN release
v0.1.0 add cran-comments.md