Commit graph

80 commits

Author SHA1 Message Date
6f9e3b91a6 added custom rfecv to get names out from stackoverflow 2022-05-24 07:51:16 +01:00
f2d2f361f0 n_estimators range to include up to 1000 (Alex's suggestion) 2022-05-24 07:50:11 +01:00
b49c877f49 huge progress with getting feature names out from One Hot encoder 2022-05-24 07:48:00 +01:00
95852fa40e aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2022-05-24 02:33:56 +01:00
6db5046302 modifying output dict for printing in UQ_FS_eg.py 2022-05-23 22:52:57 +01:00
3c7d8690ee added feature selection on all models but let's see if it works, only worked until DT 2022-05-23 07:47:56 +01:00
a420822a93 saved the UQ_FS_eg with almost complete output, still some minor fixes to do 2022-05-23 03:35:35 +01:00
1436557287 saving work and wrapping up from the weekend 2022-05-23 00:31:02 +01:00
9839b6f8d1 remove __estimator 2022-05-22 23:39:35 +01:00
1a1154b4f4 remove ClfSwitcher() from this lot 2022-05-22 23:36:24 +01:00
18d9b77aee fs_* versions 2022-05-22 23:33:11 +01:00
c4142f441d remove redundant stuff 2022-05-22 23:31:58 +01:00
80e6b3af96 copy of ML dir to an FS-only version 2022-05-22 23:30:58 +01:00
52cc16f3fa added UQ_LR FS2.py that has the FS run with LR model as part of pipeline and gridsearch 2022-05-21 13:30:45 +01:00
39cd7b4259 finally made the fs work within class and without 2022-05-21 13:30:04 +01:00
4a9e9dfedf added scripts to try FS 2022-05-21 04:25:07 +01:00
3742a5f62d renamed UQ_LR_FS.py to UQ_LR_FS_p1.py 2022-05-21 04:24:28 +01:00
e16e82e673 added ml numbers from run for multiple models for pnca without fs and started a file for fs 2022-05-20 08:11:53 +01:00
c1d27f5a92 added UQ_LR_FS.py script for LR feature selection. So far this is manual; cannot get it to be part of pipeline 2022-05-20 08:10:44 +01:00
37bda41f44 added and ran hyperparam script for all different classifiers, but couldn't successfully run the feature selection and hyperparam together 2022-05-20 08:09:24 +01:00
74af5ef890 moved UQ_RF older version to earlier_versions dir 2022-05-19 08:32:33 +01:00
8b0f69bbd9 added practice and feature selection scripts for LR and hyperparam for all classification models as separate scripts in uq_ml_models 2022-05-19 08:31:16 +01:00
fa0f5e5b39 saving work with scripts for feature selection 2022-05-19 08:30:18 +01:00
a9dc3c43e5 added UQ_practice.py 2022-05-19 02:38:01 +01:00
ee163d3978 added UQ scripts to do hyperparam ML models 2022-05-19 02:37:00 +01:00
4dbc90ad44 modified ml params and models 2022-05-19 02:35:50 +01:00
3ed7840f60 added UQ import and ml call scripts 2022-05-16 08:06:56 +01:00
66ebb1a30a saving work before adding uq stuff 2022-05-16 08:05:41 +01:00
5fe2dc47cd added files and saving work 2022-05-05 19:44:19 +01:00
409caaf0bc added lineage and af count accounting for corrupt data 2022-04-08 17:00:57 +01:00
28d0d68413 added distinct lin count for each mutation 2022-04-07 18:47:53 +01:00
67d9e6160a added logoplot_example.R and ga_customers.csv 2022-04-05 14:52:08 +01:00
c647773520 saving work 2022-04-05 14:51:21 +01:00
6a9d23ec8f added sample test data for processing to get correct annotations 2022-03-24 17:42:02 +00:00
005efb1e0e updated NOTES to reflect importance of eg 5 in unsup_v1.py 2022-03-23 16:25:27 +00:00
89a0c3a58a added tutorial examples and my data workthrough examples in unsup_v1.py 2022-03-23 16:23:18 +00:00
ad5ebad7f8 renamed hyperparams to gscv 2022-03-22 11:08:20 +00:00
a82358dbb4 renamed practice_cv2 to cross_validate_vs_loopity_loop 2022-03-22 11:03:51 +00:00
0c4f1e1e5f added all classification algorithms params for gridsearch 2022-03-21 13:51:20 +00:00
d012542435 added NOTES to indicate which scripts are important 2022-03-18 17:56:26 +00:00
ffd3ce6ee3 added intra_model_gscv.py that tells me within each model which hyperparams are best, allowing me to choose the models with the best hyperparams to then compare 'INTER' model 2022-03-18 17:52:06 +00:00
d3b6fe13a6 added grid_search_vs_base_estimator.py to compare results from baseestimator and gridsearch manual 2022-03-18 17:51:38 +00:00
b27bfa4a96 added names and links for classification algo 2022-03-18 17:50:49 +00:00
824c2f041c saving work 2022-03-18 17:50:24 +00:00
ab1508e9fb added testing_lazypredict that runs 30 ML models in one go 2022-03-17 18:20:50 +00:00
de05652ef6 added scripts for playing base_estimator 2022-03-17 18:20:19 +00:00
5138036d8b playing with gridsearchCV and base estimator 2022-03-17 18:19:43 +00:00
458a933d73 added proof of concept checks to make sure loopity loop is equivalent to cross_validate with stratified Kfold passed as a cv param 2022-03-17 18:18:43 +00:00
d0c329a1d9 modified loopity and multclass3 to have skf_cv as a parameter for cv 2022-03-17 18:17:58 +00:00
97620c1bb0 added practice and base_estimator for all the confusion in my head 2022-03-16 10:12:59 +00:00