Commit graph

519 commits

Author SHA1 Message Date
f99b5d1888 added GetMLData.py for combined model and added to functions including previous ones that have been moved there 2022-06-25 14:12:07 +01:00
5d38cde912 added all run scripts for different splits 2022-06-24 20:39:50 +01:00
e2bc384155 added FS to MultClfs.py and modified data for different splits for consistency 2022-06-24 20:35:53 +01:00
edb7aebd6a saving 2022-06-24 15:43:00 +01:00
b37a950fec optimised run_7030.py to generate output from dict now that the processfunction and parameter dicts have been added 2022-06-24 15:40:18 +01:00
7dc7e25016 appended to sys.path to allow local imports 2022-06-24 13:41:07 +01:00
a15ab80bc6 added log_FS_pnca_7030.txt after running FS for pnca 2022-06-24 13:27:16 +01:00
96f4e7085a added test_MultClfs.py to test the functions now in a single script 2022-06-24 13:26:42 +01:00
a3c644d04b removed MultModelsCl.py and ProcessMultModelsCl.py as these are merged into a single script for convenience 2022-06-24 13:25:51 +01:00
fba1481c08 added MultClfs.py that contains my ML functions 2022-06-24 13:25:00 +01:00
19da36842b removed the two functions MultModelsCl.py and ProcessMultModelsCl.py as these have now been combined 2022-06-24 13:24:04 +01:00
ad99efedd7 saving work 2022-06-24 13:21:21 +01:00
3514e1b4ba added run_7030_LOOP.py to loop through the resampling data and get processed output 2022-06-23 21:29:54 +01:00
1d3190899d added ProcessMultModelsCl.py that processes the output for multiple models 2022-06-23 21:27:13 +01:00
4fe62c072b added metadata output for running multiple models 2022-06-23 21:25:00 +01:00
5dea35f97c added scripts for FS including test call, etc 2022-06-23 14:53:01 +01:00
8fe0048328 saving work 2022-06-23 14:52:27 +01:00
0350784d52 changed blind_test_input_df to blind_test_df in MultModelsCl 2022-06-22 16:42:04 +01:00
bc12dbd7c2 added run_7030.py that runs as cmd for all gene targets and sampling methods and outputs a single csv 2022-06-21 20:37:53 +01:00
5b0ccdfec4 added ml_data_fg.py 2022-06-21 18:21:41 +01:00
11ef627150 removed _dissected files and renamed them to _fg 2022-06-21 18:20:22 +01:00
fe0986aa28 added script to run ml baseline models orig version with feature groups 2022-06-21 18:17:56 +01:00
137f19a285 saving work 2022-06-21 18:12:31 +01:00
7b378ca6f3 adding formatting to get all output from ML for feature groups starting with genomics 2022-06-21 14:08:12 +01:00
cadaed2ba7 ML logs 2022-06-20 21:55:47 +01:00
4c5afa614f python scripts for original analysis with logs 2022-06-20 21:54:48 +01:00
8d8fc03f72 added test script to test dissected model 2022-06-20 21:53:15 +01:00
e68a153883 working on dissected model, testing diff feature groups 2022-06-20 21:51:07 +01:00
135efcee41 added option to add confusion matrix and target numbers in the mult function 2022-06-20 17:08:22 +01:00
905327bf4e script to run models based on group of features 2022-06-20 14:59:02 +01:00
4ab99dcbd2 saving work for yesterday where uq runs were repeated 2022-06-20 14:57:11 +01:00
efeaf52cde added ml runs for complete data with _cd_ in filenames reflecting this 2022-06-18 19:36:33 +01:00
9bc26c1947 slight formatting for existing scripts 2022-06-18 19:35:49 +01:00
a53fce5455 added notes for running ml scripts 2022-06-18 14:45:48 +01:00
e176d018cb added log files for these ml runs 2022-06-18 14:44:02 +01:00
5bd8ba33f7 added scripts for reverse training 2022-06-18 14:43:35 +01:00
d85415daf8 added scripts for scaling law split 2022-06-18 14:42:46 +01:00
4037641dfa added data and ml scripts for 8020 splits 2022-06-18 14:42:02 +01:00
2e50a555a0 minor formatting consistency for 7030 scripts 2022-06-18 14:41:05 +01:00
e05e4e2e38 added script for running 7030 split 2022-06-17 18:28:26 +01:00
91e868736c changed dir to allow ml script to import functions 2022-06-17 18:27:20 +01:00
e6d3692445 changed dir for reading func in pnca_config.py 2022-06-17 16:37:07 +01:00
96d4e61dca added baseline config files for running v2 ml analysis 2022-06-17 14:14:26 +01:00
05dd9698c4 added aa_index data for running ml 2022-06-17 13:41:25 +01:00
39ccd6cdf4 initial adding of ml scripts for baseline models 2022-06-17 13:40:09 +01:00
f355846dae added active site indication for merged_dfs in count_vars_ML.R and also added 'gene_name' in combining_dfs.py 2022-06-15 18:36:28 +01:00
1204f1faba added scripts and files to make AA index work for all drug targets, add header to the aa index output and fetch the aa index headers 2022-06-15 11:24:07 +01:00
03321c261e working new_aa.sh 2022-06-13 22:05:41 +01:00
c4ae6d2412 improved aa script 2022-06-13 21:48:44 +01:00
2307a19d86 added example bash cmds 2022-06-13 21:22:49 +01:00