Extract pathways from KEGG database using EC entries.
Extract pathways from KEGG database using EC entries.
Input files contain interaction information in three columns:
Tissue: Breast, Lung, Kidney or Renal, and Urothelial
Condition: Cancer, Healthy
Set: MDS, nonMDS, where MDS means Minimum Dominanting Set
Example of file name:
EC_BreastCancer_MDS,
EC_UrothelialHealthy_nonDMS
In this script you have three functions:
This function is for extracting the list of pathways set by set and save two results:
This function is to calculate the p-value for proportions differences using prop.test() or fisher.test(). It is recomended to use Fisher test. The results of this function are:
This function joins the results of proportional test or Fisher exact test, and returns three results.
This script is to calculate the Fisher exact test p-value for the comparison of healthy and cancer pathways.
This script is to plot the figures used in this project report. It uses a tidy data file manipulated in excel with the raw results of pathways lists.
This script allows to extract the name of the diseases to which our list of pathways are associated. The input data in this script is a list of codes in KEGG format. For example: map01100 is the KEGG code for purine metabolism.
This is a shorter version and time saving pathway extraction script. This version is still in developing, but so far it extracts the list of all pathways, unlisting the list is not working correctly.