tinyarray
Expression Data Analysis and Visualization
The Gene Expression Omnibus (<https://www.ncbi.nlm.nih.gov/geo/>) and The Cancer Genome Atlas (<https://portal.gdc.cancer.gov/>) are widely used medical public databases. Our platform integrates routine analysis and visualization tools for expression data to provide concise and intuitive data analysis and presentation.
README
# tinyarray
### Introduction
Hi, I'm Xiao Jie. This is an R package I wrote based on my own data analysis needs. I'm glad you found it. I will update some useful functions here on the public account "bioinfoplanet" and also do some other sharing.
### Installation
#### 1.online
```
if(!require(tinyarray))install.packages("tinyarray")
if(!require(devtools))install.packages("devtools")
if(!require(tinyarray))devtools::install_github("xjsun1221/tinyarray",upgrade = FALSE,dependencies = TRUE)
```
#### 2.local
Click the green button "code" on this page, then click "Download ZIP" to download it to your working directory. Install it with `devtools::install_local("tinyarray-master.zip",upgrade = F,dependencies = T)`.
### functions
#### 1.basic
draw_heatmap(),draw_volcano(),draw_venn(),draw_boxplot(),draw_KM(),draw_venn(),risk_plot()
ggheat() is a function from the ggplot2 package that can be used to create heatmaps. It is still relatively immature and mainly used for aligning plots and collecting legends.
Something about ggheat():
https://mp.weixin.qq.com/s/WhsBf6QAhVXeXeScM59cSA
#### 2.Downstream Analysis of Gene Expression Array Data from GEO Database
geo_download(): Provide a GEO number and get back the expression matrix, clinical information table, and platform number used.
find_anno(): Look up the annotation of the array platform.
get_deg(): Provide the array expression matrix, grouping information, probe annotation and get back the differential analysis results.
multi_deg(): Differential analysis for multiple groups (up to 5).
If you want to do differential analysis and get the common figures in one step, you can use get_deg_all() and multi_deg_all(). This part mainly integrates and simplifies the differential analysis of GEOquery, Annoprobe, and limma.
quick_enrich(): Simple and intuitive enrichment analysis.
double_enrich(): Separate enrichment of up- and down-regulated genes, combined with plotting.
https://mp.weixin.qq.com/s/YQQoDsE5JaKpgFGlbEfQNg
https://mp.weixin.qq.com/s/j5IB_MQ0zeOCe1j_ahwtdQ
#### 3.Exploring Expression Matrices
make_tcga_group(): Quickly get the grouping according to the TCGA sample naming rules.
sam_filter(): Remove duplicate tumor samples in TCGA.
match_exp_cl(): Match TCGA expression matrix with clinical information.
trans_array(): Replace the row names of the matrix, such as replacing the probe names of the expression matrix with gene names.
trans_exp(): Convert TCGA or TCGA+GTeX data to gene IDs (old version, genecode v22 or v23)
trans_exp_new(): Convert TCGA or TCGA+GTeX data to gene IDs(new versions)
t_choose(): Do t-tests for individual genes in batches.
cor.full() and cor.one(): Calculate correlations between genes in batches.
#### 4.Survival Analysis and Visualization
point_cut(): Calculate the best cutoff point for survival analysis in batches.
surv_KM(): Do KM survival analysis in batches, supporting grouping with the best cutoff point.
surv_cox(): Do single factor Cox in batches, supporting grouping with the best cutoff point.
risk_plot(): Risk factor three-way linkage.
https://mp.weixin.qq.com/s/WYBhGxfGg6QFUPHFBashaA
exp_boxplot(): Draw T-N boxplot for the interested genes.
exp_surv(): Draw KM-plot for the interested genes.
box_surv(): Draw boxplot and KM-plot for the interested genes.
#### 5.Something about network graph
hypertest(): Do hypergeometric distribution test for mRNA and lncRNA in batches.
plcortest(): Do correlation test for mRNA and lncRNA in batches.
https://www.yuque.com/xiaojiewanglezenmofenshen/bsgk2d/dt0isp
interaction_to_edges(): Generate the connection table for the network graph based on the relationship table.
edges_to_nodes(): Generate the node table based on the connection table.
#### 6.Tricks
dumd(): Count how many values each column of the data frame has.
intersect_all(): Take the intersection of any number of vectors.
union_all(): Take the union of any number of vectors.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
2.4.3 |
rolling linux/jammy R-4.5 | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
rolling linux/noble R-4.5 | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
rolling source/ R- | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
latest linux/jammy R-4.5 | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
latest linux/noble R-4.5 | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
latest source/ R- | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
2026-04-26 source/ R- | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
2026-04-23 source/ R- | tinyarray_2.4.3.tar.gz |
1.8 MiB |
2.4.3 |
2025-04-20 source/ R- | tinyarray_2.4.3.tar.gz |
1.8 MiB |