tgstat - Amos Tanay's Group High Performance Statistical Utilities
A collection of high-performance utilities for computing distance, correlation, autocorrelation, clustering and other tasks. Contains the graph clustering algorithm described in "MetaCell: analysis of single-cell RNA-seq data using K-nn graph partitions" (Yael Baran, Akhiad Bercovich, Arnau Sebe-Pedros, Yaniv Lubling, Amir Giladi, Elad Chomsky, Zohar Meir, Michael Hoichman, Aviezer Lifshitz & Amos Tanay, 2019 <doi:10.1186/s13059-019-1812-2>).
Last updated 6 months ago
algorithms-implemented correlation knn statistics openblas cpp
6.06 score 8 stars 1 dependent 24 scripts 436 downloads

misha - Toolkit for Analysis of Genomic Data
A toolkit for analysis of genomic data. The 'misha' package implements an efficient data structure for storing genomic data, and provides a set of functions for data extraction, manipulation and analysis. Some of the 2D genome algorithms were described in Yaffe and Tanay (2011) <doi:10.1038/ng.947>.
Last updated 1 day ago
genomic-data-analysis cpp
5.81 score 4 stars 83 downloads

tglkmeans - Efficient Implementation of K-Means++ Algorithm
Efficient implementation of the K-Means++ algorithm. For more information see (1) "k-means++: The Advantages of Careful Seeding" by David Arthur and Sergei Vassilvitskii (2007), Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, pp. 1027-1035, and (2) "The Effectiveness of Lloyd-Type Methods for the k-Means Problem" by Rafail Ostrovsky, Yuval Rabani, Leonard J. Schulman and Chaitanya Swamy <doi:10.1145/2395116.2395117>.
Last updated 2 months ago
algorithms-implemented kmeans cpp
5.35 score 7 stars 16 scripts 445 downloads

naryn - Native Access Medical Record Retriever for High Yield Analytics
A toolkit for medical records data analysis. The 'naryn' package implements an efficient data structure for storing medical records, and provides a set of functions for data extraction, manipulation and analysis.
Last updated 8 days ago
data-analysis medical-records cpp
5.08 score 3 stars 4 scripts 242 downloads

slanter - Slanted Matrices and Ordered Clustering
Slanted matrices and ordered clustering for better visualization of similarity data.
Last updated 4 years ago
4.65 score 3 stars 1 dependent 1 script 169 downloads