Resume
|
I am a Professor and the Chair in Modern Statistics
and Data Science in the Department of Statistics and
Operations Research at Tel Aviv University, which I joined in 2007.
My research interests are in Statistical Learning theory and
methods.
Contact Info:
School of Mathematical Sciences,
Tel Aviv University, Tel Aviv, 69978 Israel
12345saharon@tauex.tau.ac.il54321 (remove numbers)
|
Teaching (last five years)
Topics in Statistical Genetics: Spring 2022; Spring 2024
Statistics of Big Data: Fall 2021/22; Fall 2023/24
Bootstrap and Resampling Methods: Spring 2021; Spring 2023
Statistical Learning: Fall 2020/21; Fall 2022/23
Statistics for Computer Science: Fall 2019/20; Fall 2020/21 (Course page on Moodle)
Introduction to Data Science: Spring 2019; Spring 2020; Spring 2023 (Course page on Moodle)
Group members
Current PhD+ students: Oren Yuval (PhD), Giora Simchoni (PhD), Rajesh
Karmakar (PhD, joint with Ruth
Heller)
Past: Amit
Moscovich-Eiger (Post-Doc, 2017-2018), Ronny Luss
(Post-Doc, 2009-2011), David Golan
(PhD 2014), Shlomi
Lifshits (PhD 2015, joint with Yaniv Assaf), Amichai Painsky
(PhD 2016, joint with Meir Feder),
Omer Weissbrod
(PhD 2017, joint with Dan Geiger of Technion), Assaf Rabinowicz
(PhD 2021), Aya Vituri (PhD 2023), Keren Levinstein Hallak (PhD 2024), Giora
Simchoni (MSc 2011), Adi Sarid (MSc 2011), Shachar Kaufman (MSc 2012),
Amichai Painsky (MSc 2012), Slava Borodovski (MSc
2012), Lital Bridavsky (MSc 2013), Roee Eilat (MSc 2016), Ayala Neudorfer
(MSc 2017), Eyal Fisher (MSc 2017), Dan Caspi (MSc 2017), Gal Oren (MSc
2017), Dana Kaner (MSc 2018), Aviv Navon (MSc 2018), Gal Naamani (MSc 2019),
Guy Smorodinsky (MSc 2021), Sigal Fleishman (MSc
2021), Nir Basanchik (MSc 2021), Noa Haas (MSc
2022), Michael Kozak (MSc 2022), Pavel Petrov (MSc 2022)
Publications (in bold – members of our research group)
Technical reports
2024
|
Keren
Levinstein-Hallak, Saharon Rosset. (2024).
Dating
ancient humans splits by estimating Poisson rates from mitochondrial DNA
parity samples.
BMC
Genomic Data 25
(1), 4
https://www.biorxiv.org/content/10.1101/2023.04.21.537838v1.full.pdf
|
|
Ali Pazokitoroudi,
Zhengtong Liu, Andy Dahl, Noah Zaitlen, Saharon Rosset, Sriram Sankararaman.
(2024).
A scalable
and robust variance components method reveals insights into the
architecture of gene-environment interactions underlying complex traits.
American Journal of Human Genetics 111, 1–19, July 11, 2024.
doi: 10.1016/j.ajhg.2024.05.015.
|
2023
|
Uri Rom,
Saharon Rosset. (2023).
Key-Specific
Structure in Mozart’s Music: A Peek into his Creative Process?
Empirical
Musicology Review,
16(2), 276–311, 2023.
|
|
Giora Simchoni,
Saharon Rosset.
(2023).
Integrating
Random Effects in Deep Neural Networks.
Journal
of Machine Learning Research, 24(156):1−57, 2023.
|
|
Ruth Heller, Abba
Krieger, Saharon Rosset. (2023).
Optimal multiple testing and design
in clinical trials.
Biometrics, 79(3): 1908−1919, 2023.
|
2022
|
Amit Moscovich, Saharon Rosset. (2022).
On the
cross-validation bias due to unsupervised preprocessing.
Journal
of the Royal Statistical Society, Series B, 84 (4),
1474-1502, September 2022.
|
|
Assaf Rabinowicz,
Saharon Rosset.
(2022).
Tree-Based
Models for Correlated Data.
Journal
of Machine Learning Research, 23(258):1−31, 2022.
|
|
Oren Yuval,
Saharon Rosset.
(2022).
Semi-Supervised
Empirical Risk Minimization: When can unlabeled
data improve prediction.
Electronic
Journal of Statistics, 16 (1), 1434-1460, 2022.
|
|
Keren
Levinstein-Hallak, Saharon Rosset. (2022).
Modeling SARS-CoV-2 substitution
processes: predicting the next variant.
Communications
Biology,
5, Article number: 285 (2022).
|
|
Saharon Rosset, Ruth Heller,
Amichai Painsky, Ehud Aharoni. (2022).
Optimal
and Maximin Procedures for Multiple Testing Problems.
Journal of the
Royal Statistical Society, Series B, 84 (4), 1105-1128, September 2022.
|
2021
|
Trevor Hastie, Andrea Montanari, Saharon Rosset, Ryan J. Tibshirani.
(2021).
Surprises in High-Dimensional Ridgeless Least Squares Interpolation.
Annals of
Statistics,
to appear (accepted 8/21). https://arxiv.org/abs/1903.08560
|
|
Giora Simchoni, Saharon
Rosset, (2021).
Using Random Effects to Account for
High-Cardinality Categorical Features and Repeated Measures in Deep Neural
Networks.
NeurIPS 2021
|
|
Ruth Heller, Saharon
Rosset. (2021).
Optimal Control of False Discovery
Criteria in the Two Group Model.
Journal of the
Royal Statistical Society, Series B, Volume 83, Issue 1, Pages 133-155.
|
2020
|
Assaf Rabinowicz, Saharon Rosset. (2020).
Cross-Validation for Correlated
Data.
Journal of the American Statistical Association, to appear (accepted
7/20).
|
|
Aviv Navon, Saharon Rosset. (2020).
Capturing
between-tasks covariance and similarities using multivariate linear mixed
models.
Electronic Journal of Statistics, Volume 14,
Number 2 (2020), 3821-3844.
|
|
Assaf Rabinowicz, Saharon Rosset. (2020).
Assessing Prediction Error at
Interpolation and Extrapolation Points.
Electronic Journal of Statistics, Volume 14, Number 1 (2020),
272-301.
|
|
Eyal Fisher, Regev Schweiger, Saharon Rosset. (2020).
Efficient
Construction of Test Inversion Confidence Intervals Using Quantile
Regression.
Journal
of Computational and Graphical Statistics, 29:1, 140-148.
|
|
Saharon Rosset, Ryan Tibshirani. (2018).
From
Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance
Penalties, and Prediction Error Estimation.
Journal of the American Statistical
Association,
115 (529), 138-151 (with discussion and rejoinder).
|
|
Shiran Abadi, Oren Avram, Saharon Rosset, Tal Pupko, Itay Mayrose.
(2020).
ModelTeller: Model Selection for Optimal Phylogenetic
Reconstruction Using Machine Learning.
Molecular Biology and Evolution, Volume 37, Issue 11, November 2020,
Pages 3338–3352.
|
2019
|
Ryan Tibshirani, Saharon Rosset. (2018).
Excess
Optimism: How Biased is the Apparent Error of an Estimator Tuned by SURE?
Journal of the American Statistical Association, 114:526, 697-712.
|
|
Omer Weissbrod, Shachar Kaufman, David Golan, Saharon
Rosset. (2019).
Modeling
High-Dimensional Data with Case-Control Sampling and Dependency Structures.
Journal of Machine Learning Research, 20 (108), 1-30.
|
|
Amichai Painsky, Saharon Rosset, Meir Feder. (2019).
Innovation
Representation of Stochastic Processes with Application to Causal
Inference.
IEEE
Transactions on Information Theory, 66 (2), 1136-1154.
|
|
Elior Rahmani, Regev Schweiger, Brooke Rhead, Lindsey Criswell, Lisa
Barcellos, Eleazar Eskin, Saharon Rosset, Sriram Sankararaman, Eran
Halperin. (2019).
Cell-type-specific
resolution epigenetics without the need for cell sorting or single-cell
biology.
Nature
Communications,
10 (1), 1-11.
|
2018
|
Keren Levinstein Hallak, Shay Tzur, Saharon Rosset. (2018).
Big data analysis of mitochondrial DNA
substitution models: A regression approach.
BMC
Genomics, 19:759, 2018.
|
|
Amichai Painsky, Saharon Rosset, Meir Feder. (2018).
Linear Independent
Component Analysis over Finite Fields: Algorithms and Bounds.
IEEE
Transactions on Signal Processing, 66(22), p5875–5886, Nov. 2018.
|
|
Blake Woodworth, Vitaly Feldman, Saharon Rosset, Nathan Srebro.
(2018).
The Everlasting Database: Statistical
Validity at a Fair Price.
NIPS 2018
|
|
Ayala Neudorfer, Saharon Rosset. (2018).
Predicting
the NCAA basketball tournament using isotonic least squares pairwise
comparison model.
Journal
of Quantitative Analysis in Sports, 14(4), pp. 173–183.
|
|
Omer Weissbrod, Jonathan Flint, Saharon Rosset.
(2018).
Estimating SNP-based
heritability and genetic correlation in case control studies directly and
with summary statistics.
American
Journal of Human Genetics, Volume 103, Issue 1, p89–99, 5 July
2018
|
2017
|
Regev Schweiger, Omer Weissbrod, Elior
Rahmani, Martina Müller-Nurasyid, Sonja Kunze, Christian Gieger, Melanie Waldenberger, Saharon Rosset, Eran Halperin.
(2017).
RL-SKAT: An Exact and
Efficient Score Test for Heritability and Set Tests.
Genetics, to
appear (accepted 8/17).
|
|
Regev Schweiger, Eyal Fisher, Elior Rahmani, Liat Shenhav, Saharon Rosset, Eran Halperin. (2017).
Using Stochastic Approximation Techniques
to Efficiently Construct Confidence Intervals for Heritability.
International
Conference on Research in Computational Molecular Biology (RECOMB2017), 241-256
|
|
Ronny Luss, Saharon Rosset. (2017).
Bounded
Isotonic Regression.
Electronic Journal of Statistics, Volume 11, No. 2, Nov. 2017.
|
|
Amichai Painsky, Saharon Rosset, Meir Feder. (2017).
Large Alphabet Source
Coding using Independent Component Analysis.
IEEE Transactions on Information Theory, Volume 63, Issue 10, Oct.
2017.
|
|
Omer Weissbrod, Elior Rahmani, Regev
Schweiger, Saharon Rosset, Eran Halperin. (2017).
Association testing of bisulfite-sequencing methylation data via a Laplace
approximation
Bioinformatics, 33 (14),
i325-i332, 2017
|
|
Amichai Painsky, Saharon Rosset. (2017).
Cross-Validated
Variable Selection in Tree-Based Methods Improves Predictive Performance.
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol
39, No. 11.
|
|
Shay Tzur, Saharon Rosset. (2017).
Strictly
conserved tri-nucleotide motif ‘CAT’ is associated with TAS DNA protein
binding sites in human mitochondrial DNA control region.
Mitochondrial DNA, Vol 28, Issue 2, 250–253 (2017).
|
2016
|
Sagi
Shporer, Benny Chor, Saharon Rosset, David
Horn. (2016).
Inversion
symmetry of DNA k-mer counts:
validity and deviations.
BMC Genomics, 17:696.
|
|
Yaron
Granot, Omri Tal, Saharon Rosset, Karl Skorecki.
(2016).
On
the Apportionment of Population Structure.
PLOS One, DOI:10.1371/journal.pone.0160413
August 9, 2016.
|
|
Omer Weissbrod, Dan Geiger, Saharon
Rosset. (2016).
Multikernel linear mixed models for complex phenotype
prediction.
Genome Research, 26 (7), 969–979 (July 2016).
|
|
Regev Schweiger, Shachar Kaufman, Reijo Laaksonen, Marcus E. Kleber,
Winfried März, Eleazar Eskin, Saharon Rosset, Eran Halperin. (2016).
Fast
and Accurate Construction of Confidence Intervals for Heritability.
American Journal of Human Genetics, Vol. 98, No. 6, p1181–1192, 2
June 2016.
|
|
Amichai Painsky, Saharon Rosset, Meir Feder. (2016).
Generalized Independent Component
Analysis Over Finite Alphabets.
IEEE Transactions on Information Theory, Vol. 62, No. 2, Feb. 2016.
|
|
Amichai Painsky, Saharon Rosset. (2016).
Isotonic
Modeling with Non-differentiable Loss Functions
with Application to Lasso Regularization.
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.
38, No. 2, Feb. 2016.
|
2014
|
David Golan, Eric Lander, Saharon Rosset. (2014).
Measuring
Missing Heritability: Inferring the Contribution of Common Variants.
Proceedings of the National Academy of Sciences, Vol. 111, No. 49,
E5272–E5281, 2014.
|
|
David Golan, Saharon Rosset. (2014).
Effective
Genetic Risk Prediction Using Mixed Models.
American Journal of Human Genetics, Volume 95, Issue 4, p383–393, 2
October 2014.
|
|
Shachar Kaufman, Saharon Rosset. (2014).
When Does More
Regularization Imply Fewer Degrees of Freedom? Sufficient Conditions and
Counterexamples.
Biometrika, 101 (4): 771–784, Dec. 2014.
|
|
Saharon Rosset, Ehud Aharoni, Hani Neuvirth. (2014).
Novel Statistical Tools
for Management of Public Databases Facilitate Community-Wide Replicability
and Control of False Discovery.
Genetic Epidemiology, Vol. 38, Issue 5, pages 477–481, July 2014.
|
|
Shachar Kaufman, Saharon Rosset. (2014).
Exploiting
Population Samples to Enhance Genome Wide Association Studies of Disease.
Genetics, Vol. 197, 337–349, May 2014.
(Press
release from Genetics Society of America)
(Blog
post from GSA website)
|
|
Ronny
Luss, Saharon Rosset. (2014).
Generalized
Isotonic Regression.
Journal of Computational and Graphical Statistics, Vol. 23, No. 1,
192-210.
|
|
Ehud Aharoni, Saharon Rosset. (2014).
Generalized
Alpha Investing: Definitions, Optimality Results, and Application to Public
Databases.
Journal of Royal Statistical Society, Series B, Volume 76, Issue 4,
771–794, September 2014.
|
|
Amichai Painsky, Saharon Rosset. (2014).
Optimal
Set Cover Formulation for Exclusive Row Biclustering
of Gene Expression.
Journal of Computer Science and Technology (JCST), 29(3): 423–435, May 2014.
(long version of SDM-2012 paper)
|
2013
|
David Golan, Saharon Rosset. (2013).
Statistical
Modeling of coverage in High-Throughput Data.
Chapter 4 in Deep Sequencing Data Analysis, N. Shomron (ed.),
Springer, 2013.
|
|
Saharon Rosset. (2013).
Practical
Sparse Modeling: an Overview and Two Examples
from Genetics.
Chapter 3 in Practical Applications of Sparse Modeling,
I. Rish et al. (eds.), MIT Press, 2013, to appear (accepted 5/13).
|
2012
|
Amichai Painsky, Saharon Rosset. (2012).
Exclusive
Row Biclustering for Gene Expression Using a
Combinatorial Auction Approach.
IEEE Conference on Data Mining (ICDM-2012).
|
|
Shachar Kaufman, Saharon Rosset, Claudia Perlich, Ori Stitelman. (2012).
Leakage in Data Mining:
Formulation, Detection, and Avoidance.
ACM Transactions on Knowledge Discovery from Data, Vol. 6 No.
4, December 2012.
(long version of KDD-2011 paper by same name)
|
|
David Golan, Saharon Rosset. (2012).
Comment
on “The Predictive Capacity of Personal Genome Sequencing”.
Science Translational Medicine 4, 135le4 (2012).
|
|
David Golan, Yaniv Erlich, Saharon Rosset. (2012).
Weighted
Pooling - Practical and Cost Effective Techniques
for Pooled High Throughput Sequencing.
Bioinformatics, Vol. 28, pages i197–i206 (Proceedings ISMB 2012).
|
|
Doron M. Behar, Mannis van Oven, Saharon Rosset, Mait Metspalu, Eva-Liis Loogväli,
Nuno M. Silva, Toomas Kivisild, Antonio Torroni,
Richard Villems. (2012).
A
“Copernican” Reassessment of the Human Mitochondrial DNA Tree from its Root.
The American Journal of Human Genetics, Volume 90, Issue 4, 675-684,
6 April 2012.
|
|
Shay Tzur, Saharon Rosset, Walter Wasser, Doron Behar, Karl Skorecki. (2012).
APOL1
allelic variants are associated with lower age of dialysis initiation and
thereby increased dialysis vintage in African and Hispanic Americans with
non-diabetic end-stage kidney disease.
Nephrology, Dialysis and Transplantation, 27(4):1498-505, 2012.
|
|
Melissa Gymrek, David Golan, Saharon Rosset,
Yaniv Erlich. (2012).
lobSTR: A short tandem repeat profiler for personal
genomes.
Genome Research, 22(6):1154-62, June
2012, doi: 10.1101/gr.135780.111.
|
|
Giles Hooker, Saharon Rosset. (2012).
Prediction-Based
Regularization Using Data Augmented Regression.
Statistics and Computing, Volume 22, Issue 1, pp 237-249.
|
|
Ronny Luss, Saharon Rosset and Moni Shahar. (2012).
Efficient
Regularized Isotonic Regression with Application to Gene-Gene Interaction
Search.
Annals of Applied Statistics, Volume 6, Number 1, pp 253-283.
Matlab
code implementing the IRP algorithm.
|
2011
|
Sijian Wang, Bin Nan, Saharon Rosset and
Ji Zhu. (2011).
Random
Lasso.
Annals of Applied Statistics, Vol.
5, No. 1, 468-485.
|
|
Ehud Aharoni, Hani Neuvirth and Saharon Rosset. (2011).
The
Quality Preserving Database: A Computational Framework for Encouraging
Collaboration, Enhancing Power and Controlling False Discovery.
IEEE Transactions on Computational Biology and Bioinformatics,
Sep-Oct;8(5):1431-7.
|
|
Saharon Rosset^a,
Shay Tzur^a, Walter Wasser, Doron Behar
and Karl Skorecki. (2011).
The
population genetics of chronic kidney disease: insights from the MYH9–APOL1
locus.
Nature Reviews Nephrology, 7, 313-326 (June 2011),
doi:10.1038/nrneph.2011.52.
(^a equal contribution)
|
|
David Golan, Saharon Rosset. (2011).
Accurate
Estimation of Heritability in Genome Wide Studies using Random Effects
Models.
Bioinformatics (Proceedings of ISMB-ECCB11), Volume 27, Issue 13, pp.
i317-i323.
|
|
Shachar Kaufman, Saharon Rosset and Claudia
Perlich. (2011).
Leakage in Data Mining:
Formulation, Detection, and Avoidance.
KDD-2011.
Best paper award winner at KDD-2011.
|
|
Doron M. Behar, Einat Kedem, Saharon Rosset, Yonas Haileselassie,
Shay Tzur, Zipi Kra-Oz,
Walter G. Wasser, Yotam Shenhar, Eduardo Shahar,
Gamal Hassoun, Carcom Maor, Dawit Wolday, Shimon Pollack, Karl Skorecki.
(2011).
Absence
of APOL1 Risk Variants Protects against HIV-Associated Nephropathy in the
Ethiopian Population.
American Journal of Nephrology, 2011, 34:452-459.
|
2010
|
Doron M. Behar^a, Saharon Rosset^a, Shay Tzur^a, Sara Selig, Guennady
Yudkovsky, Sivan Bercovici, Jeffrey B. Kopp,
Cheryl A. Winkler, George W. Nelson, Walter G. Wasser and Karl Skorecki. (2010).
African
ancestry allelic variation at the MYH9 gene contributes to increased
susceptibility to non-diabetic end-stage kidney disease in Hispanic
Americans.
Human
Molecular Genetics, Vol. 19, No. 9, 1816–1827. doi:10.1093/hmg/ddq040
(^a equal contribution)
|
|
Doron M. Behar, Bayazit Yunusbayev, Mait Metspalu, Ene Metspalu, Saharon
Rosset, Jüri Parik, Siiri Rootsi,
Gyaneshwer Chaubey, Ildus Kutuev,
Guennady Yudkovsky,
Elza K. Khusnutdinova, Oleg Balanovsky,
Ornella Semino, Luisa Pereira, David Comas, David Gurwitz, Batsheva
Bonne-Tamir, Tudor Parfitt, Michael F. Hammer, Karl Skorecki
and Richard Villems. (2010).
The
genome-wide structure of the Jewish people.
Nature, Vol. 466 238–242. doi:10.1038/nature09103
|
|
Saharon Rosset, Claudia Perlich, Grzegorz Swirszcz, Yan Liu and Prem
Melville (2010).
Medical
Data Mining: Lessons from Winning Two Competitions.
Data Mining and Knowledge Discovery Journal, Vol. 20, Num. 3,
439–468.
|
|
Osnat Ravid-Amir and Saharon
Rosset. (2010).
Maximum
Likelihood Estimation of Locus-Specific Mutation Rates in Y-chromosome
Short Tandem Repeats.
Bioinformatics (proceedings of ECCB10) 26(18): i440-i445.
|
|
Shay Tzur^a, Saharon Rosset^a,
Revital Shemer, Guennady Yudkovsky,
Sara Selig, Ayele Tarekegn, Endashaw Bekele, Neil
Bradman, Walter G. Wasser, Doron M. Behar and Karl Skorecki. (2010).
Missense
mutations in the APOL1 gene are highly associated with end stage kidney
disease risk previously attributed to the MYH9 gene.
Human Genetics, Vol. 128, Issue 3, 345–350.
(^a equal contribution)
|
|
Ronny Luss, Saharon Rosset and Moni Shahar. (2010).
Decomposing
Isotonic Regression for Efficiently Solving Large Problems.
NIPS 2010.
|
|
Richard Lawrence, Claudia Perlich, Saharon Rosset, et al. (10
authors). (2010).
Operations
Research Improves Sales Force Productivity at IBM.
INFORMS Interfaces, Vol. 40, No. 1, January-February 2010, pp.
33-46.
|
2009
|
Aurelie Lozano, Naoki Abe, Yan Liu and Saharon Rosset. (2009).
Grouped
graphical Granger modeling for gene expression
regulatory networks discovery.
Bioinformatics 25(12):i110-i118 (proceedings of ISMB09);
doi:10.1093/bioinformatics/btp199.
|
|
Aurelie Lozano, Naoki Abe, Yan Liu and Saharon Rosset. (2009).
Grouped graphical Granger modeling methods for temporal causal modeling.
KDD-09.
|
|
Michael F. Hammer, Doron M. Behar, Tatiana M.
Karafet, Fernando L. Mendez, Brian Hallmark, Tamar Erez, Lev A.
Zhivotovsky, Saharon Rosset, Karl Skorecki. (2009).
Extended
Y chromosome haplotypes resolve multiple and unique lineages of the Jewish
priesthood.
Human Genetics, Volume 126,
Number 5 / November, 2009 DOI 10.1007/s00439-009-0727-5
|
|
Ji Zhu, Hui Zou, Saharon Rosset and Trevor Hastie. (2009).
Multi-class
AdaBoost.
Statistics and its Interface, volume 2, issue 3.
|
|
Saharon Rosset. (2009).
Bi-Level
Path Following for Cross Validated Solution of Kernel Quantile Regression.
Journal of Machine Learning Research, 10(Nov):2473−2505, 2009
(short version from
ICML-08).
|
2008
|
Saharon Rosset, Spencer Wells,
David Soria-Hernanz, Chris Tyler-Smith, Ajay Royyuru, Doron Behar. (2008). Maximum
Likelihood Estimation of Site-Specific Mutation Rates in Human
Mitochondrial DNA from Partial Phylogenetic Classification. Genetics 180:
1511–1524. DOI: 10.1534/genetics.108.091116.
|
|
Claudia Perlich,
Prem Melville, Yan Liu, Grzegorz Swirszcz, Richard Lawrence, Saharon
Rosset. (2008). Breast Cancer
Identification: KDD CUP Winner's Report. SIGKDD
Explorations, vol. 10, issue 2, 39-42
|
|
Prem Melville,
Saharon Rosset, Richard Lawrence. (2008). Customer
Targeting Models Using Actively-Selected Web
Content. KDD-08.
|
|
Doron M Behar,
Ene Metspalu, Toomas Kivisild, Saharon Rosset,
Shay Tzur, Yarin Hadid, Guennady Yudkovsky, Dror Rosengarten, Luisa Pereira, Antonio
Amorim, Ildus Kutuev, David Gurwitz, Batsheva
Bonne-Tamir, Richard Villems and Karl Skorecki. (2008). Counting the
Founders: The Matrilineal Genetic Ancestry of the Jewish Diaspora. PLoS ONE 3(4): e2062. DOI:10.1371/journal.pone.0002062.
|
|
Doron M Behar, Richard Villems,
Himla Soodyall, Jason
Blue-Smith,
Luisa Pereira, Ene Metspalu, Rosaria Scozzari,
Heeran Makkan, Shay Tzur, David Comas, Jaume Bertranpetit, Lluis Quintana-Murci,
Chris Tyler-Smith, R. Spencer
Wells and
Saharon
Rosset.
(2008). The Dawn of
Human Matrilineal Diversity. American Journal of Human Genetics, 82(5):1130-1140. DOI:10.1016/j.ajhg.2008.04.002.
|
2007
|
Saharon Rosset, Claudia Perlich and Yan Liu. (2007). Making
the Most of Your Data: KDD Cup 2007 "How Many Ratings" Winner's
Report. SIGKDD Explorations, vol. 9, issue
2.
|
|
Claudia Perlich, Saharon Rosset, Rick Lawrence, Bianca
Zadrozny. (2007). High
Quantile Modeling for Customer Wallet Estimation
with Other Applications. KDD-07.
|
|
Saharon Rosset, Grzegorz Swirszcz, Nathan
Srebro, Ji Zhu. (2007). l1
Regularization in Infinite Dimensional Feature Spaces. COLT-07.
|
|
Doron M Behar, Saharon Rosset, Jason Blue-Smith,
Oleg Balanovsky, Shay Tzur, David Comas, R. John Mitchell,
Lluis Quintana-Murci, Chris
Tyler-Smith,
R. Spencer Wells and The Genographic
Consortium. (2007). The
Genographic Project Public Participation
Mitochondrial DNA Database. PLoS Genetics Vol. 3, No. 6, e104.
|
|
Saharon Rosset, Ji Zhu. (2007). Piecewise
Linear Regularized Solution Paths. Annals of Statistics, 35(3). (Earlier longer
versions 1,
2).
|
|
Claudia Perlich, Saharon Rosset. (2007). Identifying Bundles of Product
Options using Mutual Information Clustering. SIAM Data Mining 07 (SDM-07).
|
|
Saharon Rosset. (2007). Efficient
Inference on Known Phylogenetic Trees Using Poisson Regression. Proc. of the 5th European Conference on Computational Biology
(ECCB-2006), Bioinformatics 23: e142-e147.
|
|
Saharon Rosset, Claudia Perlich,
Bianca
Zadrozny. (2007). Ranking-Based
Evaluation of Regression Models. Knowledge and Information Systems, Vol. 12, No. 3. (short
version from ICDM-05)
|
2006
|
Srujana Merugu, Saharon Rosset, Claudia Perlich. (2006). A New Multi-View Regression
Method with an Application to Customer Wallet Estimation. KDD-06.
|
|
Saharon Rosset, Rick
Lawrence. (2006). Data
Enhanced Predictive Modeling for Sales Targeting. SIAM Data Mining 06 (SDM-06).
|
2005
|
Rob Tibshirani, Michael Saunders, Saharon Rosset, Ji Zhu,
Keith Knight. (2005). Sparsity
and Smoothness via the Fused Lasso. Journal of the Royal
Statistical Society Series B, Vol. 67 No. 1.
|
|
Saharon Rosset. (2005). Robust
Boosting and Its Relation to Bagging. KDD-05.
|
|
Sofus Macskassy, Foster Provost, Saharon Rosset. (2005). ROC
Confidence Bands: An Empirical Evaluation. ICML-05.
|
|
Saharon Rosset, Claudia Perlich,
Bianca
Zadrozny, Srujana Merugu, Sholom Weiss,
Rick Lawrence. (2005). Customer
Wallet Estimation. NYU workshop on CRM and Data Mining.
|
2004
|
Saharon Rosset, Ji Zhu,
Trevor Hastie. (2004). Boosting as a
Regularized Path to A Maximum Margin Classifier. Journal of Machine
Learning Research, 5(Aug):941-973.
|
|
Saharon Rosset, Ji Zhu. (2004). Discussion of "Least Angle Regression" by Efron et al.
Annals
of Statistics, April 2004.
|
|
Jerry Friedman, Trevor Hastie, Saharon Rosset, Rob Tibshirani, Ji Zhu. (2004). Discussion of three boosting
papers. Annals
of Statistics, February 2004.
|
|
Saharon Rosset, Ji Zhu. (2004). Corrected Proof of the Result of "A Prediction Error Property
of the Lasso" by Huang (2003).
Australia and New Zealand Journal of Statistics, 46(3):505-510.
|
|
Trevor Hastie, Saharon Rosset,
Rob Tibshirani,
Ji Zhu. (2004). The
Entire Regularization Path for the Support Vector Machine. Journal of Machine
Learning Research, 5(Oct):1391-1415. R package.
(short
version presented at NIPS 2004)
|
|
Saharon Rosset, Ji Zhu, Hui Zou, Trevor Hastie.
(2004). A
Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning. NIPS 2004.
|
|
Saharon Rosset. (2004). Tracking
Curved Regularized Optimization Solution Paths. NIPS 2004.
|
|
Saharon Rosset. (2004). Model
Selection via the AUC. ICML-04.
|
2003
|
Saharon Rosset, Ji Zhu,
Trevor Hastie. (2003). Margin
Maximizing Loss Functions. NIPS 2003.
|
|
Ji Zhu, Saharon
Rosset, Trevor Hastie, Rob Tibshirani. (2003). 1-norm Support
Vector Machines. NIPS 2003.
|
|
Saharon Rosset, Einat Neumann. (2003). Integrating
Customer Value Considerations into Predictive Modeling. ICDM-03.
|
|
Saharon Rosset, Einat Neumann, Uri Eick, Nurit Vatnik. (2003). Lifetime
Value Models for Decision Support. Data Mining and Knowledge
Discovery Journal, Vol. 7, 321-339.
|
2002 and earlier
|
Saharon Rosset, Einat Neumann, Uri Eick,
Nurit Vatnik, Shuki Idan. (2002). Lifetime Value Modeling and Its Use for Customer Retention Planning.
KDD-02.
Winner of best application paper award.
|
|
Saharon Rosset, Eran Segal. (2002). Boosting
Density Estimation. NIPS-2002.
|
|
Saharon Rosset. (2002). Value Weighted
Analysis: Building Prediction Models for Data with Observation Weights. Uncompleted technical report. Draft available.
|
|
Saharon Rosset, Einat Neumann, Uri Eick,
Nurit Vatnik, Shuki Idan. (2001). Evaluation
of Prediction Models for Campaign Planning. KDD-01.
|
|
Saharon Rosset, Aron Inger. (2000). KDD-Cup
99: Knowledge Discovery In a Charitable
Organization's Donor Database. SIGKDD Explorations 1(2): 85-90 (2000)
|
|
Saharon Rosset, Uzi Murad, Einat Neumann, Yizhak Idan, and Gadi Pinkas. (1999). Discovery of
Fraud Rules for Telecommunications: Challenges and Solutions, KDD-99: 409-413.
|
|
Saharon Rosset. (1998). Ranking: Methods for Flexible Evaluation and
Efficient Comparison of Classification Performance. KDD-98.
|
|