Short Bio.

Saket joined IBM T. J. Watson Research Center in 2016 as a Research Staff Member. Until December 2015, he was a Researcher at IBM Research Australia. Saket received a PhD degree in Computer Science from EPFL, Switzerland under Prof. Karl Aberer in March, 2013. At EPFL he was associated with the Distributed Information Systems Laboratory. Before that he received a Master's (M.Tech.) degree in Electrical Engineering from IIT Bombay in 2006. Prior to joining EPFL, he spent one year working for an Indian startup.


Data Mining, Database Systems, Statistical Modelling, Distributed Computing.


Saket Sathe
IBM T. J. Watson Research Center,
Yorktown Heights, NY 10598


Linkedin: saket.sathe
Facebook: saket.sathe
  1. Saket Sathe, Charu Aggarwal. LODES: Local Density Meets Spectral Outlier Detection. SDM 2016.
  2. Sue Ann Chen, Arun Vishwanath, Saket Sathe, Shivkumar Kalayanaraman. Shedding Light on the Performance of Solar Panels: A Data-Driven View. SIGKDD Explorations.
  3. Xinyue Liu, Charu Aggarwal, Yu-Feng Li, Xiangnan Kong, Xinyuan Sun, Saket Sathe. Kernelized Matrix Factorization for Collaborative Filtering. SDM 2016.
  4. Charu Aggarwal, Saket Sathe. Theoretical Foundations and Algorithms for Outlier Ensembles. SIGKDD Explorations.
  5. Tian Guo, Saket Sathe, Karl Aberer. Fast Distributed Correlation Discovery Over Streaming Time-Series Data. CIKM 2015.
  6. Oshini Goonetilleke, Saket Sathe, Timos Sellis, Xiuzhen Zhang. Microblogging Queries on Graph Databases: An Introspection. GRADES 2015 Workshop (co-located with SIGMOD 2015).
  7. Saket Sathe, Timos Sellis, Karl Aberer. On Crowdsensed Data Acquisition using Multi-Dimensional Point Processes. ICDE Workshops, 2015.
  8. Nguyen Quoc Viet Hung, Saket Sathe, Duong Chi Thang, Karl Aberer. Towards Enabling Probabilistic Databases for Participatory Sensing. CollaborateCom 2014. (acceptance: 20%) Best Paper Runner-up
  9. Oshini Goonetilleke, Timos Sellis, Xiuzhen Zhang, Saket Sathe. Twitter Analytics: A Big Data Management Perspective. SIGKDD Explorations 16(1): 11-20 (2014).
  10. Saket Sathe, Roie Melamed, Peter Bak, Shivkumar Kalyanaraman. Enabling Location-Based Services 2.0: Challenges and Opportunities. IEEE MDM, Brisbane, 2014. (vision)
  11. Hoyoung Jeung, Hua Lu, Saket Sathe, Man Lung Yiu. Managing Evolving Uncertainty in Trajectory Databases. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2014.
  12. Saket Sathe, Arthur Oviedo, Dipanjan Chakraborty, Karl Aberer. EnviroMeter: A Platform for Querying Community-Sensed Data. VLDB, Trento, 2013. (demo)
  13. Saket Sathe, Karl Aberer. AFFINITY: Efficiently Querying Statistical Measures on Time-Series Data. ICDE, Brisbane, 2013. [talk]
  14. Saket Sathe, Thanasis G. Papaioannou, Hoyoung Jeung, Karl Aberer. A Survey of Model-Based Sensor Data Acquisition and Management. Managing and Mining Sensor Data, ed. Charu Aggarwal, Springer Publishers, 2013. (book chapter)
  15. Sebastian Cartier, Saket Sathe, Dipanjan Chakraborty, Karl Aberer. ConDense: Managing Data in Community-driven Mobile Geosensor Networks. IEEE SECON, Seoul, 2012. [talk] (Acceptance: ~19%)
  16. Saket Sathe, Sebastian Cartier, Dipanjan Chakraborty, Karl Aberer. Effectively Modeling Data from Large-area Community Sensor Networks. IPSN, Beijing, 2012. (poster paper)
  17. Saket Sathe, Hoyoung Jeung, Karl Aberer. Creating Probabilistic Databases from Imprecise Time-Series Data. ICDE, Hannover, 2011. [talk] [teaser]
  18. K. Aberer, S. Sathe, D. Chakraborty, A. Martinoli, G. Barrenetxea, B. Faltings, L. Thiele. OpenSense: Open Community Driven Sensing of Environment. ACM SIGSPATIAL IWGS, 2010, San Jose, (co-located with GIS 2010). [talk]
  19. H. Jeung, S. Sarni, I. Paparrizos, S. Sathe, K. Aberer, N. Dawes, T. Papaioannou, M. Lehning. Effective Metadata Management in Federated Sensor Networks. IEEE SUTC, Newport Beach, 2010. (invited paper) [talk] [bib]
  20. E. Ioannou, S. Sathe, N. Bonvin, A. Jain, S. Bondalapati, G. Skobeltsyn, C. Niederee, Z. Miklos. Entity Search with NECESSITY. WebDB, 2009. (demo) (co-located with PODS/SIGMOD).
  21. Saket Sathe and Uday Desai. Cell-phone Based Microcredit Risk Assessment using Fuzzy Clustering. ICTD, 2006, Berkeley. [bib]
  22. Saket Sathe. A Novel Bayesian Classifier using Copula Functions. Unpublished Manuscript. [arXiv]
  23. Saket Sathe, Rajendra Lagu, Uday Desai. Investigating Efficiency of the Indian Equities Market with Application to Risk Management. Journal of Applied Finance Vol. 12, No. 5, pp. 48-68, May 2006. [conference version]

Technical Reports.

PhD Thesis.

  • Program Committee:
    • IEEE Conference on Mobile Data Management (MDM) 2014, 2015
    • ACM International Conference on Information and Knowledge Mangement (CIKM) 2015, 2016
    • Australasian Database Conference (ADC), 2015
    • IEEE International Conference on Data Mining (ICDM), 2015
    • SIAM International Conference on Data Mining (SDM), 2016
    • ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2016
  • Web chair:
    • IEEE Mobile Data Management (MDM) 2012
  • Invited Journal Reviewer:
    • International Journal on Very Large Databases (VLDB Journal)
    • Data Mining and Knowledge Discovery (DMKD)
    • ACM Transactions on Knowledge Discovery from Data (TKDD)
  • Vasanth Chandramouli (EPFL). Semester Project → First Employment : CERN.
  • Anshul Jain (EPFL). Internship → Masters : CMU.
  • Arthur Oviedo (EPFL). Semester Project → First Employment : Google.
  • Sebastian Cartier (EPFL). Masters Thesis → PhD : ETHZ.
  • Tian Guo (EPFL). PhD Student. Ongoing.
  • Oshini Goonetilleke (RMIT University). PhD Student. Co-supervised with Timos Sellis.
  • OKKAM Project: It aims at enabling the Web of Entities, namely a virtual space where any collection of data and information about any type of entities (e.g. people, locations, organizations, events, products, etc.) published on the Web can be integrated into a single virtual, decentralized, open knowledge base (like the Web did for hypertexts, read here what Tim Berners-Lee says on this parallel).
  • Python Web Graph Generator: Generates synthetic powerlaw random graphs containing millions of nodes in a few minutes on a desktop machine or laptop. Released under the Apache License, Version 2.0 (FAQ) (3000+ downloads).
  • OpenSense: The project addresses key research challenges in the domain of information and communication systems related to community-based sensing using wireless sensor network technology in the context of air pollution monitoring. This project will result in open technology that allows integrating diverse sensors, including mobile sensors, into a single environmental model. The information processing techniques we develop will provide important insights to enable other Nano-Tera application domains dealing with monitoring complex events.
  • Swiss Experiment (SwissEx): A platform to enable real-time environmental experiments through wireless sensor networks and a common, modern, generic cyber-infrastructure.