Research Results

Machine Learning

  • novel semisupervised learning algorithms, including semisupervised class discovery, semisupervised learning with instance-level constraints, and semisupervised EM;
  • deterministic annealing techniques for clustering, tree-structured clustering, supervised classification, and piecewise regression;
  • maximum entropy statistical inference for classification, general inference, and collaborative filtering;
  • ensemble classification techniques, including transductive distributed classification, maximum entropy-improved iterative scaling ensemble classification, and critic-driven ensembles;
  • clustering/mixture modeling in high dimensions: parsimonious mixtures with integrated model order and feature selection (and application to document clustering);
  • improved mean-field annealing techniques for image segmentation;
  • data mining for network digesting and intrusion detection


Source Coding and Source-Channel Coding

  • novel joint source-channel decoding techniques, including MMSE decoders, MAP decoders for varaible-length encoded data, source-channel decoding for predictive coding systems, and joint source-channel decoding of digital images.
  • optimal encoding for variable rate source coding with dependent data streams;
  • entropy-constrained tree-structured vector quantization;
  • deterministic annealing for source-channel, tree-structured, and trellis quantization

Citations

  • M. Park and D. Miller£¬ ``Joint source-channel decoding for variable-length encoded data by exact and approximate {MAP} sequence estimation'', IEEE Trans. on Communications, vol. 48, no. 1, 2000, pp. 1-6; cited 73 times measured by Googlescholar.
  • D. J. Miller and H. S. Uyar, ``A mixture of experts classifier with learning based on both labelled and unlabelled data'', Neural Information Processing Systems, 1997, pp. 571-577; early paper on semisupervised learning via Expectation-Maximization, cited 102 times on Googlescholar and cited by Nigam, Mccallum, Thrun, Mitchell's ``Text classification from labeled and unlabeled documents using EM'', which has been cited 736 times on Googlescholar.
  • A subset of 15 papers has been cited more than 450 times, measured by Googlescholar.

Research Sponsorship

  • Principal Investigator on two National Science Foundation grants in machine learning area
  • Co-investigator on 3-year NSF Collaborative research grant on protecting network infrastructure, 2003 - 2006
  • Co-investigator on NIH bioinformatics proposal in single nucleotide polymorphism (SNP) biomarker discovery
  • Sponsored research support from AFRL Phase II SBIR in video tracking, 2004-2006
  • Sponsored research support from ONR in maximum entropy data and decision fusion, 2004-2006
  • Consultant on NASA, NIH, and ONR projects

Awards

  • NSF CAREER Award in 1996.
  • Best Presentation Award, Monday poster session, at the International Joint Conference on Neural Networks, 1999, held in Washington, D.C.
  • Best student paper finalist at ICASSP'2005 for paper on semisupervised learning.