Publications |
Presentations |
Thesis |
Tech Reports / Unpublished Works
Selected Publications
- J. Noel, S. Sanner, K.-N. Tran, P. Christen, L. Xie, E. Bonilla, E. Abbasnejad, N. Della Penna (2012).
New Objectives for Social Collaborative Filtering.
In Proceedings of the 21st International Conference on the World Wide Web (WWW-12).
Lyon, France.
[pdf]
[slides (pdf)]
[code]
- K.-W. Lim, S. Sanner, S. Guo (2012). On the Mathematical Relationship between Expected n-call@k and the Relevance vs. Diversity Trade-off. In Proceedings of the 35th Annual ACM SIG Information Retrieval Conference (SIGIR-12), to appear. Portland, USA.
- S. Sanner, E. Abbasnejad (2012).
Symbolic Variable Elimination for Discrete and Continuous Graphical Models.
In Proceedings of the 26th Conference on Artificial Intelligence (AAAI-12), to appear.
Toronto, Canada.
[pdf]
[code]
- Z. Zamani, S. Sanner, C. Fang (2012).
Symbolic Dynamic Programming for Continuous State and Action MDPs.
In Proceedings of the 26th Conference on Artificial Intelligence (AAAI-12), to appear.
Toronto, Canada.
[pdf]
[code]
- A. Coles, A. Coles, A. Garcia Olaya, S. Jimenez, C. Linares Lopez, S. Sanner, S. Yoon (2012).
A Survey of the Seventh International Planning Competition.
AI Magazine, in press.
[pdf (pre-print)]
- S. Sanner, M. Hutter, editors (2012).
Proceedings of the 9th European Workshop on Reinforcement Learning (EWRL).
Springer Verlag, Lecture Notes in Computer Science, Volume 7188, in press.
- S. Sanner, S. Guo, T. Graepel, S. Kharazmi, S. Karimi (2011).
Diverse Retrieval via Greedy Optimization of Expected 1-call@k
in a Latent Subtopic Relevance Model.
In Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM-11).
Glasgow, UK.
[pdf]
[code]
- S. Sanner, K. V. Delgado, L. N. de Barros (2011).
Symbolic Dynamic Programming for Discrete and Continuous State MDPs.
In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI-11).
Barcelona, Spain.
[pdf]
[slides (pdf)]
[code]
- M. Robards, P. Sunehag, S. Sanner, B. Marthi (2011).
Sparse Kernel-SARSA(lambda) with an Eligibility Trace.
In Proceedings of the European Conference on Machine Learning
and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD-11).
Athens, Greece.
[pdf]
- B. Ahmadi, K. Kersting, S. Sanner (2011).
Multi-Evidence Lifted Message Passing with Application to PageRank and the Kalman Filter.
In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI-11).
Barcelona, Spain.
[pdf]
- K. V. Delgado, S. Sanner, L. N. de Barros (2011).
Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities.
Artificial Intelligence Journal (AIJ). Volume 175, pp 1498-1527.
[pdf (pre-print)]
- K. V. Delgado, L. N. de Barros, F. G. Cozman, S. Sanner (2011).
Using Mathematical Programming to Solve
Factored Markov Decision Processes with Imprecise Probabilities.
International Journal of Approximate Reasoning (IJAR). Volume 52, Issue 7, October, pp 1000-1017.
[pdf (pre-print)]
- E. Bonilla, S. Guo, and S. Sanner (2010). Gaussian Process Preference Elicitation. In Proceedings of the 24th Annual Conference on Neural Information Processing Systems (NIPS-10). Vancouver, Canada.
[pdf]
- C. Downey and S. Sanner (2010). Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda. In Proceedings of the 27th International Conference on Machine Learning (ICML-10). Haifa, Israel.
[pdf]
[slides (pdf)]
- S. Sanner and K. Kersting (2010). Symbolic Dynamic Programming for First-order POMDPs. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI-10). Atlanta, USA.
[pdf]
[slides (pdf)]
- S. Guo and S. Sanner (2010). Probabilistic Latent Maximal Marginal Relevance. In Proceedings of the 33rd Annual ACM SIG Information Retrieval Conference (SIGIR-10). Geneva, Switzerland.
[pdf]
- S. Sanner, W. Uther, K. V. Delgado (2010). Approximate Dynamic Programming with Affine ADDs. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10). Toronto, Canada.
[pdf]
[slides (pdf)]
[code]
- S. Guo and S. Sanner (2010). Real-time Multiattribute Bayesian Preference Elicitation with Pairwise Comparison Queries. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS-10). Sardinia, Italy.
[pdf]
[slides for older version of work (pdf)]
- K. V. Delgado, S. Sanner, L. N. de Barros, F. G. Cozman (2009). Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities. In Proceedings of the 19th International Conference on Automated Planning and Scheduling (ICAPS-09). Thessaloniki, Greece.
[pdf]
[slides (pdf)]
- S. Sanner, R. Goetschalckx, K. Driessens, and G. Shani (2009). Bayesian Real-time Dynamic Programming. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI-09). San Jose, USA.
[pdf]
[slides (pdf)]
- S. Sanner, and C. Boutilier (2009). Practical solution techniques for first-order MDPs. Artificial Intelligence Journal (AIJ).
Volume 173, pp 748-788.
[pdf (pre-print)]
- R. Goetschalckx, S. Sanner, and K. Driessens (2008). Cost-sensitive parsimonious linear
regression. In Proceedings of the 8th IEEE International Conference on Data Mining (ICDM-08). Pisa, Italy.
[pdf]
- R. Goetschalckx, S. Sanner, and K. Driessens (2008). Reinforcement learning with the use of costly features. Proceedings of the 18th European Conference on Artificial Intelligence (ECAI-08). Patras, Greece.
[short version (pdf)]
Extended version in European Workshop on Reinforcement Learning (EWRL-08).
[extended version (pdf)]
- S. Sanner (2008). How to spice up your planning under uncertainty research life. Workshop on a Reality Check for Planning and Scheduling Under Uncertainty (at ICAPS-08). Sydney, Australia.
[pdf]
[slides (pdf)]
- S. Sanner, and C. Boutilier (2007). Approximate solution techniques for factored first-order MDPs. In Proceedings of the 17th Conference on Automated Planning and Scheduling (ICAPS-07).
[ps.gz] [pdf]
[slides (pdf)]
- S. Sanner, T. Graepel, R. Herbrich, and T. Minka (2007).
Learning CRFs with hierarchical features: An application to the game of Go.
In Proceedings of the Workshop on Constrained Optimization and Structured Output Spaces (at ICML-07).
[ps.gz] [pdf]
[slides (pdf)]
-
S. Sanner, and K. Kersting (2007). Symbolic dynamic programming. Chapter to appear in C. Sammut, editor, Encyclopedia of Machine Learning, Springer-Verlag.
- S. Sanner, and S. McIlraith (2006). An ordered theory
resolution calculus for
hybrid reasoning in first-order extensions of description logic.
In Proceedings of the 10th International
Conference on Principles of Knowledge Representation
and Reasoning (KR-06).
[ps.gz] [pdf]
[slides (pdf - color)]
[slides (pdf - bw)]
- S. Sanner (2006). Online feature discovery in relational reinforcement learning.
In Proceedings of the Open Problems in Statistical Relational Learning Workshop (SRL-06).
[ps.gz] [pdf]
[slides (pdf - color)]
[slides (pdf - bw)]
- S. Sanner, and D. McAllester (2005). Affine algebraic decision
diagrams (AADDs) and their application to structured probabilistic
inference. In Proceedings of the 19th International Joint
Conference on AI (IJCAI-05).
[ps.gz] [pdf]
[slides (pdf - color)]
[slides (pdf - bw)]
[code]
- S. Sanner (2005). Simultaneous learning of structure and value in
relational reinforcement learning. In Proceedings of the Rich Representations for Relational Reinforcement Learning Workshop (RRfRL-05).
[ps.gz] [pdf]
[slides (pdf - color)]
[slides (pdf - bw)]
- D. Anguelov, R. Biswas, D. Koller, B. Limketkai, S. Sanner, and S.
Thrun (2002). Learning hierarchical object maps of non-stationary
environments with mobile robots. In Proceedings of the 18th
Conference on Uncertainty in AI (UAI-02). [ps.gz
(large)] [pdf]
- R. Biswas, B. Limketkai, S. Sanner, and S. Thrun (2002). Towards
object mapping in dynamic environments with mobile robots. In
Proceedings of the Conference on Intelligent Robots and Systems
(IROS-02). [ps.gz
(large)] [pdf]
- S. Sanner, J. R. Anderson, C. Lebiere, and M. Lovett (2000).
Achieving efficient and cognitively plausible learning in
backgammon. In Proceedings of the 17th International
Conference on Machine Learning (ICML-00). [ps.gz] [pdf] (C++ Source Code [tar.gz]
and associated [README]
file.)
Presentations
- S. Sanner (2010). Tutorial Slides for
Decision Diagrams in Automated Planning and Scheduling from ICAPS-2011.
[pdf]
- S. Sanner (2010). Tutorial Slides for
Introduction to Planning Domain Modeling in RDDL from ICAPS-2011.
[pdf]
- S. Sanner (2010). Tutorial Slides for
Graphical Models from ICAPS-2010.
[pdf]
- S. Sanner (2010). Tutorial Slides for
Traffic Control from ICAPS-2010.
[pdf]
- S. Sanner (2009). Tutorial Slides for
Reinforcement Learning from SSLL-2009.
[pdf]
- S. Sanner (2008). Newly revised First-order MDP (FOMDP) Tutorial Slides.
[pdf]
Also see slides from the ICAPS-2008 tutorial on First-order Planning Techniques given with Kristian Kersting and Saket Joshi. [web site]
- S. Sanner (2006). Lecture slides for an introduction to
the field of Automated Theorem Proving.
[pdf]
- S. Sanner (2001). Talk slides for a quick
introduction to the field of Description Logics, its history, and
some of its core (and beautiful) motivating ideas. [ps.gz]
[pdf]
Thesis
- S. Sanner (2008). First-order decision-theoretic planning in structured relational environments. PhD Thesis, University of Toronto. Accepted: 12/2007; Publication Date: 3/2008.
[ps.gz] [pdf]
Technical Reports and Unpublished Works
- S. Sanner (2011). Relational Dynamic Influence Diagram Language (RDDL): Language Description.
Unpublished.
[pdf]
[tutorial slides (pdf)]
- S. Sanner (2005). Future directions for first-order decision-theoretic planning. Research Proposal, University of Toronto.
[ps.gz] [pdf]
(Presentation Slides:
[pdf (color)]
[pdf (bw)])
- S. Sanner (2004). Relational and first-order decision-theoretic
planning: Foundations and future directions. Depth Report,
University of Toronto. [ps.gz] [pdf]
(Note: There is also a deterministic planning supplement to this report.
[ps.gz] [pdf] )
- S. Sanner (2004). Refutation-complete binary decision diagrams. Unpublished.
[ps.gz] [pdf]
- S. Sanner (2003). Towards practical taxonomic classification for
description logics on the Semantic Web. Technical Report,
Stanford University, Knowledge Systems Lab: KSL-03-06. [ps.gz] [pdf] [Java Theorem Prover (JTP) software including the DAML+OIL classification reasoner]
(Note: There is also a less technical, condensed version of this
paper. [ps.gz] [pdf] )