P. Germain, A. Habrard, F. Laviolette, and E. Morvant, A PAC-Bayesian approach for domain adaptation with specialization to linear classifiers, International Conference on Machine Learning, pp.738-746, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00822685

P. Germain, A. Habrard, F. Laviolette, and E. Morvant, A new PAC-Bayesian perspective on domain adaptation, International Conference on Machine Learning, vol.48, pp.859-68, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01307045

J. Jiang, A literature survey on domain adaptation of statistical classifiers, Tech. Rep, 2008.

J. Quiñonero-Candela, M. Sugiyama, A. Schwaighofer, and N. Lawrence, Dataset Shift in Machine Learning, MIT Press, ISBN 9780262170055, 2009.

A. Margolis, A literature review of domain adaptation with unlabeled data, 2011.

M. Wang and W. Deng, Deep visual domain adaptation: A survey, Neurocomputing, 2018.

W. M. Kouw and M. Loog, A review of single-source unsupervised domain adaptation, CoRR, 2019.

I. Redko, E. Morvant, A. Habrard, M. Sebban, and Y. Bennani, Advances in Domain Adaptation Theory: Available Theoretical Results, 2019.

S. J. Pan and Q. Yang, A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, vol.22, issue.10, pp.1345-59, 2010.

S. Ben-David, T. Lu, T. Luu, and D. Pál, Impossibility theorems for domain adaptation, International Conference on Artificial Intelligence and Statistics, vol.9, pp.129-136, 2010.

S. Ben-David and R. Urner, On the hardness of domain adaptation and the utility of unlabeled target samples, Proceedings of Algorithmic Learning Theory, pp.139-53, 2012.

S. Ben-David and R. Urner, Domain adaptation-can quantity compensate for quality?, Ann Math Artif Intell, vol.70, issue.3, pp.185-202, 2014.

J. Huang, A. Smola, A. Gretton, K. Borgwardt, and B. Schölkopf, Correcting sample selection bias by unlabeled data, Advances in Neural Information Processing Systems, pp.601-609, 2006.

C. Cortes, Y. Mansour, and M. Mohri, Learning bounds for importance weighting, Advances in Neural Information Processing Systems, pp.442-50, 2010.

C. Cortes, M. Mohri, and A. M. Medina, Adaptation algorithm and theory based on generalized discrepancy, ACM SIGKDD, pp.169-78, 2015.

M. Sugiyama, S. Nakajima, H. Kashima, P. von Bünau, and M. Kawanabe, Direct importance estimation with model selection and its application to covariate shift adaptation, Advances in Neural Information Processing Systems, pp.1433-1440, 2008.

L. Bruzzone and M. Marconcini, Domain adaptation problems: A DASVM classification technique and a circular validation strategy, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, pp.770-87, 2010.

A. Habrard, J. P. Peyrache, and M. Sebban, Iterative self-labeling domain adaptation for linear structured image classification, International Journal on Artificial Intelligence Tools, vol.22, issue.05, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00869404

E. Morvant, Domain adaptation of weighted majority votes via perturbed variation-based self-labeling, Pattern Recognition Letters, vol.51, pp.37-43, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01056599

X. Glorot, A. Bordes, and Y. Bengio, Domain adaptation for large-scale sentiment classification: A deep learning approach, Proceedings of the International Conference on Machine Learning, pp.513-520, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00752091

M. Chen, Z. E. Xu, K. Q. Weinberger, and F. Sha, Marginalized denoising autoencoders for domain adaptation, International Conference on Machine Learning, 2012.

N. Courty, R. Flamary, D. Tuia, and A. Rakotomamonjy, Optimal transport for domain adaptation, IEEE Trans Pattern Anal Mach Intell, vol.39, issue.9, pp.1853-65, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01377220

J. Li, K. Lu, Z. Huang, L. Zhu, and H. T. Shen, Transfer independently together: A generalized framework for domain adaptation, IEEE Trans Cybernetics, vol.49, issue.6, pp.2144-55, 2019.

Y. Ganin, E. Ustinova, H. Ajakan, P. Germain, H. Larochelle et al., Domain-adversarial training of neural networks, Journal of Machine Learning Research, vol.17, issue.59, pp.1-35, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01624607

Z. Ding and Y. Fu, Deep domain generalization with structured low-rank constraint, IEEE Trans Image Processing, vol.27, issue.1, pp.304-317, 2018.

R. Shu, H. H. Bui, H. Narui, and S. Ermon, A DIRT-T approach to unsupervised domain adaptation, International Conference on Learning Representations, 2018.

J. Li, K. Lu, Z. Huang, L. Zhu, and H. T. Shen, Heterogeneous domain adaptation through progressive alignment, IEEE Trans Neural Netw Learning Syst, vol.30, issue.5, pp.1381-91, 2019.

A. Schoenauer-Sebag, L. Heinrich, M. Schoenauer, M. Sebag, L. F. Wu et al., Multi-domain adversarial learning, International Conference on Learning Representations, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01968180

I. Kuzborskij and F. Orabona, Stability and hypothesis transfer learning, International Conference on Machine Learning, pp.942-50, 2013.

I. Kuzborskij, Theory and algorithms for hypothesis transfer learning, 2018.

S. Ben-David, J. Blitzer, K. Crammer, and F. Pereira, Analysis of representations for domain adaptation, Advances in Neural Information Processing Systems, pp.137-144, 2006.

S. Ben-David, J. Blitzer, K. Crammer, A. Kulesza, F. Pereira et al., A theory of learning from different domains, Machine Learning, vol.79, issue.1-2, pp.151-75, 2010.

X. Li and J. Bilmes, A Bayesian divergence prior for classifier adaptation, International Conference on Artificial Intelligence and Statistics, pp.275-82, 2007.

C. Zhang, L. Zhang, and J. Ye, Generalization bounds for domain adaptation, Advances in Neural Information Processing Systems, 2012.

E. Morvant, A. Habrard, and S. Ayache, Parsimonious Unsupervised and Semi-Supervised Domain Adaptation with Good Similarity Functions, Knowledge and Information Systems, vol.33, issue.2, pp.309-358, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00686205

C. Cortes and M. Mohri, Domain adaptation and sample bias correction theory and algorithm for regression, Theoretical Computer Science, vol.519, pp.103-129, 2014.

I. Redko, A. Habrard, and M. Sebban, Theoretical analysis of domain adaptation with optimal transport, Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp.737-53, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01613564

Y. Mansour, M. Mohri, and A. Rostamizadeh, Domain adaptation: Learning bounds and algorithms, Conference on Learning Theory, pp.19-30, 2009.

C. Cortes and M. Mohri, Domain adaptation in regression, Algorithmic Learning Theory, pp.308-331, 2011.

Y. Mansour, M. Mohri, and A. Rostamizadeh, Multiple source adaptation and the Rényi divergence, Conference on Uncertainty in Artificial Intelligence, pp.367-74, 2009.

D. A. McAllester, Some PAC-Bayesian theorems, Machine Learning, vol.37, pp.355-63, 1999.

P. Germain, A. Lacasse, F. Laviolette, and M. Marchand, PAC-Bayesian learning of linear classifiers, International Conference on Machine Learning, 2009.

E. Parrado-Hernández, A. Ambroladze, J. Shawe-Taylor, and S. Sun, PAC-Bayes bounds with data dependent priors, Journal of Machine Learning Research, vol.13, pp.3507-3538, 2012.

T. G. Dietterich, Ensemble methods in machine learning, International workshop on multiple classifier systems, pp.1-15, 2000.

M. Re and G. Valentini, Ensemble methods: A review, Advances in Machine Learning and Data Mining for Astronomy, pp.563-82, 2012.

A. Lacasse, F. Laviolette, M. Marchand, P. Germain, and N. Usunier, PAC-Bayes bounds for the risk of the majority vote and the variance of the Gibbs classifier, Advances in Neural Information Processing Systems, pp.769-76, 2006.

P. Germain, A. Lacasse, F. Laviolette, M. Marchand, and J. F. Roy, Risk bounds for the majority vote: From a PAC-Bayesian analysis to a learning algorithm, Journal of Machine Learning Research, vol.16, pp.787-860, 2015.

O. Catoni, PAC-Bayesian supervised classification: the thermodynamics of statistical learning, Institute of Mathematical Statistics Lecture Notes-Monograph Series, vol.56, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

M. Seeger, PAC-Bayesian generalization bounds for Gaussian processes, Journal of Machine Learning Research, vol.3, pp.233-69, 2002.

J. Langford, Tutorial on practical prediction theory for classification, Journal of Machine Learning Research, vol.6, pp.273-306, 2005.

P. Germain, A. Habrard, F. Laviolette, and E. Morvant, PAC-Bayesian theorems for domain adaptation with specialization to linear classifiers, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01134246

P. Germain, A. Lacasse, F. Laviolette, M. Marchand, and S. Shanian, From PAC-Bayes bounds to KL regularization, Advances in Neural Information Processing Systems, pp.603-613, 2009.

D. A. McAllester and J. Keshet, Generalization bounds and consistency for latent structural probit and ramp loss, Advances in Neural Information Processing Systems, pp.2205-2217, 2011.

J. Langford and J. Shawe-Taylor, PAC-Bayes & margins, Advances in Neural Information Processing Systems, pp.439-485, 2002.

A. Ambroladze, E. Parrado-Hernández, and J. Shawe-Taylor, Tighter PAC-Bayes bounds, Advances in Neural Information Processing Systems, pp.9-16, 2006.

B. Schölkopf, R. Herbrich, and A. J. Smola, A generalized representer theorem, Annual Conference on Computational Learning Theory, and European Conference on Computational Learning Theory, pp.416-442, 2001.

S. Ben-David, S. Shalev-Shwartz, and R. Urner, Domain adaptation-can quantity compensate for quality?, International Symposium on Artificial Intelligence and Mathematics (ISAIM), 2012.

H. Shimodaira, Improving predictive inference under covariate shift by weighting the log-likelihood function, J Statist Plann Inference, vol.90, issue.2, pp.227-271, 2000.

R. Urner, S. Shalev-Shwartz, and S. Ben-David, Access to unlabeled data can speed up prediction time, International Conference on Machine Learning, pp.641-649, 2011.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

J. Blitzer, R. McDonald, and F. Pereira, Domain adaptation with structural correspondence learning, Conference on Empirical Methods in Natural Language Processing, pp.120-128, 2006.

M. Chen, K. Q. Weinberger, and J. Blitzer, Co-training for domain adaptation, Advances in Neural Information Processing Systems, pp.2456-64, 2011.

T. Joachims, Transductive inference for text classification using support vector machines, International Conference on Machine Learning, pp.200-209, 1999.

C. C. Chang and C. J. Lin, LibSVM: a library for support vector machines, 2001.

E. Zhong, W. Fan, Q. Yang, O. Verscheure, and J. Ren, Cross validation framework to choose amongst models and datasets for transfer learning, Machine Learning and Knowledge Discovery in Databases, vol.6323, pp.547-62, 2010.

A. Pentina and C. Lampert, A PAC-Bayesian bound for lifelong learning, JMLR W&CP, Proceedings of International Conference on Machine Learning, vol.32, pp.991-1000, 2014.

A. Goyal, E. Morvant, P. Germain, and M. R. Amini, PAC-Bayesian analysis for a two-step hierarchical multiview learning approach, Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp.205-226, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01546109

K. Crammer, M. Kearns, and J. Wortman, Learning from multiple sources, Advances in Neural Information Processing Systems, vol.19, p.321, 2007.

Y. Mansour, M. Mohri, and A. Rostamizadeh, Domain adaptation with multiple sources, Advances in Neural Information Processing Systems, pp.1041-1049, 2009.

J. Hoffman, M. Mohri, and N. Zhang, Algorithms and theory for multiple-source adaptation, Conference on Neural Information Processing Systems, 2018.

S. Thrun and T. M. Mitchell, Lifelong robot learning, Robotics and Autonomous Systems, vol.15, issue.1-2, pp.25-46, 1995.

A. Maurer, A note on the PAC Bayesian theorem, CoRR, 2004.

Y. Seldin and N. Tishby, PAC-Bayesian analysis of co-clustering and beyond, Journal of Machine Learning Research, vol.11, pp.3595-646, 2010.

D. McAllester, A PAC-Bayesian tutorial with a dropout bound, CoRR, 2013.