Stop Thinking, Just Do!

Sungsoo Kim's Blog

Lifelong and Continual Learning


2 April 2020


Article Source


Lifelong and Continual Learning

Learn as “Humans” do for Artificial General Intelligence (AGI)

[Second Edition: "Lifelong Machine Learning," by Z. Chen and B. Liu, Morgan & Claypool, August 2018 (1st edition, 2016).]

Added three new chapters: (4) Continual Learning and Catastrophic Forgetting, (5) Open-world Learning, (8) Continuous Knowledge Learning in Chatbots

Introduced the concept of learning on the job or learning while working.

Updated and/or reorganized the other chapters.

Download the first edition, Lifelong Machine Learning, Nov 2016.

[Lifelong Machine Learning Tutorial. Title: Lifelong Machine Learning and Computer Reading the Web, KDD-2016, August 13-17, 2016, San Francisco, USA.]

[Lifelong Machine Learning Tutorial, IJCAI-2015, July 25-31, 2015, Buenos Aires, Argentina.]

[A Podcast: “Machines that Learn Like Humans” by my former student Zhiyuan Chen and Francesco Gadaleta (host).]

[A Resource Site maintained by Eric Eaton's group]

[DARPA program: Lifelong Learning Machines (L2M), 3/16/2017]

The classic machine learning paradigm learns in isolation. Given a dataset, a learning algorithm is applied to it to produce a model without considering any previously learned knowledge. This paradigm needs a large number of training examples and is only suitable for well-defined and narrow tasks in closed environments. Looking ahead, to deal with these limitations and to learn more like humans, I believe that it is necessary to do lifelong machine learning, or simply lifelong learning (also called continual learning or even continuous learning), which tries to mimic "human learning" in order to build a lifelong learning machine. The key characteristic of "human learning" is continual learning and adaptation to new environments: we accumulate the knowledge gained in the past and use that knowledge to help future learning and problem solving, with adaptations as needed. Ideally, a lifelong learner should also be able to discover new tasks and learn on the job in open environments in a self-supervised manner. Without the lifelong learning capability, AI systems will probably never be truly intelligent. The goal, then, is a learning machine or agent that continually learns and accumulates knowledge, becoming more and more knowledgeable and better and better at learning.
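
To make the paradigm concrete, below is a minimal, illustrative sketch of the lifelong learning loop in Python: a learner that retains a knowledge base across tasks and uses it as a prior when learning each new task. The class names (KnowledgeBase, LifelongLearner) and the per-feature statistics are assumptions for illustration only, not an implementation of any specific system discussed here.

```python
class KnowledgeBase:
    """Accumulates knowledge (here: simple per-feature counts) across all past tasks."""

    def __init__(self):
        self.feature_counts = {}  # knowledge retained from tasks 1..N-1

    def update(self, features):
        for f in features:
            self.feature_counts[f] = self.feature_counts.get(f, 0) + 1

    def prior_weight(self, feature):
        # Features seen often in past tasks receive a stronger prior.
        return 1.0 + self.feature_counts.get(feature, 0)


class LifelongLearner:
    """Learns task N with help from knowledge accumulated on earlier tasks,
    instead of learning each task in isolation."""

    def __init__(self):
        self.kb = KnowledgeBase()

    def learn_task(self, examples):
        # examples: list of (feature_set, label) pairs, with label in {+1, -1}.
        weights = {}
        for features, label in examples:
            for f in features:
                # Past knowledge biases the new model rather than
                # learning from scratch.
                weights[f] = weights.get(f, 0.0) + label * self.kb.prior_weight(f)
            self.kb.update(features)  # retain knowledge for future tasks
        return weights


learner = LifelongLearner()
model_1 = learner.learn_task([({"great", "fast"}, 1), ({"slow"}, -1)])
model_2 = learner.learn_task([({"great", "cheap"}, 1)])  # reuses past knowledge
```

The point of the sketch is the loop structure: each `learn_task` call both consumes and extends the knowledge base, so the learner becomes more knowledgeable with every task.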

Human learning is very different: I believe that no human being has ever been given 1000 positive and 1000 negative documents (or images) and asked to learn a text classifier. Because we have accumulated so much knowledge in the past and understand it, we can usually learn with little effort and few examples. Without that accumulated knowledge, it is very hard to learn even from 2000 training examples. For example, I do not understand Arabic; if you give me 2000 Arabic documents and ask me to build a classifier, I cannot do it. Yet that is exactly what current machine learning is asked to do, and it is not how humans learn.

Our Work

Some of my work uses sentiment analysis (SA) tasks and data because the problems I encountered in an SA startup are what motivated me to work on lifelong learning or continual learning. SA is very hard to scale up without lifelong learning.

  • Continual Learning (ICLR-2019). Overcoming catastrophic forgetting for continual learning via model adaptation.
  • Lifelong Unsupervised Learning:
    • Lifelong topic modeling (ICML-2014, KDD-2014, WWW-2016): retain the topics learned from previous domains and use the knowledge for future modeling in other domains.
    • Lifelong belief propagation (EMNLP-2016): use the knowledge learned previously to expand the graph and to obtain more accurate prior probability distributions.
    • Lifelong information extraction (AAAI-2016): make use of previously learned knowledge for better extraction.
  • Lifelong Supervised Learning (ACL-2015, ACL-2017):
    • Using a generative model (ACL-2015): The ACL-2015 work is about lifelong learning using a generative model. It is used for sentiment classification.
  • Learning on the Job (ACL-2017 and SIGDIAL-2019): This work is about learning after a model has been deployed in an application, i.e., learning while working.
  • Open-world Learning (a.k.a. open-world classification or open classification) (KDD-2016, EMNLP-2017): this learning paradigm is becoming very important as AI agents (e.g., self-driving cars and chatbots) increasingly face real-world open and dynamic environments, where there are always new or unexpected objects. Traditional learning, by contrast, makes the closed-world assumption: test instances can only come from the training/seen classes, which is not true in the open world. Ideally, an open-world learner should be able to do the following (a minimal code sketch of the first step appears after this list):
    • detecting instances of unseen classes - classes not seen in training (the DOC algorithm (EMNLP-2017) is quite powerful for this task, for both text and images),
    • automatically identifying unseen classes from the detected instances in a self-supervised manner, and
    • incrementally learning the new/unseen classes.

    In this process, the system becomes more and more knowledgeable and better at learning. It also knows what it does and does not know.

  • Continuous Learning in Dialogues (SIGDIAL-2019): dialogue systems or chatbots have been very popular in recent years, but they cannot learn new knowledge during conversation; their knowledge is fixed beforehand and cannot be expanded while chatting. In this work, we aim to build a lifelong and interactive knowledge learning engine for chatbots.
  • Characteristics of lifelong learning: (1) learning continuously (ideally in the open world), (2) accumulating the previously learned knowledge to become more and more knowledgeable, (3) using the knowledge to learn more knowledge and adapting it for problem solving, (4) discovering new problems/tasks to be learned and learning them incrementally, and (5) learning on the job or learning while working, i.e., improving the model during testing or in applications.
  • Transfer learning vs. lifelong learning: Transfer learning uses the source domain's labeled data to help target domain learning. Unlike lifelong learning, transfer learning is not continual and has no knowledge retention (it uses the source's labeled data, not learned knowledge). The source must be similar to the target (both are normally selected by the user). It is also one-directional: the source helps the target, but not the other way around, because the target has little or no labeled data.
  • Multitask learning vs. lifelong learning: Multitask learning jointly optimizes the learning of multiple tasks. Although it can be made continual, multitask learning does not retain any explicit knowledge except data, and when the number of tasks is really large, it is hard to re-learn everything whenever a new task arrives.
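
The following is a minimal, illustrative sketch of the rejection step in open-world classification, loosely inspired by the per-class thresholding idea in DOC (EMNLP-2017). The scoring interface, class names, and fixed thresholds are assumptions for illustration, not the published model (DOC estimates per-class thresholds from the score distributions).

```python
import numpy as np

def open_world_predict(scores, thresholds, class_names):
    """Classify one test instance from its one-vs-rest class scores,
    rejecting it as unseen when no score clears its class threshold."""
    best = int(np.argmax(scores))
    if scores[best] < thresholds[best]:
        return "UNSEEN"  # likely an instance of a class not seen in training
    return class_names[best]

class_names = ["positive", "negative"]
thresholds = np.array([0.5, 0.5])  # toy fixed values; DOC learns these per class

print(open_world_predict(np.array([0.91, 0.07]), thresholds, class_names))  # -> positive
print(open_world_predict(np.array([0.32, 0.28]), thresholds, class_names))  # -> UNSEEN
```

Detected "UNSEEN" instances would then be clustered to identify new classes and fed back for incremental learning, following the three steps listed above.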

Publications

Textbook: Zhiyuan Chen and Bing Liu. Lifelong Machine Learning. Morgan & Claypool, 2018 (2nd edition), 2016 (1st edition).

