The Human Behaviour-Change Project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation

Michie, Susan; Thomas, James; Johnston, Marie; Aonghusa, Pol Mac; Shawe-Taylor, John; Kelly, Michael P.; Deleris, Léa A.; Finnerty, Ailbhe N.; Marques, Marta M.; Norris, Emma; O’Mara-Eves, Alison; West, Robert

doi:10.1186/s13012-017-0641-5

Study protocol
Open access
Published: 18 October 2017

The Human Behaviour-Change Project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation

Susan Michie¹,
James Thomas²,
Marie Johnston³,
Pol Mac Aonghusa⁴,
John Shawe-Taylor⁵,
Michael P. Kelly⁶,
Léa A. Deleris⁴,
Ailbhe N. Finnerty¹,
Marta M. Marques¹,
Emma Norris¹,
Alison O’Mara-Eves² &
…
Robert West⁷

Implementation Science volume 12, Article number: 121 (2017) Cite this article

29k Accesses
158 Citations
152 Altmetric
Metrics details

Abstract

Background

Behaviour change is key to addressing both the challenges facing human health and wellbeing and to promoting the uptake of research findings in health policy and practice. We need to make better use of the vast amount of accumulating evidence from behaviour change intervention (BCI) evaluations and promote the uptake of that evidence into a wide range of contexts. The scale and complexity of the task of synthesising and interpreting this evidence, and increasing evidence timeliness and accessibility, will require increased computer support.

The Human Behaviour-Change Project (HBCP) will use Artificial Intelligence and Machine Learning to (i) develop and evaluate a ‘Knowledge System’ that automatically extracts, synthesises and interprets findings from BCI evaluation reports to generate new insights about behaviour change and improve prediction of intervention effectiveness and (ii) allow users, such as practitioners, policy makers and researchers, to easily and efficiently query the system to get answers to variants of the question ‘What works, compared with what, how well, with what exposure, with what behaviours (for how long), for whom, in what settings and why?’.

Methods

The HBCP will: a) develop an ontology of BCI evaluations and their reports linking effect sizes for given target behaviours with intervention content and delivery and mechanisms of action, as moderated by exposure, populations and settings; b) develop and train an automated feature extraction system to annotate BCI evaluation reports using this ontology; c) develop and train machine learning and reasoning algorithms to use the annotated BCI evaluation reports to predict effect sizes for particular combinations of behaviours, interventions, populations and settings; d) build user and machine interfaces for interrogating and updating the knowledge base; and e) evaluate all the above in terms of performance and utility.

Discussion

The HBCP aims to revolutionise our ability to synthesise, interpret and deliver evidence on behaviour change interventions that is up-to-date and tailored to user need and context. This will enhance the usefulness, and support the implementation of, that evidence.

Background

Many global threats to human health and wellbeing can only be solved by people, organisations and governments changing their behaviour. This includes behaviours directly relevant to health but also behaviours of policy-makers and providers responsible for promoting health and delivering healthcare. To that end, we need to use evidence being gathered about behaviour change more effectively than at present. A great deal more evidence is produced and published than it is possible for researchers to be able to use effectively with conventional methods.

The current waste in research is being increasingly recognised and addressed: for example, the Lancet series “Research: increasing value, reducing waste” [1] and the subsequent REWARD (REduce research Waste And Reward Diligence) campaign [2]. The waste occurs in biomedical and behavioural sciences and is apparent at every stage of the research process, including poor reporting of research so that evidence cannot be synthesised and implemented effectively and efficiently. The potential for implementation science to improve health promotion and delivery will remain compromised unless the problem of this waste is tackled.

The quantity, complexity and variability of reporting of behaviour change intervention (BCI)^g evaluations (see Table 1 for glossary of definitions for terms identified with the superscript ^g) severely limit the accessibility and value of this evidence for those who need it (Optimising the value of the evidence generated in Implementation Science: the use of ontologies to address the challenges, Invited submission forthcoming). The Human Behaviour-Change Project (HBCP) will develop and evaluate a BCI Knowledge System ^g: an automated system delivering comprehensive, high quality, timely and accessible syntheses and interpretations of evidence.

Table 1 Glossary of terms

Full size table

The challenges of a rapidly expanding, complex evidence base

BCIs^g are policies, activities, services or products designed to induce or support people to act differently from how they would have acted otherwise. They involve attempting to change either characteristics of members of the target population (in terms of their knowledge, skills, beliefs, feelings or habits), or their social or physical environment, or both. In the large majority of cases, the goal is to achieve change that is sustained over an extended period of time (e.g., reducing excessive alcohol consumption or smoking prevalence in the general population, or fostering new prescribing patterns among clinicians). Research findings have the potential to provide invaluable knowledge to help with developing or selecting BCIs but this evidence needs to be synthesised and interpreted. We need a cumulative, contemporaneous and accessible knowledge base^g of behaviour change findings to continue to build the science of human behaviour change.

Systematic reviews and meta-analyses provide a means of gathering and synthesising this evidence but the scientific literature on behaviour change is vast and accumulating exponentially. Considering the person-hours required for any given review, there are neither the human nor financial resources to achieve this manually at the scale required. Insufficient human resources to undertake evidence reviews and syntheses also means that these are often out of date by the time of completion [3]. The median time for primary study results to be incorporated into a systematic review has been found to range from 2.5 to 6.5 years [4] and only a minority of reviews are updated within 2 years of publication [5]. A further limitation of the current method is that there is often insufficient power in the evidence gathered to enable moderator analyses, especially for under-researched populations and geographical areas.

In addition, the diversity in the literature presents considerable challenges when it comes to making generalisations in terms of intervention effectiveness. Target behaviours^g vary widely in their characteristics, from cessation of unwanted behaviours such as tobacco smoking to increases in desired ones such as implementing evidence-based practice. The types of interventions evaluated are also subject to wide variation from policies such as raising excise duty on unhealthy products to digital mobile applications for promoting medication adherence. Populations^g also vary, with some studies involving what are intended to be general population samples and others based on participants with special characteristics, such as mental health problems. Settings^g vary across dimensions from physical locality to culture. With such diversity in the evidence base, there is a need for a coherent conceptual framework to allow evidence from different studies to be integrated and compared.

Addressing heterogeneity in the research literature is made more challenging by inconsistent and incomplete reporting of interventions and study methods and findings. The situation has been improved by the publication of a number of guidelines [6], but intervention evaluations still vary widely in quality and format, and are reported inconsistently and incompletely using terminology with limited standardisation [7].

Methods of evidence synthesis such as meta-analysis and meta-regression have substantially improved the ability to draw generalisable conclusions from intervention evaluations, but they are mostly limited to making inferences about simple effects for interventions that have been evaluated, or first-order interactions with moderator variables. More advanced statistical techniques are beginning to be developed [8], and will need to be built on. There is a need to be able to draw inferences that take account of complex interactions between intervention characteristics, populations and settings. Moreover, even with the numbers of studies retrievable by current methods, the populations and settings to which one may wish to generalise are so varied that making inferences from studies to real-world applications is problematic.

Important challenges facing evidence synthesis and interpretation, and approaches to addressing those challenges are shown in Table 2.

Table 2 Challenges facing evidence synthesis and interpretation in behaviour change

Full size table

The Human Behaviour-Change Project (HBCP)

The vision for the Human Behaviour-Change Project [9] is to build a Knowledge System that accesses the growing number of BCI evaluation reports^g, automatically annotates these reports to identify key features^g, and synthesises and interprets the findings to answer variants of the big question: ‘What works, compared with what, how well, with what exposure, with what behaviours (for how long), for whom, in what settings and why?’. The project includes the development of a user interface^g to allow intervention designers, policymakers, researchers, the general public and other computer systems to access, interrogate and update the knowledge base.

A multi-disciplinary team, spanning behavioural, computer and information scientists and system architects, supported by substantial engagement from scientists and users, will develop and evaluate the first iteration of the HBCP establishing proof of principle, with an initial focus on smoking cessation. This domain was selected due to its large and relatively well-defined evidence base and outcome measures that are relatively robust and important for public health.

Organising and classifying research, and generating inferences: The role of ontologies

The process of knowledge accumulation requires a common conceptual framework within which information can be represented. Data structures that organise knowledge in a structure that specifies entities^g and their relationships are called ‘ontologies’^g [10, 11].

In information science an ‘ontology’ is defined as a data structure consisting of a set of 1) unique identifiers representing types of ‘entity’^g (primarily objects^g, attributes^g, processes^g, or collections of these), 2) labels and definitions corresponding to these identifiers, and 3) specified relationships between the entities. The labels and definitions of entities and relationships in a given ontology^g make up a ‘controlled vocabulary’ which provides a basis for the interoperability of databases using the ontology [10, 11].

Ontologies have transformed a number of areas of science. Most notably the Gene Ontology has unified the field of biology which previously was highly fragmented [12]. Ontology development requires considerable expertise and to that end the OBO Foundry [13] was established to provide a resource for ontology developers and a set of guiding principles from which to work.

As yet, no widely-used ontology has been developed for behavioural science, although ones have been developed for public health [14] and mental entities such as emotions [15], mental disorders and mental functioning [16]. An ontology for understanding human behaviour change needs to represent both causal relationships (e.g., that a given type of intervention affects a given behaviour in a specified context) as well as semantic relationships (e.g., that a given type of intervention is a subclass of a broader type of intervention) [10, 11].

The HBCP will develop a BCI ontology (BCIO^g) that will define important entities described in BCI evaluation reports. Fig. 1 shows upper-level entities that need to be captured in the BCIO and some of their relationships. The labels for these may change in the course of development of the BCIO but this provides an indication of what information needs to be captured. Note that Fig. 1 is not the formal ontology but is shown to illustrate key parts that need to be included.

The BCIO includes entities that are important in answering questions about BCI effectiveness as follows:

BCI evaluation report is a written description of a BCI study, which provides information about one or more BCI evaluations (see below), including the intervention(s) being evaluated, study methods and findings. It will typically involve a published paper but may include information from more than one paper, for example if important features of the methods are described in a protocol paper.
BCI study is an empirical data-gathering activity consisting of one or more BCI evaluations.
BCI evaluation is a comparison between two or more BCI scenarios^g.
Method ^g defined as the set of attributes of BCI evaluation methods. These include study design (e.g., controlled trial), measures, sample identification and recruitment, sample size, and ‘quality’
Effect ^g defined as the result of a comparison between outcomes of each pair of intervention and comparator scenarios. It is specified in terms of an effect descriptor (e.g., odds ratio, risk difference), effect size and confidence intervals.
Risk of bias features ^g are features of the BCI evaluation report and method that may have an impact on the observed effect of a BCI evaluation. These include study design, blinding, method of randomisation etc.
BCI scenario ^g is a scenario (a sequence or development of events) consisting of a BCI, its target behaviours, and factors that influence the outcome of the BCI in relation to the target behaviour (Fig. 2). A BCI scenario may be hypothetical (if it is one that is being considered for modelling purposes), planned (if it is one that is or has been intended), or realised (if it has been enacted, for example in a BCI evaluation). When annotating BCI evaluation reports (see below) the aim is to capture the realised BCI scenarios based on information from the reports. When querying the knowledge base (see below) the aim will be to present features of a planned or hypothetical BCI scenario with a view to obtaining a prediction of the likely outcome.
Outcome (behaviour) ^g defined as the type(s) of behaviour that the BCI seeks to change (e.g., tobacco smoking) together with a collection of attributes (e.g., duration, frequency or incidence) that together make specific types of outcome measure (e.g., self-report of not smoking for 6 months supported by a salivary cotinine concentration of less than 15 ng/ml measured at the final follow up point) [17].
Intervention ^g defined as a set of types of policies, activities, services or products that are intended to result in a specified outcome in relation to the target behaviour. The intervention is specified in terms of summary descriptors (e.g., ‘brief opportunistic advice from a GP on smoking’) together with detailed descriptions of ‘content’^g such as the techniques used (e.g., pharmacological support, verbal persuasion about capability etc.), and ‘delivery’^g (e.g., 5 min, single session, verbal, face-to-face, during a routine consultation, by GP, trained with UK National Centre for Smoking Cessation Very Brief Advice online course). The term ‘intervention’ is also used to refer to any comparator in a BCI evaluation (e.g., usual care).
Context ^g defined as factors (consisting of characteristics of the population and setting) not directly connected with the intervention that may influence the intervention’s effect.
Exposure ^g defined as factors relating to the interaction between the intervention and the target population (the extent and nature of the target population’s access to and engagement with the intervention) that may influence the intervention’s effect. Consists of reach^g (e.g., the proportion of the target population that has access to, or is exposed to, the intervention) and engagement^g (e.g., the extent and nature of the target population’s interaction with intervention components).
Mechanism of action ^g defined as the type(s) of process by which interventions influence the target behaviour (e.g., through increasing strength and frequency of feelings of concern about the risks of an unhealthy behaviour; providing a physical or social cue to action).
Outcome (behaviour) value ^g defined as the value attaching to the target behaviour for a given BCI scenario (e.g., the outcome would be 15% of the population where the target behaviour was six months of continuous abstinence from smoking).

The entities in the BCI scenario interact in specific ways, as showed by the arrows in Fig. 2. The content and delivery of an intervention influences the target behaviour through one or more mechanisms of action. The context moderates the influence of 1) the intervention on the mechanism of action and 2) the mechanism of action on the behaviour. Exposure moderates the influence of the intervention on the mechanism of action and is itself influenced by the intervention and context.

Thus if a GP prescribes nicotine replacement therapy (intervention) to smokers interested in stopping (population), as part of a routine consultation in a GP surgery in the UK (context), and 60% of smokers obtain the medication and start the treatment, and 50% take it as prescribed (exposure), this may reduce cigarette cravings (mechanism of action) and so lead to at least 6 months of abstinence (outcome behaviour) in 15% (outcome value) of cases [18].

If one were to conduct a study to assess the effect of GPs prescribing nicotine replacement therapy, this scenario would be compared with a BCI scenario such as GP advice without the offer of a prescription. The comparison would have a number of features relating to study design (e.g., RCT), sample recruitment and selection, sample size, baseline and outcome measures etc. The comparison of outcomes between the two scenarios would constitute the ‘effect’ of the prescription intervention relative to advice without a prescription, expressed in terms of an odds ratio or risk ratio with a corresponding confidence interval. The observed effect would therefore be a function of the features of the intervention and comparator BCI scenarios together with the study methods (Fig. 1).

The role of computer science in the HBCP

Artificial intelligence (AI^g) and machine learning (ML^g) applications have been developed to generate and interrogate large, accumulating knowledge bases using ontological approaches. In the HBCP, building computer programs to extract and process knowledge from text documents at a level that is usable by experts in the domain, requires several elements that can generally be equated with intelligence, such as advanced reading ability and significant domain understanding. In this respect, a computer program performing this task can be thought of as artificially intelligent.

Building computer programs to perform tasks such as recognising patterns in text is usually achieved by applying a technique called statistical learning, where a computer program uses example patterns and examples from a training set to construct a statistical model of how a task should be performed. This model can then be generalised to process new, unseen data thereby performing the desired task with high confidence. The technique is statistical because the computer program uses weightings learned from statistical properties of the training examples - for example - frequencies with which important words appear in text.

Other approaches to artificial intelligence, such as logic-based reasoning have been successful in domains such as robotics and sensor-based systems. Here axioms or rules describe the behaviour of the world allowing a computer program to decide how to respond to inputs. Since the HBCP is concerned with learning patterns from text it is expected that statistical learning, rather than other approaches such as logic-based learning, will be most appropriate.

Artificial intelligence and machine learning have been used successfully, for example, in banking customer service [19], and in areas of medicine [20,21,22]. IBM’s ‘Watson Oncology’ uses AI and ML to extract information from research publications to help clinicians identify appropriate treatment options. Algorithms^g are used for entity recognition, information extraction, semantic query expansion in information retrieval, pattern detection, sentiment analysis, and reasoning [23,24,25,26].

In the HBCP, computer scientists will develop automated processes to annotate BCI evaluation reports in terms of key features defined according to the BCIO. These will populate a database^g structured according to the BCIO. Automated annotation^g will require developing and training ‘natural language processing’ (NLP^g) algorithms and other systems for extracting features from tables and graphs. ML together with reasoning algorithms^g will then be used to synthesise and interpret the findings to answer questions and make predictions about what would be expected in as yet unstudied scenarios^g.

Evidence from studies of human-computer interaction^g (HCI) will inform the development of the user interface through which people will use the system. Different groups of users will have different requirements and concerns, which will be addressed in the way that information is presented, and the functionalities available for interacting with it. Understanding user interaction in this project is particularly important, given the ‘black box’ nature of the knowledge base that people will be querying. Addressing concerns relating to the Knowledge System’s trustworthiness, and how the reliability of its predictions can be evidenced, are likely to be particularly important.

Aim and research questions

The aim is to develop and evaluate the first generation of a BCI Knowledge System consisting of: the first version of the BCIO; a continually growing database of annotated BCI evaluation reports and inferences drawn from these; algorithms used to create the annotations and draw inferences; and an interface that will allow human users and other computer systems to query and update the database of annotations and inferences. Fig. 3 shows the main components of the BCI Knowledge System that is proposed and how they interact.

The main research questions fall into two categories: (1) those relating to creation of the BCI Knowledge System (the BCIO, the database of annotated BCI evaluation reports, the automated feature extraction algorithms used to annotate these reports, the ML and reasoning algorithms used to synthesise the evidence and draw inferences, stored inferences, and the interface), and (2) those relating to evaluation of the BCI Knowledge System.

1. Creating the BCI knowledge system

i.
What are the key features that need to be captured from BCI evaluation reports and models of behaviour change to build the BCIO? In particular, how should we represent: i) the content and delivery of interventions and comparators; ii) exposure to interventions and comparators in terms of reach (whether the intervention/comparator reached the sample studied) and how far and in what ways the targeted population engaged with the intervention and comparator; iii) targeted behaviours in terms of type of behaviour, duration and specific outcome measures; iv) contexts in terms of the target populations and settings; v) putative mechanisms of action of the intervention, vi) outcomes and effects in terms of the statistical estimate used (e.g. rate ratio) and confidence intervals, vii) study methods and reporting features, including those that influence the weight that should be given to the evaluation and the risk of bias.
ii.
What automated feature extraction algorithms (i.e., combinations and extensions of NLP components) can be developed and trained to extract relevant information from BCI evaluation reports in order to create the database of annotated reports?
iii.
What ML and reasoning algorithms can be developed to synthesise evidence using the database of annotated reports and the BCIO to arrive at i) inferences regarding BCI effectiveness and ii) confidence estimates associated with those inferences?
iv.
What are the key features of a user interface that make it easy to use and provide answers that are understood and trusted?

2. Evaluating the output

i.
What is the inter-rater reliability of the manual annotation system for the BCIO?
ii.
What is the accuracy of the automated feature extraction system in annotating BCI evaluation reports?
iii.
What is the accuracy of the predictions and associated confidence estimates generated by the ML and reasoning algorithms?
iv.
How far does the BCI Knowledge System add value over existing methods of evidence synthesis? For example, can automated reviews produced by the system improve upon systematic reviews conducted by humans (and if so, by how much)?
v.
What are users’ assessments of the system’s accuracy, salience, validity, and utility?
vi.
What new insights about behaviour change are generated by the system?
vii.
How can information be conveyed most effectively and efficiently between the BCI Knowledge System and users of different types (e.g. scientists, expert users, practitioners, policy makers)?

Methods

Overview

Six sets of activities will be undertaken, much of the work being conducted in parallel: 1) forming and engaging with stakeholder groups; 2) developing the BCIO; 3) annotating BCI evaluations according to the BCIO using manual and automated processes and building the BCI database^g; 4) developing and applying ML and reasoning algorithms to draw inferences in response to queries; 5) developing an interface for users and other applications to query the system and provide feedback that can be used to update the BCI Knowledge System as a whole; and 6) evaluating the BCI Knowledge System and its components.

Details of the methodological approach being taken to BCI Ontology development, manual annotation of BCI evaluation reports and the development of automated annotation algorithms, machine learning and reasoning algorithms are presented in Additional file 1. Methods of working will be made accessible in Open Science Framework [27] as they are updated. Outputs and processes of the HBCP will be made available to potential collaborators who are interested in applying these or conducting complementary projects. We will engage a wide variety of stakeholders in a number of groups to enable engagement across countries, cultures, academic disciplines and behavioural domains. A summary of engagement methods are outlined in Additional file 2.

Development of the HBCP interface

An interface will be developed to facilitate querying and updating the knowledge base, and the BCI Knowledge System as a whole. It will consist of a machine interface and a user interface.

The machine interface will provide the primary means by which BCI reports are added to the database. It will provide a facility by which programs that search and screen reports can feed those that are relevant into the database, ready for annotation. It will also include an application programming interface (API) to allow for other programs to formulate queries and receive responses in machine readable form. The aim is to make the BCI Knowledge System as interoperable as possible with other software that is being, and will be, developed.

The user interface will be a website that will build on the wide range of external perspectives that have fed into the BCIO development and ML components of this work and engagement with a wide range of stakeholders. It will handle three types of scenario:

1.
Users will be able to query the system and obtain results in multiple forms (e.g., lists of individual studies, synthesised data, and inferences from the BCI database). The interface will come in several forms that are tailored for particular groups of users.
2.
HBCP stakeholders will be able to interact with the BCIO, the BCI database, and the individual BCI reports in a flexible way. For example they will be able to propose scenarios specified using a purpose-built syntax and conduct sensitivity analyses in which particular studies are included or excluded. They will need elevated privileges for some tasks (e.g., direct editing of annotated research reports).
3.
Members of the HBCP research team will be able to use the interface to evaluate, develop and refine the BCIO and ML and reasoning algorithms.

Users of the interface will be able to generate queries about BCI scenarios. They will enter fixed or constrained parameters (e.g., the behavioural outcome, the mode of delivery, the target population, the setting, or a range of effect sizes) and interrogate the BCI knowledge base for predicted values of BCIO entities that are left open. Examples of queries are shown in Table 3.

Table 3 Examples of queries from different user groups

Full size table

Because users will vary in their levels of expertise in the topic of the query, the user interface will provide a facility to guide them through the generation of the query so that they arrive at the most useful results. For example, users may start the query at too general a level of abstraction for the Knowledge System to be able to generate meaningful results, or they may not be aware of the importance of particular moderators or intervention components when generating the query. The user interface should be able to draw attention to these issues and prompt users to generate queries that get the most out of the data available.

Users will also be able to use the interface to generate a curated and annotated bibliography of research reports relevant to their query. This may be particularly useful for systematic reviewers who may want to take advantage of the precision with which the system will permit searches to be carried out, but may want to undertake data extraction and synthesis by hand or using a different program.

Evaluation of the BCI knowledge system

The HBCP involves evaluation of BCI Knowledge System as a whole as well as its parts. There will be an ongoing process of evaluation and development throughout the project, but at a certain point it will be necessary to assess to what extent the project has met its objectives, and to provide information to guide future decisions. In accordance with the HBCP research questions, the HBCP will undertake the following assessment:

i.
The adequacy, applicability, and validity of the BCIO. BC experts blind to the specific content of the BCIO will annotate intervention reports to identify all information they consider to be essential. The HBCP team will compare these annotations with the BCIO annotations to identify omissions or incompletely included information and discuss the results with the BC experts.
ii.
Inter-rater reliability of the manual annotation process. The manual annotation will form the basis for training the automated annotator and so it is important that it be as accurate as possible. In the absence of an objective gold standard against which to assess accuracy, assessing inter-rater reliability will provide an index of likely accuracy. This can be achieved using methods similar to those already in place for identifying behaviour change techniques and modes of delivery [28, 29]. This involves calculating reliability statistics for sets of annotations.
iii.
Accuracy of the automated annotator. Predictive accuracy of the automated annotator (i.e., its ability to match the study classifications of the manual annotations) will be assessed throughout the project through accuracy, precision and recall metrics, taking account of the hierarchical structure of the ontology and the inevitable dependency between classifications (e.g., a given outcome classification is highly likely to co-occur with a given intervention).
iv.
Accuracy of predictions from the ML and reasoning algorithms. We will establish manually, by collaborating with behavioural change experts, a set of established effects and associated facts and will test the ML and reasoning algorithms against it by measuring the percentage of predictions that are in agreement.
v.
Comparison of BCI Knowledge System with existing methods of evidence synthesis. We will create automated systematic reviews using the BCI Ontology to select relevant studies in conjunction with user input; use the automated data extraction and study evaluation tools to conduct syntheses and compare the results of this computer-assisted work with published systematic reviews, evaluating the automated reviews in terms of selection (are all the correct studies identified?), descriptive accuracy (are the studies correctly described and risk of bias correctly assessed?), and inferential claims (how do the conclusions compare with those from manually-conducted systematic reviews?)
vi.
User evaluation of the BCI Knowledge System’s accuracy, salience, validity, and utility. Initially for domains with simple behaviours, robust outcome measures and relatively coherent evidence, we will use an International Organisation for Standardisation (ISO)-based evaluation framework [30] to evaluate the utility of the system as a whole. We will engage a range of decision-makers (e.g. practitioners, local government officers and national policymakers) and assess the extent to which the system is able to generate knowledge that addresses specific decisions.
vii.
New insights about behaviour change that are generated by the system. We will assess the extent to which the system generates novel hypotheses and improved understanding of mechanisms of action.

Discussion

The HBCP is an ambitious project aimed at developing and evaluating the first generation of a BCI Knowledge System. This will consist of a BCI Ontology, a set of processes and resources for manually annotating BCI evaluation reports according to this ontology to populate a BCI database, an automated annotator to achieve the annotation at scale with an acceptable level of accuracy for further populating the BCI database, a set of ML and reasoning algorithms to draw inferences from the BCI database, and an interface to allow users and other computer programs and to query and input to the knowledge base.

The first generation of the BCI Knowledge System will focus on synthesising and interpreting evidence from smoking cessation intervention evaluations in Cochrane reviews. The ontology will draw on established ontologies in related domains and be part of the OBO Foundry to maximise interoperability with other ontologies. An international network of stakeholders will be established to bring key experts and users into the development, evaluation and dissemination process. The BCI Knowledge System and its parts will undergo ongoing evaluation to inform its development and summative evaluation towards the end of the project to assess how far the project objectives have been met. It is hoped that the HBCP will represent the start of a new phase in behavioural and implementation science in which much more efficient use is made of the burgeoning research literature both for theory development and practical applications.

Abbreviations

AI:: Artificial intelligence
API:: Application programming interface
BCI:: Behaviour change intervention
BCIO:: Behaviour change intervention ontology
BCT:: Behaviour change technique
HBCP:: Human behaviour change project
HCI:: Human-computer interface
ML:: Machine Learning
NLP:: Natural Language Processing

References

Glasziou P, Altman DG, Bossuyt P, Boutron I, Clarke M, Julious S, Michie S, Moher D, Wager E. Reducing waste from incomplete or unusable reports of biomedical research. Lancet. 2014;18;383(9913):267–76.
Article PubMed Google Scholar
The Lancet. www.thelancet.com/campaigns/efficiency/statement. Accessed 21 July 2017.
Elliott JH, Turner T, Clavisi O, et al. Living Systematic Reviews: An Emerging Opportunity to Narrow the Evidence-Practice Gap. PLoS Med. 2014;11(2):e1001603.
Article PubMed PubMed Central Google Scholar
Bragge P, Clavisi O, Turner T, Tavender E, Collie A, Gruen RL. The Global Evidence Mapping Initiative: scoping research in broad topic areas. BMC Med Res Methodol. 2011;11:92.
Article PubMed PubMed Central Google Scholar
Takwoingi Y, Hopewell S, Tovey D, Sutton AJ. A multicomponent decision tool for prioritising the updating of systematic reviews. BMJ. 2013;347:f7191.
Article PubMed Google Scholar
Equator Network. www.equator-network.org. Accessed 21 July 2017.
Ioannidis JP, Greenland S, Hlatky MA, et al. Increasing value and reducing waste in research design, conduct, and analysis. Lancet. 2014;383(9912):166–75.
Article PubMed PubMed Central Google Scholar
Caldwell DM, Welton NJ. Approaches for synthesising complex mental health interventions in meta-analysis. Evid Based Ment Health. 2016;19(1):16–21.
Article PubMed Google Scholar
The Human Behaviour-Change Project. www.humanbehaviourchange.org. Accessed 21 July 2017.
Arp R, Smith B, Spear AD. Building ontologies with basic formal ontology. Cambridge: MIT Press; 2015.
Book Google Scholar
Larsen KR, Michie S, Hekler EB, Gibson B, Spruijt-Metz D, Ahern D, Cole-Lewis H, Bartlett Ellis RJ, Hesse B, Moser RP, Yi J. Behavior change interventions: the potential of ontologies for advancing science and practice. J Beh Med. 2016;40(1):6–22.
Article Google Scholar
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.
Article CAS PubMed PubMed Central Google Scholar
OBO Foundry. www.obofoundry.org. Accessed 21 July 2017.
Okhmatovskaia A, Shaban-Nejad A, Lavigne M, Buckeridge DL. Addressing the challenge of encoding causal epidemiological knowledge in formal ontologies: a practical perspective. Stud Health Technol Inform. 2014;205:1125–9.
PubMed Google Scholar
Hastings J, Ceusters W, Smith B, Mulligan K. The Emotion Ontology: Enabling Interdisciplinary Research in the Affective Sciences. Modeling and Using Context: 7th International and Interdisciplinary Conference, CONTEXT 2011, Karlsruhe, Germany; 2011: Berlin.
Hastings J, Smith B, Ceusters W, Jensen M, Mulligan K. Representing mental functioning: Ontologies for mental health and disease. ICBO 2012: 3rd International Conference on Biomedical Ontology; Citeseer; 2012.
Google Scholar
West R, Hajek P, Stead L, Stapleton J. Outcome criteria in smoking cessation trials: proposal for a common standard. Addiction. 2005;100(3):299–303.
Article PubMed Google Scholar
West R, Raw M, McNeill A, Stead L, Aveyard P, Britton J, Stapleton J, McRobbie H, Pokhrel S, Lester-George A, Borland R. Health-care interventions to promote and assist tobacco cessation: a review of efficacy, effectiveness and affordability for use in national guideline development. Addiction. 2015;110(9):1388–403.
Article PubMed PubMed Central Google Scholar
CNBC. http://www.cnbc.com/2014/06/10/you-may-soon-get-financial-advice-from-a-machine.html. Accessed 21 July 2017.
Wired. http://www.wired.co.uk/news/archive/2013-02/11/ibm-watson-medical-doctor. Accessed 21 July 2017.
Hood L, Flores M. A personal view on systems medicine and the emergence of proactive P4 medicine: predictive, preventive, personalized and participatory. New Biotechnol. 2012;29(6):613–24.
Article CAS Google Scholar
Shaikh AR, Butte AJ, Schully SD, Dalton WS, Khoury MJ, Hesse BW. Collaborative biomedicine in the age of big data: the case of cancer. J Med Internet Res. 2014;16(4):e101.
Article PubMed PubMed Central Google Scholar
Lassoued Y, Deleris L. Thesaurus-Based Hierarchical Semantic Grouping of Medical Terms in Information Extraction. Stud Health Technol Inform. 2016;228:446–560.
PubMed Google Scholar
Deleris L, Deparis S, Sacaleanu B, Tounsi L. Risk Information Extraction and Aggregation. Algorithmic Decision Theory: Third International Conference, ADT 2013. Bruxelles: Springer; 2013.
Google Scholar
High R, Rapp B. Transforming the Way Organizations Think with Cognitive Systems. IBM Academy of Technology: IBM RedBooks; 2012.
Deleris L, Jochim C. Probability Statements Extraction with Constrained Conditional Random Fields. Stud Health Technol Inform. 2016;228:527–31.
PubMed Google Scholar
Open Science Framework. https://osf.io/. Accessed 21 July 2017.
Michie S, Carey RN, Johnston M, Rothman AJ, De Bruin M, Kelly MP, Connell L. From Theory-Inspired to Theory-Based Interventions: A Protocol for Developing and Testing a Methodology for Linking Behaviour Change Techniques to Theoretical Mechanisms of Action. Ann Behav Med. 2016: 1–12.
Michie S, Wood C, Johnston M, Abraham C, Francis J, Hardeman W. Behaviour Change Techniques: The Development and Evaluation of a Taxonomic Method for Reporting and Describing Behaviour Change Interventions. Health Technol Assess. 2015;19(99):1–188.
Article Google Scholar
King M. General principles of user-oriented evaluation. In: Dybkjær L, Hemsen H, Minke W, editors. Evaluation of text and speech systems. New York: Springer; 2007. p. 125–61.
Chapter Google Scholar
Nilsson NJ. Principles of artificial intelligence. Palo Alto: Morgan Kauffman; 2014.
Google Scholar
Davis RE, Campbell R, Hildon Z, Hobbs L, Michie S. Theories of behaviour and behaviour change across the social and behavioural sciences: a scoping review. Health Psychol Rev. 2015;9:323–34.
Article PubMed Google Scholar
West R, Michie S. A guide to development and evaluation of digital behaviour change interventions in healthcare. London: Silverback Publishing; 2015.
Google Scholar
Michie S, Johnston M, Carey R. Behaviour change techniques. In: Gellman M, Turner JR, editors. Encyclopedia of behavioural medicine. New York: Springer; 2016. p. 1–8.
Google Scholar
Michie S, Richardson M, Johnston M, Abraham C, Francis J, Hardeman W, Eccles MP, Cane J, Wood CE. The behaviour change technique taxonomy (v1) of 93 hierarchically clustered techniques: building an international consensus for the reporting of behaviour change interventions. Ann Beh Med. 2013;46:86–95.
Article Google Scholar
Cochrane Collaboration. http://uk.cochrane.org/about-us. Accessed 21 July 2017.
Alpaydin E. Introduction to machine learning. Cambridge: MIT Press; 2014.
Chowdhury G. Natural language processing. Ann Rev Info Sci Tech. 2003;37:51–89.
Article Google Scholar
Allemang D, Hendler J. Semantic web for the working ontologist. 2nd ed. Waltham: Morgan Kaufmann; 2011.
Google Scholar
Cochrane Collaboration. http://linkeddata.cochrane.org/pico-ontology. Accessed 21 July 2017.
Cochrane Collaboration. http://handbook.cochrane.org/front_page.htm. Accessed 21 July 2017.
Stravi Z, Michie S. Classification systems in behavioural sciences; Current systems and lessons from the natural, medical and social sciences. Health Psychol Rev. 2012;6113–140.
Shneidermann B. Designing the user interface: strategies for effective human-computer interaction. New York: Pearson Education; 2010.
Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

The project is funded by a Wellcome Trust collaborative award [The Human Behaviour-Change Project: Building the science of behaviour change for complex intervention development’, 201,524/Z/16/Z]. During the preparation of the manuscript RW’s salary was funded by Cancer Research UK.

Availability of data and materials

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

Author information

Authors and Affiliations

UCL Centre for Behaviour Change, University College London, 1-19 Torrington Place, London, WC1E 7HB, UK
Susan Michie, Ailbhe N. Finnerty, Marta M. Marques & Emma Norris
EPPI-Centre, Department of Social Science, University College London, London, UK
James Thomas & Alison O’Mara-Eves
Health Psychology, University of Aberdeen, Scotland, UK
Marie Johnston
IBM Research – Ireland, Dublin, Ireland
Pol Mac Aonghusa & Léa A. Deleris
Department of Computer Science, UCL, London, UK
John Shawe-Taylor
Primary Care Unit, Institute of Public Health, University of Cambridge, Cambridge, UK
Michael P. Kelly
Department of Epidemiology and Public Health, University College London, London, UK
Robert West

Authors

Susan Michie
View author publications
You can also search for this author in PubMed Google Scholar
James Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Marie Johnston
View author publications
You can also search for this author in PubMed Google Scholar
Pol Mac Aonghusa
View author publications
You can also search for this author in PubMed Google Scholar
John Shawe-Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Michael P. Kelly
View author publications
You can also search for this author in PubMed Google Scholar
Léa A. Deleris
View author publications
You can also search for this author in PubMed Google Scholar
Ailbhe N. Finnerty
View author publications
You can also search for this author in PubMed Google Scholar
Marta M. Marques
View author publications
You can also search for this author in PubMed Google Scholar
Emma Norris
View author publications
You can also search for this author in PubMed Google Scholar
Alison O’Mara-Eves
View author publications
You can also search for this author in PubMed Google Scholar
Robert West
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The study was conceived by SM. SM, JT, MJ, RW, MK, PM and JS-T contributed to the design of the study. SM, RW and JT led the drafting of the paper. All authors contributed to the manuscript, commented on its successive drafts and read and approved the final version. SM is the guarantor of the paper.

Corresponding author

Correspondence to Susan Michie.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

SM is Director of UCL’s Centre for Behaviour Change which has received funds from industry and government agencies.

JT directs development of EPPI-Reviewer in collaboration with NICE and other stakeholders. EPPI-Reviewer licences access to a large international user base on a not-for-profit cost-recovery basis.

MJ: None.

PMacA & LD are employees of IBM. IBM provides commercial offerings in healthcare and related domains. IBM Watson technologies have been incorporated in commercial offerings related to healthcare. IBM Research participates in EU funded research programs with aspects related to behaviour change and healthcare.

JS-T is a partner in Realedge Ltd., a company developing machine learning solutions for a variety of applications including optimisation of ambulance services.

MK has received for consultancy funding from Slimming World and the Swiss Olympic Association.

ANF: None.

MM: None.

EN: None.

AOME: None.

RW has undertaken research and consultancy for companies that develop and manufacture smoking cessation medications.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Methodological approach to the development of the BCI Ontology, manual and automated annotation and machine learning and reasoning algorithms. (DOCX 30 kb)

Additional file 2:

Methods for engaging stakeholders in the HBCP. (DOCX 23 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Michie, S., Thomas, J., Johnston, M. et al. The Human Behaviour-Change Project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation. Implementation Sci 12, 121 (2017). https://doi.org/10.1186/s13012-017-0641-5

Download citation

Received: 14 August 2017
Accepted: 28 August 2017
Published: 18 October 2017
DOI: https://doi.org/10.1186/s13012-017-0641-5

The Human Behaviour-Change Project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation

Abstract

Background

Methods

Discussion

Background

The challenges of a rapidly expanding, complex evidence base

The Human Behaviour-Change Project (HBCP)

Organising and classifying research, and generating inferences: The role of ontologies

The role of computer science in the HBCP

Aim and research questions

1. Creating the BCI knowledge system

2. Evaluating the output

Methods

Overview

Development of the HBCP interface

Evaluation of the BCI knowledge system

Discussion

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1:

Additional file 2:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Implementation Science

Contact us