Data Science Institute

We aim to set the global standard for a truly interdisciplinary approach to contemporary data-driven research challenges. Established in 2015, the Data Science Institute (DSI) has over 300 members and has raised £50 million in research grants.

An abstract diagram of networks

Linked icons

10-year anniversary of DSI – “Decade of Data Science”

In 2025, the Data Science Institute (DSI) at Lancaster University proudly marks its 10th anniversary. Since its founding in 2015, the DSI has established itself as a leading hub for cutting-edge research, interdisciplinary collaboration, and real-world impact in data science and artificial intelligence. Over the past decade, our researchers and partners have tackled some of the most pressing challenges in society, science, and industry—advancing the foundations of data science, fostering ethical and trustworthy AI, driving innovation across sectors and training 100s of data science practitioners.

As we celebrate this milestone, we reflect on the achievements of our vibrant research community and the transformative projects that have shaped the field. Looking ahead, the DSI remains committed to pushing the boundaries of data science and AI research, strengthening global collaborations, and supporting the next generation of data scientists.

About us

We are working to create a world-class Data Science Institute at Lancaster (DSI@Lancaster) that sets the global standard for a truly interdisciplinary approach to contemporary data-driven research challenges. DSI@Lancaster aims to have an internationally recognised and distinctive strength in being able to provide an end-to-end interdisciplinary research capability - from infrastructure and fundamentals through to globally relevant problem domains and the social, legal and ethical issues raised by the use of Data Science.

The Institute is initially focusing on the fundamentals of Data Science including security and privacy together with cross-cutting theme areas consisting of environment, resilience and sustainability;health and ageing, data and society and creating a world-leading institute with over 300 affiliated academics, researchers, and students.

Our data science, health data science and business analytics programmes have launched the careers of hundreds of data professionals over the last 10 years. Students from our programmes have progressed to data science roles at Amazon, PWC, Ernst & Young, Hawaiian Airlines, eBay, Zurich Insurance, the Co-operative Group, N Brown, the NHS and many others - please look at our Education pages for further details of the courses on offer.

Decade of Data motif

Latest News

New Health Theme Lead announcement

Professor Neil Reeves has been appointed Health Theme Lead in the Data Science Institute. Neil brings internationally recognised expertise in secure digital health technologies and diabetes, a track record of ~180 publications and major EPSRC, NIH (USA) and Diabetes UK funding. Serving on EPSRC’s Healthcare Technologies Strategic Advisory Team (to March 2028) and the BSI medical devices committee, he will drive interdisciplinary collaboration, secure use of large health datasets (including NHS secure data environments), and capacity-building across the University’s health data science portfolio.

We warmly thank Professor Heather Brown for her impactful leadership of the Health Theme since September 2022. Heather as the theme lead for health has been a champion for cross-faculty working. Helping to build collaborations between FASS and FHM demonstrated by an ADR fellowship for an ECR based in sociology, successful funding applications to the NIHR with LEC and supporting a number of workshops.

Data Dialogues - Autumn 2025

We would like your suggestions for speakers for Autumn 2025 - please get in touch if you would like to present or have a nomination to make!

Data Dialogues is an informal, discussion-driven event where members of the DSI and the broader university community share insights into their work, spark interdisciplinary conversations and explore potential collaborations. The focus is on interactive engagement rather than formal presentations—so no slides (or just a few, if needed)! Instead, the idea is to introduce your work in an accessible way, followed by an open discussion and Q&A with attendees.

Get fresh perspectives and think about new ways of approaching your own research, meet new people and explore potential research collaborations. Come be part of the DSI community!

blank

Events

STOR-i and DSI invite you to a talk - 1st October at 2pm in Sky Lounge

Paul H Taylor, School of Earth and Oceans - The University of Western Australia

WATER WAVE IMPACTS AND HYPERSONICS - APPLICATIONS OF NEWTON’S MISTAKE

Why are violent wave impacts, on prismatic and cylindrical bodies on the deck of an FPSO or a container ship, or an oil platform with wave-in-deck, or violent tsunami wave impacts, all like the Apollo space capsule during re-entry or the X-15 rocket plane in hypersonic flight? Each of these flow-structure interaction problems occurs at high Froude or Mach number. Hence, the fluid dynamics is dominated by the appearance of violently projected fluid sheets or strong shocks and fast flows across the surface of the body. In this regime, the shallow water equations resemble those of high-speed compressible flow, and Newton’s mistake - the Newtonian or corpuscular flow model - is directly applicable. For wave impacts this is confirmed by extensive wave flume testing and CFD using OpenFOAM. Despite some inevitable limitations, this simple Newtonian theory provides a pen-and-paper force prediction tool for practical use in design.

Join via TEAMS here

DSI Data Engineering – Funding Application Workshop Day - Wednesday 8th October 9.30 - 4pm

In collaboration with the Library’s Open Research team, the Data Engineering theme is hosting a one-day funding application workshop. The day will offer a crash course in all the application essentials from proposals, data management plans, to data ethics and how to embed sustainable practices in your projects. Through a mixture of talks and hands-on workshops, the day will provide researchers with practice advice for applying for large grants for data-driven projects from the UK’s major funding bodies such as UKRI research councils and the Wellcome Trust. Brining together support from Research Services, the Library, and the Reimagining Research Practices project, this workshop would be particularly suited to those seeking to those applying for a large grant for the first time. There will also be opportunities to network with colleagues across the university in the field of data science and AI.

Sign up via the Eventbrite Link

Speaker details will be added soon

  • Welcome, 9.30 – 9.45
  • Project proposals and research design, 9.45 – 10.45 (60 mins)
  • Break, 10.45 – 11.00
  • Data Management Plans and reproducibility, 11.00 – 12.00 (60 mins)
  • Lunch, 12.00 - 13.00
  • Trusted Research and Data Ethics, 13.00 – 14.00 (60 mins)
  • Break, 14.00 – 14.10
  • Cuppa Conundrums: AI ethics and sustainability workshop, 14.10 – 15.10 (60 mins) [relaxed discussion workshop with tea/coffee]
  • Embedding sustainability, engagement and EDI, 15.10 – 16.00 (50 mins)

This event will take place in the Event Space at The Library

Refreshments will be provided. Let us know your dietary requirements via the Eventbrite sign up order form.

data engineering image

Ecology and AI Workshop - 13th October LEC Training Room - 9.30 - 4.30

SOLD OUT

Are you interested in research at the interface of ecology and AI? Please read on…

With our strong LEC-based ecology grouping, UKCEH (through CEEDS), the establishment of MARS (Mathematics for AI in Real-World Systems), and DSI, we have critical mass to create a hub at Lancaster around AI for Ecology (and adjacent fields).

To create a new interdisciplinary hub, we will run a 1-day workshop on Monday 13th October 09.30-16.30 in the LEC Training Rooms on “Harnessing AI to Accelerate Solutions for Biodiversity in Crisis”.

The hub will help us to develop new interdisciplinary projects (or find solutions to old ones), target new funding opportunities, and capitalise on excitement in this area within the student community with a visible external presence.

Please sign up on Eventbrite to be on the list for further information. We are very keen to involve researchers across disciplines and career stages and particularly encourage early career researchers (PhDs and upwards) to join.

Refreshments and lunch will be provided. The workshop will have a professional facilitator and is a collaboration between MARS, DSI and CEEDS, with funding from BBSRC AIBIO-UK.

Please sign up on Eventbrite to be on the list for further information.

Questions? Contact Julia Carradus j.carradus1@lancaster.ac.uk

Ai and the Environment

Björn Andersson is a professor at the Centre for Educational Measurement, University of Oslo (CEMO), Norway - 16th October at 2pm

Title: Joint latent variable modelling of binary, ordinal, count, and continuous data for social science and psychological research

Abstract: Research in the social sciences increasingly utilizes mixed data types, such as a combination of ordinal item scores, continuous response times, and discrete variables based on the response process. Generalized linear latent variable models (GLLVMs) fitted to such data can be used to infer relationships between multiple latent constructs and their development over time. These models are often high-dimensional and require efficient estimation methods. We propose an estimator for GLLVMs based on maximizing an approximation to the marginal likelihood and discuss the finite sample and large sample properties of the estimator. The modelling approach is illustrated through joint modelling of response times, action counts, and item scores, where we examine the impact on measurement precision of the proficiency estimates when including multiple types of variables. Extensions of the modelling approach to longitudinal data analysis and joint modelling with non-ignorable missing data are also discussed.

16th October at 2pm in the Post Grad Stats Centre Lecture Theatre at 2pm

Sign up via Eventbrite

Biography

Björn Andersson is a professor at the Centre for Educational Measurement, University of Oslo (CEMO), Norway. He obtained his Ph.D. in statistics from Uppsala University in 2014 and has worked as a post-doctoral researcher (2015-2017) at the Collaborative Innovation Center of Assessment towards Basic Education Quality, Beijing Normal University in Beijing, China. His research interests include estimation methods for latent variable models, methods to ensure comparability of test scores in applied measurement and applications of item response theory in education, mental health, and psycholog

Unlocking the power of data science and ai for environmental research - 24th October, 11am - Sky Lounge

Date: 24th October

Location: Sky Lounge, Infolab 21, Lancaster University

Time: 11:00-14:00

Free event plus pizza lunch!

Are you an early career researcher (PhDs, postdocs and more) at Lancaster Uni or UKCEH working in ecology, environmental science or conservation? Whether you’re already using data science (includes AI and machine learning) or eager to explore new ways it can advance your research, this interactive session is for you.

Join us to:

  • Collaborate on a real-world environmental data challenge in a friendly, engaging environment.
  • Explore fresh approaches and tools to elevate your data skills.
  • Connect with fellow PhDs, postdocs and ECRs from across disciplines.
  • Learn about the Data Science Institute (DSI)and the Centre of Excellence for Environmental Data Science (CEEDS)* at Lancaster University — vibrant communities offering:
    • Access to seminars, workshops, and networking events
    • Funding opportunities and collaborative projects
    • Training and support tailored to environmental data science

Whether your focus is on the biodiversity of micro-organisms or global atmospheric dynamics, this event will inspire new ideas and connections.

Whatever your current level of data science expertise, come join us and see how DSI and CEEDS can support your research journey! If nothing else, you get free pizza.

* CEEDS is a joint centre that aims to build collaboration between UKCEH and Lancaster University. Funding is available via CEEDS to support travel for UKCEH participants not based at Lancaster, and will be allocated on a first-come, first-served basis.

Reserve your spot now

Questions? Contact Julia Carradus j.carradus1@lancaster.ac.uk

Cybernetic Culture Workshop Logo

Research Themes

Data Science at Lancaster was founded in 2015 on Lancaster’s historic research strengths in Computer Science, Statistics and Operational Research. The environment is further enriched by a broad community of data-driven researchers in a variety of other disciplines including the environmental sciences, health and medicine, sociology and the creative arts.

  • Foundations

    Foundations research sits at the interface of methods and application: with an aim to develop novel methodology inspired by the real-world challenge. These could be studies about the transportation of people, goods & services, energy consumption and the impact of changes to global weather patterns.

  • Health

    The Health theme has a wide scope. Current areas of strength include spatial and spatiotemporal methods in global public health, design and analysis of clinical trials, epidemic forecasting and demographic modelling, health informatics and genetics.

  • Society

    Data Science has brought new approaches to understanding long-standing social problems concerning energy use, climate change, crime, migration, the knowledge economy, ecologies of media, design and communication in everyday life, or the distribution of wealth in financialised economies.

  • Environment

    The focus of the environment theme has been to seek methodological innovations that can transform our understanding and management of the natural environment. Data Science will help us understand how the environment has evolved to its current state and how it might change in the future.

  • Data Engineering

    The Data Engineering theme aims to explore how we can utilise digital technologies to accelerate and enhance our research processes across the University.

Research Software Engineering

Within the Data Science Institute, our aim is to improve the reproducibility and replicability of research by improving the reusability, sustainability and quality of research software developed across the University. We are currently funded by the N8CIR, and work closely with our partner institutions across N8 Research.

Research Software Engineering

Upcoming Events