25-02-2025

State Data Lake: Transparency, Value, and Responsibility

LT_00_Fichier-de-reproduction-Next-Generation-EU_POS_POS.pngNKL-Logo_Horizontal-RGB.png

 

The State Data Agency (Statistics Lithuania) is the key institution responsible for processing, integrating, managing and opening state information resources. Our goal is to ensure that data is used transparently, responsibly, and in a way that creates real value for society.

“We are aware of the doubts and questions about how state data is managed, where and why it is used in certain ways. We want to emphasize that the fundamental principles of our activities and the overall state data policy are transparency and responsibility”, notes Dr Jūratė Petrauskienė, Director General of the State Data Agency.

Broad Data Integration – Greater Transparency and Benefits

So far, the State Data Agency has integrated nearly 500 state information resources into the State Data Lake, including key national registries, information systems, state and municipal enterprises, schools, hospitals, and other sources. Based on these resources, more than 900 data tables have been made publicly available. These data have enabled the creation of analytical spaces for addressing governance challenges, where more than 100 different institutions are actively working.

To ensure greater transparency, we are introducing new tools that allow both the public and institutions to clearly see how state data is processed, managed and utilized:

  • “Use of Sources” – provides insight into which information systems are integrated, what data is used, which tables are made available, and in which analytical spaces analyses are conducted.
  • “Data Flows Between Institutions” – visually represents how institutions exchange data and how it is used for the development and improvement of public services.

These tools help ensure that every instance of data usage is transparent and justified.

Responsible Data Management and Protection

The State Data Agency ensures that all instances of data usage are documented and publicly accessible. Our goal is to enable institutions to work efficiently with data while ensuring full compliance with legal and privacy protection requirements.

“It is important to note that when data is transferred to another institution, it is safeguarded by minimizing its scope to the level necessary for completing the institution’s task. Institutions work with data in secure analytical spaces that we have developed, and access to these spaces is granted only to a limited number of designated specialists working on a specific task”, explains Vainius Indilas, Project Manager of the State Data Governance Group.

Active Data Usage – Driving State Progress

We observe that institutions are increasingly taking advantage of the opportunities we provide –analytical spaces and open data sets. This enables them to make informed decisions faster, manage resources more efficiently, and improve public services.

However, we recognize that some institutions still lack data analysis competencies. That is why we not only open up data but also provide consultations, help delineate and refine task definitions, as well as assist in identifying the most effective ways to utilize data.

“Our tools dispel the myth that data can be processed non-transparently or that the State Data Agency is ‘doing who knows what with this vast amount of data’. Everything is very clear: how data is exchanged at the interinstitutional level, what the Agency holds, how it is used, what is made available, and what else can potentially be opened. There are no secret missions. We fully understand that state data is not just a source of information but also a tool for driving positive change. We can assure you that data is processed responsibly and transparently”, says Dr J. Petrauskienė.

We will continue working to ensure that everyone who uses state data, from institutions to businesses and academia, can do so transparently, responsibly, and effectively.