Data on the publicistic corpus of the ideological narrative of modern identity have been opened
The Institute of the Lithuanian Language's data set on the publicistic corpus of the ideological narrative of modern identity has been published on the Open Data Portal of Lithuania.
The data set provides texts, both in full and split into paragraphs, from the Institute of the Lithuanian Language's publicistic corpus of the ideological narrative of modern identity. This corpus is a text database used for linguistic, statistical and sociological analysis of written language.
The data were collected into this database from 2018 to 2021. The corpus contains texts from various months of the pre-war period (1928 and 1930), the Soviet period (1945, 1956–1957, 1962) and the press of restored independent Lithuania (1992 and 1998). Further updates to the data are possible in the future, when the Institute of the Lithuanian Language carries out additional data collection projects.
The State Data Agency (Statistics Lithuania) is implementing the project "Integration of State Information Resources into the Data Lake", which is financed by the Economic Recovery and Resilience Facility Plan "New Generation Lithuania" and the State budget of Lithuania.
