Bielik7B v0.1: Polish language model – development, insights, and evaluation

Ociepa, Krzysztof; Flis, Łukasz; Wróbel, Krzysztof; Gwoździej, Adrian; Kinas, Remigiusz

doi:https://doi.org/10.7494/csci.2025.26.4.7689

Article

Bielik7B v0.1: Polish language model – development, insights, and evaluation

creativeworkseries.issn	1508-2806
dc.contributor.author	Ociepa, Krzysztof
dc.contributor.author	Flis, Łukasz
dc.contributor.author	Wróbel, Krzysztof
dc.contributor.author	Gwoździej, Adrian
dc.contributor.author	Kinas, Remigiusz
dc.date.issued	2025
dc.description.abstract	We introduce Bielik 7B v0.1 – a seven-billion-parameter generative text model for Polish language processing. Trained on curated Polish corpora, this model addresses key challenges in language model development through innovative techniques; these include Weighted Instruction Cross-Entropy Loss (which balances the learning of different instruction types) and Adaptive Learning Rate (which dynamically adjusts the learning rate based on training progress). To evaluate performance, we created the Open PL LLM Leaderboard and Polish MT-Bench – novel frameworks assessing various NLP tasks and conversational abilities. Bielik 7B v0.1 demonstrates significant improvements, achieving a ninepercentage- point increase in its average score compared to Mistral-7B-v0.1 on the RAG Reader task. It also excels in the Polish MT-Bench – particularly in the Reasoning (6.15/10) and Role-playing (7.83/10) categories. This model represents a substantial advancement in Polish language AI, offering a powerful tool for diverse linguistic applications and setting new benchmarks in the field.	en
dc.description.placeOfPublication	Kraków
dc.description.version	wersja wydawnicza
dc.identifier.doi	https://doi.org/10.7494/csci.2025.26.4.7689
dc.identifier.eissn	2300-7036
dc.identifier.issn	1508-2806
dc.identifier.uri	https://repo.agh.edu.pl/handle/AGH/117700
dc.language.iso	eng
dc.publisher	Wydawnictwa AGH
dc.relation.ispartof	Computer Science
dc.rights	Attribution 4.0 International
dc.rights.access	otwarty dostęp
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/legalcode
dc.subject	Polish language model	en
dc.subject	natural language processing	en
dc.subject	transformer architecture	en
dc.subject	language model evaluation	en
dc.subject	instruction tuning	en
dc.title	Bielik7B v0.1: Polish language model – development, insights, and evaluation	pl
dc.type	artykuł
dspace.entity.type	Publication
publicationissue.issueNumber	No. 4
publicationissue.pagination	pp. 131–161
publicationvolume.volumeNumber	Vol. 26
relation.isAuthorOfPublication	0f91427b-755d-4a51-a9b6-d984f245c4a6
relation.isAuthorOfPublication	8ebcbe9d-8d39-4e52-b441-9e20c5a65e16
relation.isAuthorOfPublication.latestForDiscovery	0f91427b-755d-4a51-a9b6-d984f245c4a6
relation.isJournalIssueOfPublication	ad13a817-a4f4-49ce-aa26-a74828c46103
relation.isJournalIssueOfPublication.latestForDiscovery	ad13a817-a4f4-49ce-aa26-a74828c46103
relation.isJournalOfPublication	020291ee-249b-4dcf-98a3-276a2f7981aa

Files

Original bundle

Now showing 1 - 1 of 1

Name:: csci.2025.26.4.131.pdf
Size:: 811.51 KB
Format:: Adobe Portable Document Format

Download

Collections

Artykuły (CN-csci)