<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>04595ntm a22005777i 4500</leader>
  <controlfield tag="001">000723868</controlfield>
  <controlfield tag="003">CZ-PrVSE</controlfield>
  <controlfield tag="005">20250629141137.0</controlfield>
  <controlfield tag="006">m        d</controlfield>
  <controlfield tag="007">cr n||||||||||</controlfield>
  <controlfield tag="008">250629s2025    xr     fsbm   000 0 eng d</controlfield>
  <datafield tag="STA" ind1=" " ind2=" ">
   <subfield code="a">NEZPRACOVANÝ IMPORT</subfield>
  </datafield>
  <datafield tag="040" ind1=" " ind2=" ">
   <subfield code="a">ABA006</subfield>
   <subfield code="b">cze</subfield>
   <subfield code="c">ABA006</subfield>
   <subfield code="d">ABA006</subfield>
   <subfield code="e">rda</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
   <subfield code="a">Pivoda, Šimon</subfield>
   <subfield code="%">ISIS:170048</subfield>
   <subfield code="4">dis</subfield>
  </datafield>
  <datafield tag="242" ind1="1" ind2="0">
   <subfield code="a">Inovace velkých jazykových modelů (LLM) pomocí doladěné metody Retrieval-Augmented Generation (RAG) :</subfield>
   <subfield code="b">Návrh a vyhodnocení</subfield>
   <subfield code="y">eng</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
   <subfield code="a">Innovating Large Language Models (LLMs) with Fine-Tuned Retrieval-Augmented Generation (RAG) :</subfield>
   <subfield code="b">Design and Evaluation</subfield>
   <subfield code="c">Šimon Pivoda</subfield>
  </datafield>
  <datafield tag="264" ind1=" " ind2="0">
   <subfield code="c">2025</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
   <subfield code="a">?? stran :</subfield>
   <subfield code="3">digital, PDF soubor</subfield>
  </datafield>
  <datafield tag="500" ind1=" " ind2=" ">
   <subfield code="a">Vedoucí práce: Richard Antonín Novák</subfield>
  </datafield>
  <datafield tag="502" ind1=" " ind2=" ">
   <subfield code="a">Bakalářská práce (Bc.)—Vysoká škola ekonomická v Praze. Fakulta informatiky a statistiky, 2025</subfield>
  </datafield>
  <datafield tag="504" ind1=" " ind2=" ">
   <subfield code="a">Obsahuje bibliografii</subfield>
  </datafield>
  <datafield tag="516" ind1=" " ind2=" ">
   <subfield code="a">Textový (vysokoškolská kvalifikační práce)</subfield>
  </datafield>
  <datafield tag="518" ind1=" " ind2=" ">
   <subfield code="a">Rok obhajoby 2025</subfield>
  </datafield>
  <datafield tag="520" ind1="3" ind2=" ">
   <subfield code="a">Artificial intelligence (AI) is reshaping the way organizations manage and interact with information. Its influence is visible not only in daily operations but also in critical strategic decision-making. In recent years, an innovative approach known as Retrieval-Augmented Generation (RAG) has emerged, extending the capabilities of large language models (LLMs) by providing access to relevant and up-to-date data sources. This enables companies to better leverage their internal knowledge in real time and apply AI across a wide range of use cases. However, despite the fast advancements in large language models, their direct deployment within corporate environments often faces major challenges. Main among these are concerns related to data security and the risk of data leakage. As a result, many organizations are hesitant to expose sensitive internal data to language models. Additionally, LLMs often suffer from response accuracy issues, including the generation of incorrect or hallucinated responses when lacking domain-specific knowledge. The RAG method capably furnishes resolutions for these challenges. Safe as well as governed admittance to internal information that is stored within specialized repositories known as vector databases is permitted instead of mainly depending on closed training datasets. This approach greatly curtails the risk regarding data breaches and inaccuracies. This paper digs into the basic tenets and salient constituents of the RAG architecture. Many efficacious document segmentation and retrieval strategies are implemented via vector databases and embeddings. In the practical part of this thesis, a complex RAG system is proposed and tested, targeting automotive data. A PostgreSQL database enhanced with vector search support (PGVector) is used to store and manage vector representations. The primary data source is the large dataset from the Kaggle platform.</subfield>
  </datafield>
  <datafield tag="520" ind1="8" ind2=" ">
   <subfield code="a">System evaluation is conducted using a combination of expert queries and ground truth. The evaluation aims to assess the system’s accuracy, output validation, and its ability to minimize hallucinations. The results indicate that the RAG-based architecture is effective in enhancing conversational systems with internal data sources, resulting in more accurate responses, increased reliability, and a notable reduction in hallucinated or fabricated content.</subfield>
  </datafield>
  <datafield tag="538" ind1=" " ind2=" ">
   <subfield code="a">Způsob přístupu: Internet</subfield>
  </datafield>
  <datafield tag="653" ind1="0" ind2=" ">
   <subfield code="a">aplikovaná informatika [obor bakal. práce]</subfield>
  </datafield>
  <datafield tag="655" ind1=" " ind2="7">
   <subfield code="a">bakalářské práce</subfield>
   <subfield code="7">fd132403</subfield>
   <subfield code="2">czenas</subfield>
  </datafield>
  <datafield tag="655" ind1=" " ind2="9">
   <subfield code="a">bachelor's theses</subfield>
   <subfield code="2">eczenas</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">model evaluation</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">prompt engineering</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">embedding</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">LLM</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">RAG</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">retriever</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">AI</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">context data</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2=" ">
   <subfield code="a">Generative AI</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Novák, Richard,</subfield>
   <subfield code="d">1973-</subfield>
   <subfield code="7">ntka173578</subfield>
   <subfield code="4">ths</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Korčák, Jiří,</subfield>
   <subfield code="d">1994-</subfield>
   <subfield code="7">xx0336605</subfield>
   <subfield code="4">opn</subfield>
  </datafield>
  <datafield tag="710" ind1="2" ind2=" ">
   <subfield code="a">Vysoká škola ekonomická v Praze.</subfield>
   <subfield code="b">Fakulta informatiky a statistiky</subfield>
   <subfield code="7">kn20010709399</subfield>
   <subfield code="4">dgg</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="u">https://insis.vse.cz/zp/91036/podrobnosti</subfield>
   <subfield code="y">VŠKP v InSIS</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="u">https://insis.vse.cz/zp/91036</subfield>
   <subfield code="y">Hlavní práce</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="u">https://insis.vse.cz/zp/91036/posudek/vedouci</subfield>
   <subfield code="y">Hodnocení vedoucího</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="u">https://insis.vse.cz/zp/91036/posudek/oponent/85620</subfield>
   <subfield code="y">Oponentura</subfield>
  </datafield>
  <datafield tag="999" ind1="4" ind2="0">
   <subfield code="u">https://insis.vse.cz/zp/91036/podrobnosti</subfield>
   <subfield code="y">dc:identifier</subfield>
  </datafield>
  <datafield tag="993" ind1=" " ind2=" ">
   <subfield code="x">NEPOSILAT</subfield>
   <subfield code="y">VSKP</subfield>
  </datafield>
  <datafield tag="999" ind1="4" ind2="9">
   <subfield code="a">vse91036</subfield>
   <subfield code="b">250625</subfield>
  </datafield>
  <datafield tag="999" ind1="4" ind2="5">
   <subfield code="x">91036</subfield>
  </datafield>
 </record>
</collection>
