Commit 29d40664 authored by finn

change paths to visuals

parent eb3c3988
+3 −3
@@ -41,7 +41,7 @@ Finally, if there is time, we want to try a **MultiAgent Approach**, in which tw

## Visualisation of Project Plan

-![](project_map.png)
+![](visualisations/project_map.png)

<!-- in the prompts of the multiagent, dialog prompt, say: you have only one of the two roles

@@ -57,13 +57,13 @@ Of the open-source models available, we decided on LLama-2 [(Touvron et al. 2023

We chose Llama-2, as it performs very well compared to other open-source models on a number of benchmarks (see [figure 1](#figure-1-llama-2-overall-performance-on-grouped-academic-benchmarks-compared-to-open-source-base-models-from-the-paper-by-touvron-et-al-2023)).

-![](llama_comparison_other_models.png)
+![](visualisations/llama_comparison_other_models.png)

##### Figure 1: Llama-2: overall performance on grouped academic benchmarks compared to open-source base models (from the paper by [Touvron et al. 2023](#8-touvron-hugo-louis-martin-kevin-stone-peter-albert-amjad-almahairi-yasmine-babaei-nikolay-bashlykov-et-al-2023-llama-2-open-foundation-and-fine-tuned-chat-models-arxiv-httparxivorgabs230709288))

Additionally, considerable emphasis was placed on respectful and non-discriminatory language use when fine-tuning the Llama-2 base models into their chat versions through **Reinforcement Learning from Human Feedback** (RLHF). We consider this to be of great importance in generating data points as potential training data for other chatbots (our [objective](#objective)). This is why we preferred Llama-2 over **Mistral-7B** [(Jiang et al. 2023)](#3-jiang-albert-q-alexandre-sablayrolles-arthur-mensch-chris-bamford-devendra-singh-chaplot-diego-de-las-casas-florian-bressand-et-al-2023-mistral-7b-arxiv-httpsdoiorg1048550arxiv231006825), even though Mistral-7B's base version outperforms the Llama-2-7B base version, and sometimes even the Llama-2-13B base version, on a number of benchmarks (see [figure 2](#figure-2-comparison-of-mistral-7b-with-llama-from-the-paper-by-jiang-et-al-2023)).

-![](comparison_mistral_llama.png)
+![](visualisations/comparison_mistral_llama.png)

##### Figure 2: Comparison of Mistral 7B with Llama (from the paper by [Jiang et al. 2023](#3-jiang-albert-q-alexandre-sablayrolles-arthur-mensch-chris-bamford-devendra-singh-chaplot-diego-de-las-casas-florian-bressand-et-al-2023-mistral-7b-arxiv-httpsdoiorg1048550arxiv231006825))