emacs-devel

Re: LLM Experiments, Part 1: Corrections


From: Andrew Hyatt
Subject: Re: LLM Experiments, Part 1: Corrections
Date: Mon, 22 Jan 2024 20:52:18 -0400
User-agent: Gnus/5.13 (Gnus v5.13)


On 22 January 2024 14:06, "T.V Raman" <raman@google.com> wrote:

> Some more related thoughts below, mostly thinking aloud:
>
> 1. From using gptel and ellama against the same model, I see different
> style responses, and that kind of inconsistency would be good to get a
> handle on; LLMs are difficult enough to figure out re what they're
> doing without this additional variation.

Is this keeping the prompt and temperature constant? There's inconsistency even when everything is kept constant, due to the randomness of the LLM. I often get very different results; for example, to make the demo I shared, I had to run it about five times because it would either do things too well (no need to demo corrections) or not well enough (for example, it wouldn't follow my instructions to put everything in one paragraph).
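For reference, here is a minimal sketch of the kind of controlled rerun I mean, using the llm package; the prompt-construction keywords are from memory of the current API, so treat the details as illustrative rather than exact:

(require 'cl-lib)
(require 'llm)
(require 'llm-ollama)

;; Sketch: hold the prompt and temperature fixed and rerun several times,
;; so any remaining variation comes from sampling alone.
(defvar my-test-provider (make-llm-ollama :chat-model "mistral")
  "Illustrative provider; any llm provider struct would work here.")

(defun my-rerun-fixed-prompt (n)
  "Send the same prompt N times and return the list of responses."
  (cl-loop repeat n
           collect (llm-chat my-test-provider
                             (llm-make-chat-prompt
                              "Rewrite the region as a single paragraph."
                              :temperature 0.2))))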

> 2. Package LLM has the laudable goal of bridging between models and
> front-ends, and this is going to be vital.
>
> 3. (1, 2) above lead to the following question:
>
> 4. Can we write down a list of common configuration vars --- here
> common across the model axis. Make it a union of all such params.

I think the list of common model-and-prompt configuration options is already in the llm package, but we will probably need to keep expanding it.
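As a purely illustrative starting point, that union might look something like a plist of the parameters most providers accept, which a front-end could translate into whatever the llm prompt structure supports (the names below are hypothetical, not the package's current slots):

;; Hypothetical union of parameters "common across the model axis".
(defvar my-common-model-params
  '(:temperature 0.2     ; sampling randomness
    :max-tokens 1024     ; cap on response length
    :top-p nil           ; nucleus sampling; not every provider supports it
    :system-prompt nil)  ; context / system message
  "Example union of model-side parameters a front-end might expose.")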


> 5. Next, write down a list of all configurable params on the UI side.

This will change quite a bit depending on the task. It's unclear how much should be configurable: for example, in the demo I use ediff so the user can see and evaluate the diff. But maybe that should be configurable, so that a user who wants to see just a diff output instead can have that? When I was thinking about a state machine, I was thinking that parts of it might be overridable by the user; for example, "have the user check the results of the operation" could be a state in the state machine for which the user can define their own function. I suspect we'll have a better idea of this after a few more demos.
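To make the overridable-state idea a bit more concrete, here is a rough sketch (all names hypothetical) of how a correction workflow could let the user swap out just the verification step:

;; Hypothetical sketch: each workflow state maps to a handler the user
;; can override, e.g. replacing the ediff check with a plain diff buffer.
(defvar my-correction-states
  '((:generate . my-default-generate)
    (:verify   . my-ediff-verify)
    (:apply    . my-default-apply))
  "Alist of workflow states to handler functions.")

(defun my-run-state (state &rest args)
  "Call the handler registered for STATE with ARGS."
  (apply (alist-get state my-correction-states) args))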

> 6. When stable, define a single data-structure in elisp that acts as
> the bridge between the front-end emacs UI and the LLM module.

If I understand you correctly, this would be the configuration you listed in your points (4) and (5)?
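If that reading is right, the bridge might end up looking something like the following cl-defstruct; the field names are made up, just to show model-side and UI-side settings living in one place:

(require 'cl-lib)

;; Hypothetical bridge structure between the Emacs UI and the LLM module.
(cl-defstruct my-llm-bridge
  provider         ; an llm provider struct, e.g. from make-llm-ollama
  temperature      ; model-side parameters, point (4)
  max-tokens
  verify-function  ; UI-side parameters, point (5)
  show-diff-p)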


> 7. Finally factor out the settings of that structure and make it
> possible to create "profiles" so that one can predictably experiment
> across front-ends and models.

I like this idea, thanks!
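Sketching it out, profiles could be little more than named instances of a bridge structure like the one above (again, every name here is hypothetical):

;; Hypothetical named profiles for reproducible experiments across
;; front-ends and models.  `my-openai-key' stands in for the user's key.
(defvar my-llm-profiles
  `(("fast-local"   . ,(make-my-llm-bridge
                        :provider (make-llm-ollama :chat-model "mistral")
                        :temperature 0.2 :max-tokens 512))
    ("careful-gpt4" . ,(make-my-llm-bridge
                        :provider (make-llm-openai :key my-openai-key
                                                   :chat-model "gpt-4")
                        :temperature 0.0 :max-tokens 1024)))
  "Example profiles one could switch between when comparing setups.")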



