Local AI model inference #522

lucksus · 2024-09-05T10:17:42Z

This adds AI model inference to ADAM, based on Kalosm, including downloading and managing models from Hugging Face, registering tasks, and prompting models.

As a first step, this PR focuses only on managing and exposing a couple of hard-coded models, while introducing a new "AI" section in the Ad4mClient. Future PRs will let users select and manage the exact models (#504), register external models via API, and add UI in the ADAM Launcher for all of this.

New AI interface functions in Ad4mClient

This introduces three kinds of model interaction that ADAM provides to apps via its interface:

Language processing with LLMs via tasks

const task = await ad4mClient.ai.addTask(
  // Model name (currently irrelevant, since the model is hard-coded)
  "Llama",
  // System prompt
  "You analyse incoming text for topics and respond with a JSON array including the topic names",
  // Examples
  [
    {
      input: "Hey guys, how is it going?",
      output: '["greeting"]'
    },
    {
      input: "I really want to try the new Synergy plugin in Flux",
      output: '["testing", "Synergy", "Flux"]'
    },
  ]
);

This spawns a session of the given model and configures it with the prompt and the examples, so that it can subsequently be run with:

const answer = await ad4mClient.ai.prompt(task.taskId, "Hello there, yeah I also would like to test that plugin!");
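Since the system prompt above asks the model for a JSON array, an app would probably want to validate the reply before using it. The following is a minimal sketch, assuming `answer` comes back as a plain string (the return type is not stated in this PR). `parseTopics` is a hypothetical helper for illustration, not part of Ad4mClient.

```typescript
// Hypothetical helper (not part of Ad4mClient): validates that a model reply
// is a JSON array of strings and falls back to an empty list otherwise.
function parseTopics(answer: string): string[] {
  try {
    const parsed: unknown = JSON.parse(answer);
    if (Array.isArray(parsed) && parsed.every((t) => typeof t === "string")) {
      return parsed as string[];
    }
  } catch {
    // The reply was not valid JSON; fall through to the empty result.
  }
  return [];
}
```

Calling `parseTopics(answer)` on the result of the prompt call then yields either the list of topic names or an empty array if the model strayed from the requested format.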

Embedding of text

const vector = await ad4mClient.ai.embed("Bert", "This is a test string");
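Embedding vectors are typically compared with cosine similarity. The sketch below assumes the vector arrives as a plain array of numbers, which this PR does not state explicitly. `cosineSimilarity` is a hypothetical helper, not part of Ad4mClient.

```typescript
// Hypothetical helper (not part of Ad4mClient): cosine similarity of two
// equal-length vectors. Returns 0 if either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same number of dimensions");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}
```

Two strings embedded with the same model can then be ranked by how close their similarity score is to 1.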

Transcription of audio

const streamId = await ad4mClient.ai.openTranscriptionStream("Whisper", (text) => {
  console.log(text);
});

// Feed raw sample data (32-bit floats)
await ad4mClient.ai.feedTranscriptionStream(streamId, [0, 10, 20, 30]);
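A longer recording would presumably be fed in several calls rather than one. The sketch below splits a sample buffer into fixed-size chunks. `chunkSamples` is a hypothetical helper, and the chunk size is an arbitrary choice for illustration, since this PR does not specify a preferred size.

```typescript
// Hypothetical helper (not part of Ad4mClient): splits a sample buffer into
// consecutive chunks of at most `chunkSize` samples, preserving order.
function chunkSamples(samples: number[], chunkSize: number): number[][] {
  if (!Number.isInteger(chunkSize) || chunkSize <= 0) {
    throw new Error("chunkSize must be a positive integer");
  }
  const chunks: number[][] = [];
  for (let i = 0; i < samples.length; i += chunkSize) {
    chunks.push(samples.slice(i, i + chunkSize));
  }
  return chunks;
}
```

Each chunk could then be passed in turn to `ad4mClient.ai.feedTranscriptionStream(streamId, chunk)`, awaiting each call so that the samples arrive in order.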

Progress


lucksus and others added 30 commits October 30, 2023 11:49

LLM crate based Eve in CLI experiment

515ad1d

Reduced training prompt fits in context window

809564a

Update llm crate to master branch for ggml v3 models

ec7d19f

Reduced training that fits into context window

bc558c3

Use feature “metal”

34b8bab

Extract Eve command to eve.rs and switch to cbor

8e874ea

Use my fork to fix snapshots

2b5edc9

Halt inference

9f817b9

Use gguf branch of llm crate for new models

b41a37b

Break down eve prompt so fits in context window

3859917

Merge branch 'dev' into llm-integration

bfa501e

# Conflicts: # cli/Cargo.toml # cli/src/main.rs

Merge branch 'dev' into llm-integration

60fee41

chore: Add kalosm crate to Cargo.toml

c03afe0

chore: Update kalosm crate to version 0.3.2 with language feature

569ff3b

Activate eve in ad4m binary

0ef599b

Use kalosm instead of llama

a5a55ac

Merge commit '569ff3bad66231db431304b6c915b7cbde3e8158' into llm-integration

e076f1a

fmt

e8cf636

Merge branch 'llm-integration' into llm-integration-1

2c44de0

# Conflicts: # cli/src/eve.rs

Use default chat model and activate metal

fca4ced

First prompt to preload examples

41d870d

fmt

c268a3f

clippy

542a2d7

cargo.lock

76ca47f

Use ChatBuilder with history

0a05bca

feat: Add AIClient and AIResolver for AI functionality

e3a824f

Added new mutation for embed

2979da0

feat: Update AIResolver embed method to compress and decompress vectors

782da91

feat: Add AIClient for AI functionality

b121cb8

Smoke test ad4mClient.ai.embed()

e31a53f

fayeed and others added 30 commits October 4, 2024 12:37

chore: Refactor AI service to fix typo in downloaded variable name and improve model loading status display

a3ccd45

chore: Refactor AudioStream to handle None value from receiver poll

fd45285

Refactor AI service to enable audio streaming for transcription

6cd9de7

chore: Add missing dynamic libraries to tauri.conf.json resources

02642d9

Refactor AudioStream to handle None value from receiver poll

9fe6781

Refactor AI service to remove unused code and dependencies

0000351

Refactor AI service to optimize imports and formatting

0bbf398

Refactor AI service to import Whisper for sound processing

3613b40

Refactor build scripts to add rpath for dynamic libraries

c214157

Refactor build scripts to add rpath for dynamic libraries

7e26985

chore: Refactor build scripts to remove unused code and optimize imports

fdba22a

Refactor build scripts to add libfuse2 dependency

c7750de

Refactor build scripts to add mesa-utils and mesa-vulkan-drivers dependencies

612edb0

Refactor build scripts to add mesa-utils and mesa-vulkan-drivers dependencies

cfcb713

On Linux, copy libc++.so to libc++_chrome.so

a37d202

Increase clippy timeout in CI

50e99ed

set version to 0.10.0-rc8

fbd95e0

update v8 rev

2ce623d

Update cli/build.rs to handle gn_out/<ARCH> directory correctly

0e1dc7c

fmt

38eaf1c

Update rusty_v8 rev

0e78ee9

Fix cli build after introducing arch directory in v8 output

be6a4fd

ad4m-executor debug launch config

9b59604

Fix launch.json

b1569b2

catch SIGURG

ef2b52f

Fix cli args in ad4m-executor

980f105

fmt

1abe406

clippy

a690f9d

fmt

fa39e4d

Fix paths in our build.rs files pointing to v8 dlylib outputs

0efe25b
