An Alternative Approach to Building Generative AI Models

buildinstuff5432 · 2 years ago

An Alternative Approach to Building Generative AI Models

buildinstuff5432 · 2 years ago

Wow. This project is off to a great start and is reusing today’s generation of ai models/techniques to explore alternative models for a new generation.

I am excited to see I’m not the only one fired up about addressing today’s model limitations like context size/window (https://github.com/arthurwolf/llmi/blob/main/README.md#recursive-redaction). Once we pop the weights out, we can reuse the weights in a new model configuration that has a larger context size (hopefully haha!).

Are you thinking about using a multimodal transformer for the “Thinking with code” section or something new and exciting I’ve never heard of (https://github.com/arthurwolf/llmi/blob/main/README.md#thinking-with-code)? I like the “Checking with Accuracy” section too (https://github.com/arthurwolf/llmi/blob/main/README.md#checking-for-accuracy), this is what I’m thinking of as a watermark for verifying a model’s at-rest weights have “trained knowledge” kind of like security scanning container images at rest in the CICD space vs verification the model answered the question(s) correctly while running/in-memory.

I could keep going, but what do you think are the next steps for your project?