Working On: (Patreon)
Content
Hi all, just a post on what is being worked on:
For stable diffusion:
I'm working on improving the GUI overall, add some more options and fixing bug, like multiple loras, also making Control Net work with the GUI.
Whisper:
Implementing WhisperX, that is some cases can improve the timestamp of the output. Also trying to add an option to take the audio from the computer input for more of a Real Time Experience. Perhaps in the future I can also see if this could see if this can run on a phone.
Language Model Interface, Neo/ GPT-J / llama?
Slowly working on a interface for Language Model (check attachments).
The main problem is the model. Neo / GPT-J are a little fun, but hardly useful, perhaps if the GUI can handle Fine-tune, it could become something better?
llama is obvious the better choice, but I can make the weights available, even if I make the weights as a external load, the good weights are still way to big. For now there is some options to handle the (not best) model with 16vram. Let's see if this improve in the next few days.
SD should be ready in a few more days. Whisper and the new Language Interface will take a little more time.