Richard Kuo

Generative AI - sample code


Text-to-Text (LLMs)

LLM prompting


LLM Server & Client


Colab’s LLM Server & Client


Ollama

ollama list
ollama run llama3.1

ollama chat/generate

ollama speak
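The chat/generate step above maps onto Ollama's REST API, which the local Ollama server exposes at http://localhost:11434 by default. A minimal sketch of a generate call with only the standard library, assuming llama3.1 has already been pulled and the server is running:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default endpoint

def build_generate_payload(model: str, prompt: str) -> dict:
    """Build the JSON body for Ollama's /api/generate endpoint."""
    # stream=False requests one complete JSON reply instead of chunks
    return {"model": model, "prompt": prompt, "stream": False}

def generate(model: str, prompt: str) -> str:
    """Send a prompt to the local Ollama server and return its reply text."""
    data = json.dumps(build_generate_payload(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

if __name__ == "__main__":
    # With the server running, this would print the model's answer:
    # print(generate("llama3.1", "Why is the sky blue?"))
    print(build_generate_payload("llama3.1", "Why is the sky blue?"))
```

Swapping the URL to /api/chat and the payload to {"model": ..., "messages": [...]} gives the chat variant of the same call.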


Audio-to-Text

Local ASR+LLM Server (on your PC+GPU)

  1. Run the server on your local PC: python whisper_llm_server.py
  2. Generate an audio file: python ../gTTS.py "Hello, how are you?" en
  3. Post the audio to the server: python post_audio.py
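A hedged sketch of what a client like post_audio.py in step 3 might look like. The endpoint path /asr and the base64-in-JSON body are assumptions for illustration, not the repo's actual protocol; adjust both to match your server:

```python
import base64
import json
import urllib.request

SERVER_URL = "http://localhost:5000/asr"  # assumed endpoint; change to your server's

def encode_audio(path: str) -> str:
    """Read an audio file and return its bytes as a base64 string."""
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode("ascii")

def post_audio(path: str) -> dict:
    """POST the encoded audio to the ASR+LLM server and return its JSON reply."""
    body = json.dumps({"audio": encode_audio(path)}).encode("utf-8")
    req = urllib.request.Request(
        SERVER_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

if __name__ == "__main__":
    # print(post_audio("gTTS.mp3"))  # run once the server from step 1 is up
    pass
```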

Colab ASR+LLM Server (on Colab T4)

  1. Open pyngrok_Whisper_LLM_Server.ipynb and run it on a Colab T4 GPU
  2. Generate an audio file: python ../gTTS.py "Hello, how are you?" en
  3. Post the audio to the server: python post_audio.py

Image-to-Text (VLM)

VLM servers

To run a server, use one of the following:

  1. python llava_server.py
  2. python llava_next_server.py
  3. python phi3-vision_server.py

To run the client (posts an image and text to the VLM server):
python post_imgtxt.py images/barefeet1.jpg
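A sketch of a client along the lines of post_imgtxt.py, building a multipart/form-data body with the standard library. The endpoint path and the "image"/"text" field names are assumptions; match them to whichever VLM server you started:

```python
import urllib.request
import uuid
from pathlib import Path

SERVER_URL = "http://localhost:5000/vlm"  # assumed endpoint

def build_multipart(image_path: str, text: str) -> tuple[bytes, str]:
    """Build a multipart/form-data body with a text part and an image part."""
    boundary = uuid.uuid4().hex
    image = Path(image_path).read_bytes()
    name = Path(image_path).name
    parts = [
        (f'--{boundary}\r\nContent-Disposition: form-data; '
         f'name="text"\r\n\r\n{text}\r\n').encode(),
        (f'--{boundary}\r\nContent-Disposition: form-data; name="image"; '
         f'filename="{name}"\r\n'
         f'Content-Type: application/octet-stream\r\n\r\n').encode()
        + image + b"\r\n",
        f"--{boundary}--\r\n".encode(),
    ]
    return b"".join(parts), boundary

def post_imgtxt(image_path: str, text: str) -> bytes:
    """POST an image plus a text prompt to the VLM server."""
    body, boundary = build_multipart(image_path, text)
    req = urllib.request.Request(
        SERVER_URL, data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read()

if __name__ == "__main__":
    # post_imgtxt("images/barefeet1.jpg", "Describe this image.")  # needs a running server
    pass
```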

ASR + VLM servers

  1. python whisper_llava_server.py
  2. python ../gTTS.py "這是什麼有名的台南美食?" zh (TTS; the prompt asks "What is this famous Tainan dish?")
  3. python post_imgau.py (client)


Text-to-Speech


Text-to-Image


Image-to-3D

TripoSR

Text-to-3D

gTranslate + SDXL-Lightning + TripoSR + AppInventor2
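The Text-to-3D pipeline above chains translation, image generation, and 3D reconstruction. A skeleton of that orchestration with stub stages — the function names and the stage wiring are placeholders for illustration, not the repo's actual APIs; in the real pipeline each stub would call gTranslate, SDXL-Lightning, and TripoSR respectively:

```python
from typing import Callable

def run_pipeline(prompt: str, stages: list[tuple[str, Callable]]) -> object:
    """Feed the prompt through each named stage in order, logging progress."""
    result = prompt
    for name, stage in stages:
        result = stage(result)
        print(f"{name}: done")
    return result

# Placeholder stages: each just tags its input so the data flow is visible.
def translate(text):       return f"en:{text}"      # would call gTranslate
def text_to_image(text):   return f"image({text})"  # would call SDXL-Lightning
def image_to_3d(img):      return f"mesh({img})"    # would call TripoSR

if __name__ == "__main__":
    mesh = run_pipeline("a shiba inu dog", [
        ("translate", translate),
        ("SDXL-Lightning", text_to_image),
        ("TripoSR", image_to_3d),
    ])
    print(mesh)  # mesh(image(en:a shiba inu dog))
```

The resulting mesh file would then be served to the AppInventor2 mobile app for display.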