Commit Graph

84 Commits

Author SHA1 Message Date
oobabooga
0abff499e2
Use image.thumbnail 2023-03-22 16:03:05 -03:00
oobabooga
104212529f
Minor changes 2023-03-22 15:55:03 -03:00
Φφ
5389fce8e1 Extensions performance & memory optimisations
Reworked remove_surrounded_chars() to use regular expression ( https://regexr.com/7alb5 ) instead of repeated string concatenations for elevenlab_tts, silero_tts, sd_api_pictures. This should be both faster and more robust in handling asterisks.

Reduced the memory footprint of send_pictures and sd_api_pictures by scaling the images in the chat to 300 pixels max-side wise. (The user already has the original in case of the sent picture and there's an option to save the SD generation).
This should fix history growing annoyingly large with multiple pictures present
2023-03-22 11:51:00 +03:00
oobabooga
536d0a4d93
Add an import 2023-03-20 14:00:40 -03:00
oobabooga
a90f507abe
Exit elevenlabs_tts if streaming is enabled 2023-03-20 11:49:42 -03:00
oobabooga
31ab2be8ef Remove redundant requirements #309 2023-03-19 22:10:55 -03:00
oobabooga
b552d2b58a Remove unused imports
o
2023-03-19 19:24:41 -03:00
oobabooga
eab8de0d4a Merge branch 'main' into Brawlence-main 2023-03-19 13:09:59 -03:00
oobabooga
7da742e149
Merge pull request #207 from EliasVincent/stt-extension
Extension: Whisper Speech-To-Text Input
2023-03-17 12:37:23 -03:00
oobabooga
1c378965e1 Remove unused imports 2023-03-16 10:18:34 -03:00
awoo
0552ab2e9f flask_cloudflared for shared tunnels 2023-03-16 02:00:16 +03:00
awoo
30281122a5 KoboldAI api 2023-03-15 23:52:46 +03:00
oobabooga
1413931705 Add a header bar and redesign the interface (#293) 2023-03-15 12:01:32 -03:00
Φφ
d537b28d02 Extension: Stable Diffusion Api integration
Lets the bot answer you with a picture!
2023-03-14 06:49:10 +03:00
oobabooga
3fb8196e16 Implement "*Is recording a voice message...*" for TTS #303 2023-03-13 22:28:00 -03:00
EliasVincent
02e1113d95 add auto-transcribe option 2023-03-13 21:41:19 +01:00
stefanhamburger
91c2a8e88d
Fix: tuple object does not support item assignment 2023-03-13 07:42:09 +01:00
oobabooga
77294b27dd Use str(Path) instead of os.path.abspath(Path) 2023-03-13 00:08:01 -03:00
oobabooga
1ddcd4d0ba Clean up silero_tts
This should only be used with --no-stream.

The shared.still_streaming implementation was faulty by design:
output_modifier should never be called when streaming is already over.
2023-03-12 23:42:49 -03:00
EliasVincent
48aa52849b use Gradio microphone input instead 2023-03-12 21:03:07 +01:00
Elias Vincent Simon
3b4145966d
Merge branch 'oobabooga:main' into stt-extension 2023-03-12 19:19:43 +01:00
oobabooga
d8bea766d7
Merge pull request #192 from xanthousm/main
Add text generation stream status to shared module, use for better TTS with auto-play
2023-03-12 13:40:16 -03:00
oobabooga
4066ab4c0c Reorder the imports 2023-03-12 13:36:18 -03:00
Phuoc-Hieu Le
781c09235c
Fix typo error in script.py 2023-03-12 15:21:50 +07:00
Xan
b3e10e47c0 Fix merge conflict in text_generation
- Need to update `shared.still_streaming = False` before the final `yield formatted_outputs`, shifted the position of some yields.
2023-03-12 18:56:35 +11:00
Xan
d4afed4e44 Fixes and polish
- Change wav naming to be completely unique using timestamp instead of message ID, stops browser using cached audio when new audio is made with the same file name (eg after regenerate or clear history).
- Make the autoplay setting actually disable autoplay.
- Make Settings panel a bit more compact.
- Hide html errors when audio file of chat history is missing.
- Add button to permanently convert TTS history to normal text messages
- Changed the "show message text" toggle to affect the chat history.
2023-03-12 17:56:57 +11:00
oobabooga
195e99d0b6 Add llama_prompts extension 2023-03-11 16:11:15 -03:00
oobabooga
8f8da6707d Minor style changes to silero_tts 2023-03-11 11:17:13 -03:00
Xan
0dfac4b777 Working html autoplay, clean up, improve wav naming
- New autoplay using html tag, removed from old message when new input provided
- Add voice pitch and speed control
- Group settings together
- Use name + conversation history to match wavs to messages, minimize problems when changing characters

Current minor bugs:
- Gradio seems to cache the audio files, so using "clear history" and generating new messages will play the old audio (the new messages are saving correctly). Gradio will clear cache and use correct audio after a few messages or after a page refresh.
- Switching characters does not immediately update the message ID used for the audio. ID is updated after the first new message, but that message will use the wrong ID
2023-03-11 16:34:59 +11:00
EliasVincent
1c0bda33fb added installation instructions 2023-03-10 11:47:16 +01:00
EliasVincent
a24fa781f1 tweaked Whisper parameters 2023-03-09 21:18:46 +01:00
EliasVincent
00359ba054 interactive preview window 2023-03-09 21:03:49 +01:00
EliasVincent
7a03d0bda3 cleanup 2023-03-09 20:33:00 +01:00
EliasVincent
4c72e43bcf first implementation 2023-03-09 12:46:50 +01:00
Xan
a2b5383398 Merge in audio generation only on text stream finish., postpone audioblock autoplay
- Keeping simpleaudio until audio block "autoplay" doesn't play previous messages
- Only generate audio for finished messages
- Better name for autoplay, clean up comments
- set default to unlimited wav files. Still a few bugs when wav id resets

Co-Authored-By: Christoph Hess <9931495+ChristophHess@users.noreply.github.com>
2023-03-09 10:48:44 +11:00
Xan
738be6dd59 Fix merge errors and unlimited wav bug 2023-03-08 22:25:55 +11:00
Xan
5648a41a27 Merge branch 'main' of https://github.com/xanthousm/text-generation-webui 2023-03-08 22:08:54 +11:00
Xan
ad6b699503 Better TTS with autoplay
- Adds "still_streaming" to shared module for extensions to know if generation is complete
- Changed TTS extension with new options:
   - Show text under the audio widget
   - Automatically play the audio once text generation finishes
   - manage the generated wav files (only keep files for finished generations, optional max file limit)
   - [wip] ability to change voice pitch and speed
- added 'tensorboard' to requirements, since python sent "tensorboard not found" errors after a fresh installation.
2023-03-08 22:02:17 +11:00
oobabooga
44e6d82185 Remove unused imports 2023-03-07 22:56:15 -03:00
oobabooga
eebec65075 Improve readability 2023-03-06 19:46:46 -03:00
oobabooga
944fdc03b2 Rename the folder 2023-03-06 19:38:36 -03:00
oobabooga
49ae183ac9 Move new extension to a separate file 2023-03-06 19:28:53 -03:00
MetaIX
9907bee4a4
Support for Eleven Labs TTS
As per your suggestion at https://github.com/oobabooga/text-generation-webui/issues/159 here's my attempt.

I'm brand new to python and github. Completely different from unreal + visual coding, so forgive my amateurish code. This essentially adds support for Eleven Labs TTS. Tested it without major issues, and I believe it's functional (hopefully).

Extra requirements: elevenlabslib https://github.com/lugia19/elevenlabslib, sounddevice0.4.6, and soundfile

Folder structure is the same as the SileroTTS Extension.
2023-03-05 19:04:22 -06:00
oobabooga
024d30d1b4 Reorder imports 2023-03-02 01:23:18 -03:00
oobabooga
e81c023b77 Slight style changes in gallery 2023-02-27 13:49:55 -03:00
oobabooga
43b6ab8673 Store thumbnails as files instead of base64 strings
This improves the UI responsiveness for large histories.
2023-02-27 13:41:00 -03:00
oobabooga
a08802bf70 Sort the gallery alphabetically 2023-02-27 13:04:06 -03:00
oobabooga
83368875eb Minor style change 2023-02-27 11:55:36 -03:00
oobabooga
9c7fe2ec5d
Handle overflows in character names 2023-02-27 11:46:15 -03:00
conanak99
b5e31c8fef fix responsive issue 2023-02-27 22:30:52 +08:00