[ home ] [ q / qa / oc / chud / brabant / zellig ] [ overboard ] [ v ] [ archive / telegram / zelligwiki / booru ] [ execution list / pph ] [ Rules / Contacts ] [] [ log in ]

/oc/ - Original Content

Sharing Original Content and organising community OC projects
Options
Comment
File
Embed
Password (For file deletion.)

File: 1698475284251.png (208.72 KB, 450x497, 1696278595650.png)

 No.1881

/AI/ general thread
Post AI zelliged 'zarty-saving audio, image and video gemeralds and also teach me how to do them too since I'm a selfish little fuck and I can't learn elsewhere

 No.1882

File: 1698504478282.png (63.19 KB, 1269x357, FireShot Capture 347 - AI ….png)


In the AI MEGA link, you can find instruction for using the google colab.
At this point, most of you know that Google has been doing some crackdowns on AI voice colabs, so I will give some instructions as to how to download a local program. Note that there is currently an effort to make the instlation and usage of RVC programs more user-firendly.

Firstly, you will need to download a RVC program. I will recomend Mangio, since it doesn't crash 99% of the time. Here is the Git page:
https://github.com/Mangio621/Mangio-RVC-Fork

If you are having toruble with the download, you can always type "RVC Mangio download" on youtube to find some help.

Once you are done with the download, run "go-web.bat" to open the web UI.

You will only need to know about three files:

audio - this is where you put the audio that you want to be edited by the AI.

weights - this is where you will put the the .pth file of your AI mvoice model.

logs- this is where you put your .index files. Note that here you will have to first creat a folder with the name of the voice model and then put the .index file inside that folder. For example:
Mangio-RVC-v23.7.0\logs\cocomodel\coco.index

As for ny other information as to how to use the UI, you will find links in the AI MEGA file folder "ai tools" or you will learn about it in the youtube tutorials.


Here is the AI mega link:
https://mega.nz/folder/wH1FxTqD#TaV9p40IymmrR9brpOq0KQ

 No.1883

File: 1698546257514-0.mp3 (8.18 MB, mymytarget.mp3)

File: 1698546257514-1.mp3 (8.18 MB, cocotarget.mp3)

I don't think I've ever shared these outside of /co/

 No.1884

File: 1698731520086.png (622.82 KB, 840x1113, ClipboardImage.png)

I used that ElevenLabs free auto-translate for videos thing to make a bunch of lines of Vera speaking in almost every language. This is so that I could train a new model with some more data (it totals at ~21 minutes). I tried to clean and cut out the silence between words as best as I could, I'll probably put out a dataset where all the audio in this one is passed through Adobe Enhance so that it'll sound a little cleaner and less shit (Vera1/2/3NEW have already been passed through Enhance however). In either case I'm posting it here now because I still have to figure out how to train it since I don't know which program or collab to use now that Google killed off the decent collab links from 2 months back.
https://www.mediafire.com/file/fsgh7ixsyhq932c/verasetNEWER.zip/file

 No.1885

Apparently you can use this program called Replay to make AI covers, if any willing to try it go ahead because it not work on my laptop https://www.tryreplay.io/

 No.1886

>>1885
I will try it out.
I think that a zaryan already made a AI cover with replay.

 No.1887

File: 1699049992476.mp3 (209.23 KB, Cocomodelrmvpee 1250 sings….mp3)

>>1885
>>1886
I have installed it and wow. It's pretty easy to use! Here is a Coco vocal test using the AI model.

We should tell about it to other teens. I will make a tutorial on how to use it.

 No.1888

File: 1699787481405-0.mp3 (3.85 MB, Paranoid_Maya_Final.mp3)

File: 1699787481405-1.mp3 (6.35 MB, ongesabbath (1).mp3)

I was going to do a thumbnail, but I have little time for that.
The midi was not edited by me, but another zaryan. It uses the sound sources from here: https://docs.google.com/spreadsheets/d/1nj-a6mtLuWhhUCrbB9XktQE5DErR4RtMGtx0h4LPzE0/edit?usp=sharing
Also, I originaly intended to sing it myself. It turned out decent, but then a friend sent me a better singing record (maybe taken from Guitar Hero).

 No.1889

File: 1699821854968.mp3 (1.34 MB, Mymy - Banana Man.mp3)

>>1881
hey guys i downloaded rvc-mangio 2 days ago and i started working on some ai covers
as you might have noticed by pressing the mp3 file the voices my pc generated are absolute trash could someone help me in making the voices sound less robotic and more "human" sounding??
i used the web-ui (go-web.bat) in the RVC-Mangio folder and used the voices with the most epochs
also my graphics card is a bit old and my computer crashed a few times if the mp3 sound file was too long

 No.1890

>>1889
You could try to thinker with the options. Try upping the voice tone higher (6-12). That usually fixes the lowpitch robotic sounds.
Some people have used some addobe programs or audacity to edit the voices.
If you still cant get good results, then the original voice could be the problem. Try singing yourself the parts that sound bad with AI.

 No.1891

File: 1700238962320.webm (6.05 MB, 1024x1024, mymydammit.webm)


 No.1892


 No.1893

File: 1700262746130.webm (5.91 MB, 1920x1080, 24 hour mymy.webm)

>>1890
hey man i finally got the opportunity to work on smth again i listened to your advice and made this
if you have some tips on how to make the voice sound better even with another program that would be helpful

 No.1894

>>1893
(might have fucked up the voice sync)

 No.1895

>>1894
>>1893
It sounds pretty good!
Don't try to make it perfect, since AI has it's limitations.
I like it doe!

I don't think I have any other tips. Just experiment with editing and you will get great results.

 No.1896

>>1895
thanks man

 No.1897

File: 1700286677665.webm (6.35 MB, 1000x1400, mymy-rebel.webm)


 No.1898

File: 1700304277804.webm (306.75 KB, 1080x1456, MymySkibidi.webm)

caca

 No.1899


 No.1900

File: 1700337870470.webm (8.1 MB, 640x360, keyed mymy plays tf2.webm)


 No.1901

>>1900
I think that the pitch is too high. Tone it down a bit.
But I bloody love the idea!

 No.1902

>>1901
lower than that it sounds robotic

 No.1903

>>1901
actually wait i'll see what i can do

 No.1904

File: 1700339631500.webm (8.1 MB, 640x360, keyed mymy plays tf2 and ….webm)


 No.1905

>>1904
I think it's better, even if it has some parts that sound a bit robotic.
Gem in the rough.

 No.1906

>>1904
>>1900
Most glistering gemerald all week, and likely for many more weeks to come

 No.1907

>>4171
how do you people make this
also mymy is shorter than maya and coco is the tallest (if i'm not wrong she is supposed to be like 1,8 meters tall)

 No.1908

>>1907
i think it was some sort of ai, not sure what was the name. check the ai zellig thread on /qa/, i think that you may find the name somewhere.

 No.1909

File: 1700654965971-0.png (62.62 KB, 255x144, ClipboardImage.png)

File: 1700654965971-1.mp3 (6.48 MB, baath duet.mp3)

File: 1700654965971-2.png (66.37 KB, 255x138, ClipboardImage.png)

Testing out something and these two images have inspired me to make this (no I have not gotten to converting that middle part yet, sorry)

 No.1910

>>1909
Shining, glistening, zarty saving 'rald. I hope you finish it.

 No.1911

File: 1701087273960-0.png (157.24 KB, 530x293, ClipboardImage.png)

File: 1701087273960-1.mp3 (266.12 KB, maya6.mp3)

Maya can finally scream now

 No.1912

File: 1701227003522-0.png (628.49 KB, 512x768, pixai-1685341241239516086-….png)

File: 1701227003522-1.png (722.3 KB, 512x768, pixai-1685339065751486725-….png)

File: 1701227003522-2.png (677.11 KB, 512x768, pixai-1683510121657764151-….png)

File: 1701227003522-3.png (675.52 KB, 512x768, pixai-1683371110721131284-….png)


 No.1913

File: 1701361161168-0.png (674.91 KB, 700x700, zygalvql.png)

File: 1701361161168-1.mp3 (8.48 MB, Coco - Pollyanna.mp3)

New Coco models trained with new data from ElevenLabs. Still in a bit of a testing phase, but this song used 180 epochs
DL: https://huggingface.co/filler5736/cocomodelNEWER/tree/main

 No.1914

>>1913
shining gem!!!
also question how did you get the voice to sound so clean??? i tried to use both RVC and Replay but the damn voice keeps sounding robotic
also could someone explain what difference has a model with low epochs and one with many epochs?

 No.1915

Bumo

 No.1916

>>1914
Might just be the instrumentals doing a good job covering up the shittier parts of the voice + mixing and reverb.
Epochs is just how long a model is trained to "sound" like the voice, the two I have available for download where ones I think were at a good stopping point before it went on to overtrain too much.

 No.1917


 No.1918

File: 1705023249368.jpeg (404.79 KB, 512x768, 644DD5F0-7570-4C39-84D5-E….jpeg)

AIs are bad at hands and eyes just like real humans but at least they released the Coco model already https://pixai.art/model/1644300693524083429

 No.1919

File: 1705230073160-0.png (604.48 KB, 829x542, bnD1odXivY.png)

File: 1705230073160-1.mp3 (7.76 MB, Fake Vera - Summer Breeze.mp3)

Made a model of Deel 6 Vera when her VA (Kayleigh) dropped out after Deel 3 and Joyce (Mymy's VA) decided to take her role for Vera. I used ElevenLabs to generate a bunch of voicelines for data.
https://huggingface.co/filler5736/VeraJoyceVA/resolve/main/verajoycemodel.zip?download=true
I might do something fun with this, who knows.

 No.1920

File: 1707885687001.webm (7.41 MB, 1160x1476, BelgianWithRage.webm)

Bvmp

 No.1921

>>1919

what song is that? its kinda of a bop

 No.1922


 No.1923

File: 1723673971139.wav (26.94 MB, Maya sings at the world ca….wav)

>>1922
Thanks for bumping this thread. Now i can upload this gem:

 No.1924

File: 1724274609224-0.jpg (26.84 KB, 686x386, hoi osama.jpg)

File: 1724274609224-1.mp3 (8.82 MB, deel team 6.mp3)


 No.1925

File: 1729662577796-0.mp3 (13.7 MB, Vera - Waiting For You.mp3)

File: 1729662577796-1.png (332.15 KB, 200x200, VeraWat.png)

Kino is back on the menu fellas
>Applio colab link so now you don't have to have a NASA computer to run it or wait 9000 years on huggingface: https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb

 No.2275

File: 1731218423861-0.png (348.76 KB, 1581x977, ClipboardImage.png)

File: 1731218423861-1.mp3 (9.78 MB, Mymy - Kali Yuga.mp3)

Someone a while back requested one of the girls sing Kali Yuga, so if that person is still here this is for you.

 No.2276

File: 1731222701535-0.gif (2.85 MB, 320x240, 1105.gif)

File: 1731222701535-1.mp3 (5.99 MB, mymy - krzyczę głośno disc….mp3)


 No.2279

File: 1731233668380.mp4 (27.94 MB, 1280x720, VID_20240626_011759_358.mp4)

>>2276
Oh I remember this song, I wanted to cover it once but then forgot
<original.mp4

 No.3454

File: 1738472589115-0.png (347.17 KB, 653x573, Vera30.png)

File: 1738472589115-1.mp3 (9.07 MB, Vera - Love Pslam.mp3)

Another Vera AI cover has hit the towers

 No.3456

>>3454
Fact checked by multiple peer reviewed studies to be something she'd play and sing

 No.3472

File: 1738585032769-0.mp3 (4.17 MB, A Kiss To Build A Dream On….mp3)

File: 1738585032769-1.jpg (110.41 KB, 1080x1608, 1738342246787.jpg)


 No.3473

File: 1738585560519.png (816.84 KB, 1488x831, theyarecalm.png)

>>3472
wonderful

 No.3474

File: 1738586703579-0.png (1.03 MB, 1280x720, 596 - big_eyes binky car d….png)

File: 1738586703579-1.mp3 (1.87 MB, lv_0_20250203134207.mp3)

(the AI shits itself if it continues past that part)

 No.3475

File: 1738587081636-0.png (1.79 MB, 1248x1080, ClipboardImage.png)

File: 1738587081636-1.mp3 (9.07 MB, Vera - Love Pslam.mp3)

>>3454
Completely fucked up by not including the backing vocals by mistake, this should fix that.

 No.3476

>>3475
This should be uploaded in video format on jewtube

 No.3501

File: 1738657847519-0.png (57.18 KB, 500x250, ClipboardImage.png)

File: 1738657847519-1.mp3 (10.78 MB, Vera - Your Rain.mp3)

Silent Hill 4 - Your Rain

 No.3508

>>3501
This one worked out really well.

 No.3509

File: 1738668857023-0.jpg (310.89 KB, 2048x2048, 3735 - black_and_white may….jpg)

File: 1738668857023-1.mp3 (3.61 MB, lv_0_20250204122702.mp3)


 No.3527

File: 1738695491395-0.jpg (131.76 KB, 1157x1599, youhavenoideahowlongittook….jpg)

File: 1738695491395-1.mp3 (3.64 MB, output_track (6).mp3)


 No.3530

File: 1738768060050-0.jpg (347.77 KB, 3000x4000, 1733176255693.jpg)

File: 1738768060050-1.mp3 (3.6 MB, lv_0_20250205160417.mp3)

Tomorrow

 No.3576

File: 1739076830549-0.jpg (474.06 KB, 1080x1080, 1711997290902.jpg)

File: 1739076830549-1.mp3 (3.3 MB, mayaneuroticc.mp3)

File: 1739076830549-2.mp3 (5.19 MB, mayaplzbepatient.mp3)

File: 1739076830549-3.mp3 (2.43 MB, maya21.mp3)

NEET Maya

 No.3577

>>3576
Zaryan don't you know that truecelcore is literally fact checked to cause violence

 No.3578

File: 1739100873409.png (161.01 KB, 512x512, file_148294439.png)

>Zaryan don't you know that truecelcore is literally fact checked to cause violence

 No.3579

File: 1739160462665-0.png (543.63 KB, 990x1848, 1734900612270.png)

File: 1739160462665-1.mp3 (5.57 MB, mymyoutt.mp3)

Falling Zellig

 No.3580

>>3579
Needs more tuning at the start

 No.3581

>>3580
not wrong, I simply could not figure out how to make it sound better, probably needed a better mymy model

 No.3582

>>3581
I guess you're using Applio, try these models
https://soyzellig.party/zellig/res/29468.html#29515
And if you can't really find a good solution, go to weights.gg set the Consonant protection to Safe and it should fix it



[Return][Go to top] Catalog [Post a Reply]
Delete Post [ ]
[ home ] [ q / qa / oc / chud / brabant / zellig ] [ overboard ] [ v ] [ archive / telegram / zelligwiki / booru ] [ execution list / pph ] [ Rules / Contacts ] [] [ log in ]