I am back with a new overpriced and even more power hungry gpu but more interestingly some AI advancements you can run at home as long as you have spent money on a gpu that is š
Let me introduce ControlNet. Its a new technique which allows the insertion of extra generation parameters in the form of images but not in the same way as img2img. Images are first run through a selected model to create said parameters, this can be a depth map, normal map, or even a detected pose as you can see from the example images on the github repo.
In my opinion the "openpose" detection is the most interesting and useful for generating realism with depth map being a close second. It allows you much more natural (or kinky) poses in generation without impacting the quality through a different method like basing a generation on img2img. You could even take a quick phone photo of yourself in the perfect pose you want to generate!
There are still some tells here and there in my generations but I am confident I could fool the casual scroller which puts us into the "danger zone" as I like to call it. 99% of people aren't pixel peeping every image they see or zooming in on clothing straps to see if that's how they would really sit on someones shoulder. We have blown straight past uncanny valley and I have no idea where we ended up.
Within 40 seconds I am now able to create four photos that could pass as real photos as long as you don't start zooming in or pixel peeping which you can see below (I've also attached some latex ones just for fun)
Slightly off topic but there is an AI focused video from Tom Scott which really interested me and got me thinking about where we could end up. If you are browsing this thread it may also interest you. He mentions the sigmoid curve of tech development and I must wonder just where we are and where we will end up. It will be a very interesting year...
Source: https://www.youtube.com/watch?v=jPhJbKBuNnA
Let me introduce ControlNet. Its a new technique which allows the insertion of extra generation parameters in the form of images but not in the same way as img2img. Images are first run through a selected model to create said parameters, this can be a depth map, normal map, or even a detected pose as you can see from the example images on the github repo.
In my opinion the "openpose" detection is the most interesting and useful for generating realism with depth map being a close second. It allows you much more natural (or kinky) poses in generation without impacting the quality through a different method like basing a generation on img2img. You could even take a quick phone photo of yourself in the perfect pose you want to generate!
(10 Feb 2023, 02:01 )BoundĀ Whore Wrote: The next couple of years are going to be absolutely bonkers.I already don't trust them. I think of myself as someone with a pretty good eye for this sort of thing but I have second guessed multiple twitter posts recently.
You won't be able to trust your own eyes.
There are still some tells here and there in my generations but I am confident I could fool the casual scroller which puts us into the "danger zone" as I like to call it. 99% of people aren't pixel peeping every image they see or zooming in on clothing straps to see if that's how they would really sit on someones shoulder. We have blown straight past uncanny valley and I have no idea where we ended up.
Within 40 seconds I am now able to create four photos that could pass as real photos as long as you don't start zooming in or pixel peeping which you can see below (I've also attached some latex ones just for fun)
Slightly off topic but there is an AI focused video from Tom Scott which really interested me and got me thinking about where we could end up. If you are browsing this thread it may also interest you. He mentions the sigmoid curve of tech development and I must wonder just where we are and where we will end up. It will be a very interesting year...
Source: https://www.youtube.com/watch?v=jPhJbKBuNnA
(This post was last modified: 16 Feb 2023, 16:06 by dhf7b8g. Edit Reason: Removing the video link because I dont know how to get rid of the embed )

I'm completely open to ideas.