/g/ - Technology


Thread archived.
File: nightracing-34928762.jpg (258 KB, 1776x1224)
Previous /sdg/ thread: >>100110132

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI (Node-based): https://rentry.org/comfyui
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux
Vladmandic: https://github.com/vladmandic/automatic

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
Inpainting: https://huggingface.co/spaces/fffiloni/stable-diffusion-inpainting
pixart: https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma

>Models, LoRAs & embeddings
https://civitai.com
https://huggingface.co
https://rentry.org/embeddings

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>SDXL info & download
https://rentry.org/sdg-link#sdxl

>Index of guides and other tools
https://codeberg.org/tekakutli/neuralnomicon
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg

Official: discord.gg/stablediffusion
>>
File: SDG_News_00167_.png (1.71 MB, 1560x896)
>mfw Resource news

04/21/2024

>FlashFace Inference Code Released
https://github.com/ali-vilab/FlashFace

>ComfyUI MagickWand: Proper implementation of ImageMagick
https://github.com/Fannovel16/ComfyUI-MagickWand

>Moving Object Segmentation: All You Need Is SAM (and Flow)
https://www.robots.ox.ac.uk/~vgg/research/flowsam/

>Image Effect Scheduler Node Set for ComfyUI
https://github.com/hannahunter88/anodes/

>ComfyUI-Tripo: Generate 3D models using the Tripo API
https://github.com/VAST-AI-Research/ComfyUI-Tripo

04/20/2024

>Basic Stable Diffusion API GUI
https://github.com/ThioJoe/BasicStabilityAPI-GUI/

>IPAdapter Advanced Weighting support added to sd-webui-controlnet
https://github.com/Mikubill/sd-webui-controlnet/discussions/2770

04/19/2024

>Customizing Text-to-Image Diffusion with Camera Viewpoint Control
https://customdiffusion360.github.io/

>StyleBooth: Image Style Editing with Multimodal Instruction
https://ali-vilab.github.io/stylebooth-page/

>Sketch-guided Image Inpainting with Partial Discrete Diffusion Process
https://github.com/vl2g/Sketch-Inpainting

>ComfyUI ImageMagick: Image processing powered by ImageMagick
https://github.com/jtydhr88/ComfyUI-ImageMagick

04/18/2024

>Meta releases meta.ai, a multimodal AI assistant including image generation
https://www.meta.ai/

>Stability AI lays off roughly 10 percent of its workforce
https://www.theverge.com/2024/4/18/24133996/stability-ai-lay-off-emad-mostaque

>Stability API nodes for ComfyUI
https://github.com/Stability-AI/ComfyUI-SAI_API

>Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
https://animate-your-word.github.io/demo/

>InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
https://johanan528.github.io/Infusion/

>Factorized Diffusion: Perceptual Illusions by Noise Decomposition
https://dangeng.github.io/factorized_diffusion/

>KGen - A System for Prompt Generation to Improve Text-to-Image Performance
https://github.com/KohakuBlueleaf/KGen
>>
>mfw Research news

04/21/2024

>Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning
https://arxiv.org/abs/2404.11795

>MultiPhys: Multi-Person Physics-aware 3D Motion Estimation
https://www.iri.upc.edu/people/nugrinovic/multiphys/

>ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
https://arxiv.org/abs/2404.12216

>BLINK: Multimodal Large Language Models Can See but Not Perceive
https://arxiv.org/abs/2404.12390

>Generating Human Interaction Motions in Scenes with Text Control
https://arxiv.org/abs/2404.10685

>Dual Modalities of Text: Visual and Textual Generative Pre-training
https://arxiv.org/abs/2404.10710

>DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling
https://arxiv.org/abs/2404.09227

>Conditional Prototype Rectification Prompt Learning
https://arxiv.org/abs/2404.09872

04/20/2024

>Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
https://research.nvidia.com/labs/toronto-ai/AlignYourSteps/

>Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
https://arxiv.org/abs/2404.11732

>Partial Large Kernel CNNs for Efficient Super-Resolution
https://arxiv.org/abs/2404.11848

>From Image to Video, what do we need in multimodal LLMs?
https://arxiv.org/abs/2404.11865

>GhostNetV3: Exploring the Training Strategies for Compact Models
https://arxiv.org/abs/2404.11202

>ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
https://arxiv.org/abs/2404.10141

>StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization
https://arxiv.org/abs/2404.10681

>Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
https://arxiv.org/abs/2404.09326

>Exploring Text-to-Motion Generation with Human Preference
https://arxiv.org/abs/2404.09445

>Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
https://arxiv.org/abs/2404.09601
>>
Anyone tried fp8 training using Transformer Engine? Anyway gonna hope I can make this Docker container work and see what comes out.
>>
can anon post a gen using PAG that doesn't look fried?
>>
I am retarded, how do I actually run Comfy UI on Ubuntu?

I git cloned the repo, but I don't see a start/run/webui.sh

And I checked the readme before asking I swear.
>>
File: 1713721974635_01.png (3.61 MB, 1552x1552)
>>100114450
>>
File: 00035-2227307230.png (1.88 MB, 1072x1376)
>>100115948
I'm not taking the bait but here's your (You)
>>
File: dena_00062_.png (2.55 MB, 1728x1344)
>>
>>100115982
I'm not baiting

There are no instructions on starting the software. server.py or execution.py just return an error and don't start.
>>
File: 00037-834085774.png (1.98 MB, 1072x1376)
>>100116044
lol (You)
>>
Planning on training an SD3 model, what would you want to see most in a new model?
>>
File: roboquok.png (2.21 MB, 1024x1024)
>>100115877
good day
>>
>>100116082
There's a severe lack of general purpose/versatile models.
>>
File: 00002-1109164631.png (1.97 MB, 1024x1536)
>>100116082
ANIME
That's all we care about.
Of course, please train the model on furniture, different kinds of clothes, facial expressions, etc.
It takes a lot to make a model usable and not just some one-trick pony. Best of luck, friend.
>>
File: dena_00055_.png (2.58 MB, 1728x1344)
>>100116082
seems like you should just go straight for an nsfw model since the most common complaint is gonna be "it can't do nsfw"
>>
>>100116082
ideally it'd understand how to do anything going on in manga/hentai when instructed by a reasonably powerful LLM or (you) including the difficult ones like ha ku ronofu jin nsfw where things cause things.
>>
>>100116044
main.py retard
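for anyone else stuck on this, the usual steps are something like this (a sketch assuming a stock clone and a torch install that already works):

cd ComfyUI
pip install -r requirements.txt
python main.py
# UI is served at http://127.0.0.1:8188 by default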
>>
File: 0.jpg (404 KB, 1024x1024)
>>
>>100116082
a token limit greater than 75
>>
>>100116082
the obvious answer is just anime porn. i wouldn't really know until i actually get to try it with a proper workflow (copium) and see what it's bad at. every finetune out there just completely butchers it into either a sameface anime porn generator or a sameface 1girl portrait generator
>>
File: 1.jpg (112 KB, 1152x864)
>>
have they even released a set of captions or their captioner prompt? wasn't this supposed to be trained with natural language captions? it would be nice to know the length/terminology used so that finetune captions don't conflict
>>
We haven't fully explored XL yet; why are we thinking about SD3?
>>
File: g_0002.jpg (419 KB, 1983x1983)
>>
>>100116263
Go slow, get lapped.
>>
>>100116207 >>100116130
Honestly, a *booru dump like danbooru 202x is probably going to waste the least of your time tagging data anyhow.
>>
File: sci-fi31.jpg (163 KB, 1024x1536)
>>
>>100116281
I'm never getting an exquisite details tier XL model am I
>>
>>100116263
I have not fully explored 1.5 yet.
>>
File: 00017-1423708437.png (2.2 MB, 1536x1024)
>>
File: g_0003.jpg (312 KB, 1983x1983)
>>
File: 00009-136854725.png (1.84 MB, 1536x1024)
>>100116300
>implying booru tagging is good
>implying booru tagging is competent
>implying booru tagging is consistent enough to make a good dataset
Shortcuts just cause more problems for us all.
fuck off
>>
File: 00027-2113660913.jpg (411 KB, 1792x2304)
>>
File: 00001-2536060942.png (2.07 MB, 1536x1024)
Preparing a dataset needs to be a team effort.
>>
File: file.png (2.06 MB, 1024x1024)
>>100116444
Yes, all of this is just fine overall.
>>
>>100116343
Probably not if SD3 can do more exquisite and more XL.

Of course, that will need to happen fast or it'll get bumped to SD4.
>>
>>100116493
>or it'll get bumped to SD4
If Stability AI lives to make that at all. Aren't they deep in the red? They are probably screwed, but only time will tell.
>>
julien is shit
>>
>>100116082
good dataset, train on copyrighted artists and characters, and keep tagging similar to the base model. if you do have to dig through booru tags, be aware that there will be a conflict between the natural language of the model and the tags you might end up using. booru style tags were a good solution to a dumb model problem. as the model gets smarter, tags like that are going to cause more harm than good.
>>
caring about regular posters' drama is very very low iq
>>
File: tmp19bid3n3.png (609 KB, 768x1024)
>>
File: ..png (525 KB, 672x384)
>>
>>100116532
> as the model gets smarter, tags like that are going to cause more harm than good
I don't think that's a fact that has been objectively demonstrated anywhere.

You can create a system where this is the case, but if it's a competent system, why wouldn't it be able to learn via tags as well as via natural language? If anything, the natural language people use is less exact.
>>
>>100116516
Buy them for a dollar when they collapse, release the assets, the Internet finishes the job.

>>100116532
Natural language for the win.

Are we not able to get SD to understand that
>these, are, tags, just, put, them, somewhere
and
>when the line of text passes a basic grammar check it's natural language time
...?

Hell, give us two sets of prompts. Natural prompt, tags prompt, and negatives for those. Boomers and Boorus will be happy, and AI kings will master working both together.
>>
>>100116614
>the Internet finishes the job.
With what money, programmers, and hardware?
>>
>>100116587
Even given how English works and how English speakers use it, you need to be able to point many alternative tokens at the same concept where it overlaps.

There should be no issue whatsoever if a tag is used too, if anything the tag should often have the most precise idea of a concept.

>>100116614
Obviously both, but let's also note that among search systems tags have been far more successful and useful so far than natural language boomer descriptions. The issue might as well be on the human side, with people generally having more clear agreement on what tags mean than what every word in English actually visually or structurally or otherwise means exactly.
>>
>>100116587
>If anything natural language people use is less exact.
You mean "natural language" of undereducated promptlets?
Idiot-proofing products is the stupidest, most pointless thing to try and do, countless companies can tell you that.
>>
File: 0.jpg (202 KB, 1024x1024)
>>
>hyperfine intricate details
>>
So what should happen if a prompt has contradictory tokens? Like, say, black and blue hair, or holding a rifle and also crossing arms?
>>
File: ComfyUI_temp_mksgg_00061_.png (3.15 MB, 1360x1744)
>>
>>100116680
>with people generally having more clear agreement on what tags mean than what every word in English actually visually or structurally or otherwise means exactly.

People who want words to have specific meanings use the tag prompt; people wanting to control composition and style would go into natural language. It's really hard for a tag based prompt to respect a described composition. You'll get the things you asked for, but positional relationships are a lost cause. But if the AI could be trained in a context where "X on top of Y" is learned, so that "book on top of table" and "flowers on top of grave" and "top hat on top of dancing frog" all mean that what precedes "on top of" is higher on the canvas than what follows, we ought to then be able to use tag prompting to specify exact content and natural language to put those things into the drawn space, rather than rolling seeds till one accidentally gets the arrangement right.
>>
File: elf_0017f.jpg (1.1 MB, 1664x2432)
I like both. When I am making a posed girl, booru tags are great. With pony it can also do simple multicharacter stuff when it hews closely to the kind of images boorus feature. When I am working on a more complicated image with multiple subjects and a lot of fine background details, it starts to get harder and harder to represent this with just tags. Ideally you'd use both: natural language for the base gen to set up the composition, then a fine tuned model for people using tags to inpaint their poses and personal details precisely.
>>
File: ..png (1.32 MB, 1216x832)
>>
File: ComfyUI_04132_.jpg (656 KB, 2312x1920)
>>100116845
dual colored hair unless there are two subjects. for the rifle, you can hold a rifle with crossed arms at rest if it's slung correctly
>>
File: g_0004.jpg (438 KB, 1983x1983)
>>
Now that I have released my jizz to fat Frieren pussy caught in a mimic trap all is good in the world again.
>>
>>100116754
>Idiot-proofing products is the stupidest, most pointless thing to try and do, countless companies can tell you that.
Only fools use fool-proof products.
>>
it is with great pleasure that i announce cute horse girls with cute horse tails are welcome in this thread
>>
>>100117057
So how about 2girls and solo?
>>
>>100117148
youtube or follow the github instructions in the OP links. get things running first and then go peruse some models on civit

>>100117150
that's a good one. dunno
>>
>>100116279

FOUR TWENTY BLAZE IT.
>>
>>100117140
fuck off to your furry boards gooner
>>
>>100117185
edibles were too lit yesterday, so today is the stoner posting
>>
File: ComfyUI_temp_mksgg_00096_.png (3.85 MB, 1264x1848)
>>
>>100117311
West coast gooners have just woken up from the drug/goon overload last night. Everything goes to shit 11am west coast time.
>>
>absolutely outstanding image
>>
>>100116200
Sigma 300 cap
>>
>>100117312
cool helmet
>>
world is gay, can't wait for the sweet release of death
>>
File: 3569451300.png (2.02 MB, 944x1184)
>>100115936
? Are you having problems with it? When using PAG it's recommended to lower the CFG a bit
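fwiw a starting point that worked for me (numbers are just my own guess, tune per model): PAG scale around 3 with CFG dropped to about 4-5 instead of the usual 7, sampler and steps unchanged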
>>
>>100117422
>claims to hate living
>doesn't even kill himself
poser
>>
>>100116857
I'd personally prefer to use positional / relational / logical information with the tags even then rather than natural language, but it doesn't actually work.
>>
>>
File: 1.jpg (98 KB, 1080x1440)
>>
File: 1689587347683676.png (1.48 MB, 1008x1008)
>UnboundLocalError: cannot access local variable 'h' where it is not associated with a value

What is this message telling me? It happens in img2img when I try to upscale beyond this resolution. Seems like if I want a higher resolution, I need to go with another program, then bring that image back in and run it through img2img again to get some detail.
>>
>>100117484
>but it doesn't actually work
Which is kind of a problem.

We need some ChatGPT action that can let us do something like run a gen, then repeat it after adding fixes to the boomer prompt (like "four visible fingers and one thumb on right hand of leftmost woman") and have it actually understand that, where it drew a hand over there, six fingers and two thumbs was a bit too ambitious.
>>
File: 1.jpg (155 KB, 1080x1440)
>>
File: ComfyUI_temp_mksgg_00110_.png (3.7 MB, 1552x1552)
>>
>>100117758
nicely surreal and creepy
>>
File: upscaledturbo_00076_.png (1.18 MB, 1024x1024)
>>
>>100117758
>>100117872
Another progressive rock album cover for songs we'll never get to listen to.
>>
File: 0-AFH074262024.jpg (96 KB, 1288x1288)
>>
File: ComfyUI_temp_mksgg_00119_.png (3.89 MB, 2040x1160)
this is really just retreading old ground applying new gen settings
>>
Anyone feel like proompting my schizo (legit) vision I had
a dragon, of the DB type but so fucking massive in the sky that I perceived it as a god
>>
File: 1.jpg (114 KB, 1152x864)
>>
>>100117758
p cool
>>
>>100117920
my Trypophobia
>>
>>
What the fuck causes regional prompter to gen slightly off pictures sometimes?
>>
File: upscaledturbo_00077_.png (1.24 MB, 1152x896)
>>
File: succ_0007f.jpg (1.1 MB, 1664x2432)
>>100118145
What do you mean by slightly off?
>>
>>100118178
Like occasionally it ignores some of the prompts and creates a "generic" looking picture.
>>
File: ComfyUI_temp_mksgg_00134_.png (3.29 MB, 1160x2040)
>>
Can I not use multiple style loras with regional prompter on forge without it looking completely fucked?
>>
>>100118216
Might be a seed that just doesn't look like what you want it to find. Is it deterministic?
>>
>>100118309
It's not a seed; it seems to be some random words in some particular order that fuck with it.
>>
File: ComfyUI_temp_mksgg_00137_.png (3.84 MB, 1160x2040)
>>
File: 1713730702127.jpg (73 KB, 768x1152)
>>
File: file.png (1000 KB, 960x1088)
>>100117573
I wonder if something like that will show up eventually. Would be nice.
>>
File: ComfyUI_temp_mksgg_00138_.png (3.8 MB, 1160x2040)
>>
File: ComfyUI_00162_.png (1.78 MB, 1216x832)
>>
File: ComfyUI_temp_mksgg_00142_.png (3.45 MB, 1160x2040)
>>
File: ComfyUI_temp_mksgg_00140_.png (3.86 MB, 1160x2040)
>>
File: ..png (466 KB, 672x384)
>>
the gaunaburger, only at toha heavy agriculture
>>
File: upscaledturbo_00078_.png (1.06 MB, 896x1152)
>>
File: ComfyUI_temp_mksgg_00151_.png (3.28 MB, 1160x2040)
>>
File: seraphic_chicken.png (1.25 MB, 992x1456)
is there any model that actually creates proper pixel art without errors?
>>
>>
File: dena_00051_.png (2.63 MB, 1728x1344)
>>100117573
instruction-based image editing is a thing but I dunno what resources are actually good
https://github.com/ali-vilab/Ranni
https://github.com/modelscope/scepter
>>
File: 00434-2827966013.jpg (46 KB, 672x1440)
>>
File: ComfyUI_00282_.png (1.21 MB, 1216x768)
>>
File: ComfyUI_04227_.png (2.55 MB, 1920x2312)
>>100118299
is there no way to concat the loras?
>>
File: pixelart.jpg (153 KB, 1024x1024)
>>100118644
slick

>>100118689
what even is going on here?

>>100118807
depends on what you consider an error. if it's a perfect pixel grid that's a matter for postprocessing
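e.g. a quick-and-dirty postprocess to snap a gen to a clean grid (PIL; the cell size is an assumption you'd eyeball per image):

from PIL import Image

img = Image.open("gen.png")
cell = 8  # assumed size of one "pixel" in the gen
# nearest-neighbour downscale collapses each cell to a single colour,
# nearest-neighbour upscale blows it back up into perfect squares
small = img.resize((img.width // cell, img.height // cell), Image.NEAREST)
small.resize((img.width, img.height), Image.NEAREST).save("gen_grid.png")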
>>
File: ..png (1.31 MB, 1216x832)
>>
>>100117573
i think that'll either be when parameters are high enough to sufficiently capture the nuance and specificity of english, and/or when/if image editors build up enough controlnets and quick/dirty interactive edits to corral the models
>>
File: 00666-TFT_2601701.png (3.89 MB, 2560x1536)
>>
File: pixelart2.jpg (181 KB, 1024x1024)
>>100118934
https://github.com/hako-mikan/sd-webui-regional-prompter#latent
>Slower, but allows separating LoRAs to some extent.
But also not completely. So perhaps you can't really with this.
>>
File: ComfyUI_temp_mksgg_00164_.png (3.02 MB, 2040x1160)
>>
>>100118914
why would you do this
>>
File: upscaledturbo_00079_.png (1.14 MB, 1024x1024)
>>
File: ComfyUI_04239_.png (2.79 MB, 1920x2312)
>>100118988
there was an attention couple one that came out not too long ago. maybe try that instead?
https://github.com/Haoming02/sd-forge-couple
>>
>>100119082
jesus christ what is that thing
>>
File: ComfyUI_04219_.png (667 KB, 896x1200)
>>100119098
>jesus christ what is that thing
picrel is a hint
>>
File: dena_00050_.png (2.34 MB, 1728x1344)
>>100119018
I find his gens rather fascinating
>>
File: grid-0006.jpg (1.38 MB, 3600x3200)
>>
>>100119116
Actually yes
>>
File: 0-AFH079262024.jpg (189 KB, 1288x1288)
What's for dinner lads
>>
>>100118939
I just want to mix them but I see a sharp degradation when using forge. One Lora is fine but two style loras don't play well
>>
>>100119163
Lotsa Spaghetti!
>>
File: upscaledturbo_00080_.png (1.34 MB, 1024x1024)
>>
File: 0-AFH103252024.jpg (278 KB, 1288x1288)
>>100119276
What will you put on it?
>>
File: file.png (1.62 MB, 1025x1026)
>>
File: ComfyUI_04265_.png (3.17 MB, 1920x2312)
>>
File: ..png (474 KB, 672x384)
>>
File: file.png (284 KB, 1920x1080)
>>100119366
>>
File: ComfyUI_04268_.png (3.65 MB, 1920x2312)
>>100119403
kek
>>
File: catbox_5po1hy.png (3.32 MB, 1360x1744)
>>
File: ComfyUI_04278_.png (3.77 MB, 1920x2312)
>>
File: ComfyUI_04270_.png (920 KB, 960x1152)
>>
File: cover-4781384115560542.png (312 KB, 512x512)
>>100106072
Thank you for the helpful guidance! I'll try to apply what you've elaborated upon.

>>100107578
>>100107667
>>100107688
>>100107726
>>100107802
Ngl, this is my favorite thus far in these threads. Mysterious anon with godlike gens, please show yourself (and give me your Patreon kek)
>>
File: 0.jpg (190 KB, 1024x1024)
>>
>>100119474
brilliant glitch dada collage
>>
>>100119565
he's on a 3 day vacation
>>
File: ComfyUI_04303_.png (3.52 MB, 1920x2312)
>>
File: ComfyUI_00428_.png (1.42 MB, 1216x768)
>>
>>100119690
momentous work, jules
>>
File: ..png (547 KB, 672x384)
>>100119699
my humble abode
>>
File: dena_00049_.png (2.57 MB, 1728x1344)
>>100119699
>9x8ft one room house
>1.4mil
but it's in a nice neighborhood!
>>
File: ComfyUI_04291_.png (946 KB, 960x1152)
>momentous work, jules
>>
>>100119699
indistinguishable from reality, gj.
>>
File: 00146-2368601654.png (1.61 MB, 896x1088)
>>
File: ..png (1.31 MB, 1216x832)
>>
File: grid-0007.jpg (1.48 MB, 3600x3200)
>>
File: upscaledturbo_00081_.png (1.27 MB, 1024x1024)
>>
File: ComfyUI_00480_.png (1.56 MB, 1216x768)
>>
File: 0.jpg (551 KB, 1024x1024)
>>
>>100119324
sliced bananas and pineapple
>>
File: dekm_00057_.png (3.38 MB, 1344x1728)
has anyone tried out kohaku epsilon? I've been finding it rather hard to work with and dunno if it's just me or if it's a weak model
>>
File: grid-0009.jpg (1.96 MB, 3600x3200)
>>
>>100115837
>>StyleBooth: Image Style Editing with Multimodal Instruction
>https://ali-vilab.github.io/stylebooth-page/
i am once again asking if anyone here has successfully tried this out, and if so, can you catbox a working python or google colab notebook
>>
>>100120089
It just seems strictly worse than animagineXL3.1 in every way.
What a shame.
>>
>>100120296
be the change you want to see
take a dive and let us know how it goes
>>
File: dekm_00055_.png (2.91 MB, 1344x1728)
>>100120296
we have enough followers. we need a leader

>>100120366
thats exactly how I feel about it :(
>>
>>100119565
>Shitter with shit opinions
Fuck off
>>
>>100120371
>>100120394
i'm trying but i'm retarded with python and dependencies n sheeit and I can't figure out why, even when i manually pip install torch==2.2.1, I get an error saying ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
torchaudio 2.2.1+cu121 requires torch==2.2.1, but you have torch 2.0.1 which is incompatible.
torchtext 0.17.1 requires torch==2.2.1, but you have torch 2.0.1 which is incompatible.
>>
File: ComfyUI_00679_.png (1.75 MB, 1216x768)
>>
>>100115740
Am I the only one getting way better outputs with 1024x1024 on Pony compared to e.g. 896x1152?
>>
>>100120454
I find pony works best at 768x1280 however 1024x1024 isn't bad
>>
File: dekm_00054_.png (3.07 MB, 1344x1728)
>>100120437
this hiking trail is only a few miles from where I live. there's benches down near the cliff where you can watch the waves crash
>>
>>100120437
are you just feeding in a real picture at low denoising to get the filename, or is there a "weekend away snapshit" lora out there
>>
File: ComfyUI_00697_.png (1.46 MB, 1216x768)
>>100120544
check out the boring reality lora. it advises to use it with the base sdxl model but I've been using it with sds_film
>>
>>100120571
NTA but cool, I might take this for a spin
I was eying the VHS ones for a bit to see if I could make some creepy gens but this also tickles my fancy
>>
>>100119659
thanks, heres a portrait just for you
>>
File: this_ones_fine_i_guess.png (1.09 MB, 832x1216)
>>100120394
It's fine for 1girl stuff I guess, but I feel the results are always worse than what Animagine would have produced.

It fucks up hands a lot more and losing good gens to that always sucks.
I'll keep trying it, but I don't have too high hopes desu...
>>
File: 00152-2729173364.png (593 KB, 768x512)
>>
>>100119660
Very specific answer desu
>>100120401
I am a shitter. We all gotta start somewhere.
>>
>>100120653
Unfortunately for you there is no hope based on your taste.
>>
File: dekm_00052_.png (2.96 MB, 1344x1728)
>>100120647
the only thing I've found I really like about it is that it does really interesting manga layouts. but then it blunders all the details so it's worthless. maybe I should do a dual-model workflow with KE for the first pass and animagine for the hires
>>
>>100120697
I want to know what it says
>>
File: 00875-TFT_26016757.jpg (1.27 MB, 2560x1536)
>backgrounds in pony
it's slightly better after I experimented with some merges but still horrible compared to 1.5
>>
>>100120755
that looks fine and fitting for the character's illustrated style? it looks better than most 1.5 garbage. the background shouldn't be as detailed as or more detailed than the character, that's one of the telltale signs of 1.5 ai slop: overly detailed nonsensical backgrounds
>>
File: dekm_00050_.png (1.92 MB, 1344x1728)
>>100120731
sadly we'll never know what the Ai was thinking
>>
File: 00196-[TFT]-878290469.png (1.75 MB, 1024x1536)
>>100120793
hm well compared to this 1.5 picture, I think the background looks better
>>
>>100120755
>horrible compared to 1.5
I fucking HATE when my backgrounds are consistent. I won't even give you any help cause you're trollin
>>
File: 00098-587887629.png (559 KB, 512x768)
>>
File: bandage.png (1.52 MB, 832x1216)
>>
>>100120806
that is rather subjective, and you are comparing two completely different types of shots
>>
So is stable diffusion 3 released or not?

What does API release mean?
>>
File: cute_but_flawed.png (1.02 MB, 832x1216)
>>100120697
What I've found so far is that it's a lot better at generating zouri and tabi (the kinds of sandals and socks miko wear).
Other models always render generic socks and individual toes and that always bothered me.
>>
>>100120859
no, and it means ignore it until they actually release
>>
>>100120859
> What does API release mean?
money, and the usual free pass of coping that it isn't the final version, just like XL on clipdrop. also don't bother with SD3, the license is a mess.
>>
>>100120855
the picture he posted that "looks better" makes 0 sense. why is there a 40 foot sand dune behind that rock formation? why does that rock formation have a perfectly straight pillar? why does that rock formation have a gun trigger? etc etc etc
>>
>>100120859
SD3 doesn't matter until 6mo+ after people can start training models for it
>>
>>100120904
>sand dunes are huge
>wtf are ruins
Here's your (You) since you're starving
>>
>>100120890
>the license is a mess.
What changed between the license for SDXL and the one for SD3?

If they release the model people are gonna fine tune it anyway, license or not
>>
File: upscaledturbo_00082_.png (1.27 MB, 1024x1024)
>>
>>100120422
you need matching versions of all the torch packages. add torchaudio==2.2.1 and torchtext==0.17.1 (the torchtext release that pairs with torch 2.2.1, per your own error output) to the install command, e.g.:
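something like this should line them all up (the cu121 wheel index is an assumption based on the +cu121 in your log):

pip install torch==2.2.1 torchaudio==2.2.1 --index-url https://download.pytorch.org/whl/cu121
pip install torchtext==0.17.1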
>>
File: portrait.png (927 KB, 832x1216)
...It *does* generate some pretty cute images, can't deny that...
>>
File: file.png (163 KB, 256x457)
>>100120931
>non Euclidian backgrounds are LE GOOD
>>
File: dekm_00049_.png (2.69 MB, 1344x1728)
>>100120984
I think its sole purpose is "portrait of character" and it completely falls apart if you try to do anything else
>>
File: 2391936076-3356790775.jpg (929 KB, 2688x1536)
I wonder if SD3 has the classic stable diffusion tendency that when you put 'elf' in the prompt it wants to give you a cross between green christmas elves and keebler elves.
>>
>>100120957
60 year old saggers on a 20 year old
>>
>>100121003
What? Image generations aren't perfect?
oh my godddddddddddd
>>
File: astolf.png (1.12 MB, 832x1216)
>>
File: ..png (477 KB, 672x384)
>>
File: upscaledturbo_00083_.png (1.43 MB, 896x1152)
>>
>>100121051
lol
lmao
>>
>>100121061
you're clutching your black and white tv and screaming that it's better when anyone with eyes can see that you're wrong.
>>
>>100121108
What are you even going on about?
More (You)'s for the starving third-worlder
>>
File: dekm_00048_.png (2.64 MB, 1344x1728)
>>
File: 000000_12046_.png (2.2 MB, 1434x932)
>>100121096
Nice
>>
File: 0.jpg (387 KB, 1024x1024)
>>
File: ..png (559 KB, 672x384)
>>
File: upscaledturbo_00084_.png (1.52 MB, 896x1152)
>>
File: 00068-TFT_26016762.png (3.77 MB, 2560x1536)
>>
I don't understand why the amount of pictures you have changes the amount of steps necessary to train a lora.
>>
File: ..png (1.22 MB, 1216x832)
>>
File: ComfyUI_00798_.png (1.43 MB, 1024x1024)
>>
>>100121280
inverse fletchet cosine
>>
File: variety.png (334 KB, 1537x865)
>>100121280
kek..
>>
>>100121280
the images themselves add steps dingus
(Training Images * repeats)/batch size * epochs
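worked example: 100 images * 10 repeats = 1000 steps per epoch, / batch size 2 = 500, * 10 epochs = 5000 total steps. add images and the total grows unless you cut repeats or epochs to compensate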
>>
So I've been training a few LORAs on the same dataset recently and I can't help but notice how larger network ranks are directly tied to the quality of the output.
I feel like people recommending anything less than the largest rank your GPU can handle is just vramlet cope.
>>
>>100121444
post them then faggot
>you wont
It's all larp
>>
>>100121444
Thank you for saying so, it's great to hear this kind of information. Do you have any comparison between network ranks? It'd be great to see evidence of the kind of difference it makes.
>>
>>100121469
Gimme a moment, I'm training some right now so I can't gen any comparisons. But I stand by what I said.
>>
>>100121362
trudeau blackface lora when?
>>
>>100121444
"optimal" is def higher than people recommend but it isn't as simple as higher = better.

On pony I have found 128 best, I can train at 256 but it starts to look fucked.

People saying train it on 8 are retards though
>>
>>100121487
alright cool, I'm curious about the science you're doing/have done
>>
>>100121509

True, 256 starts basically reprinting the training data very fast but in fucked up ways.
>>
>>100120935
you need to pay for commercial use
>>
>>100121505
When I grab a 3090.
>>
>>100121556
How would they ever police that?
>>
>>100121509
>>100121524
I used to do requests for LORA training and trained at 128 network and 64 alpha, but people complained about the size of the LORA file. I ended up reducing it to 64/32. If there is evidence that 128 is better though I'd definitely want to switch back.
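for reference, in kohya's trainer that's the --network_dim / --network_alpha pair; my old setting was something like this (every other arg omitted):

accelerate launch train_network.py --network_module networks.lora --network_dim 128 --network_alpha 64  # plus the usual model/dataset args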
>>
>>100121580
>nooo not the heckin 200mb file
tell them to keep themselves safe
>>
>>100121564
sue you if you're using an sd3 generated image commercially?
desu, that'd probably end in a big court loss for the generative ai side though...
>>
>>100121564
emad unironically said "honesty"
>>
>>100121595
the vast majority of celebrity loras on civit are between 800 and 900 MB. It's a real problem.
>>
>>100121651
honestly, is it?
it's like 50$ per TB of storage at most
>>
>>100121444
>>100121509
>>100121524
>>100121580
>>100121595
>>100121651
you can resize them after you train them
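e.g. with the resize script in kohya's sd-scripts, something like this (filenames and target rank are made up for illustration):

python networks/resize_lora.py --model big_lora.safetensors --save_to lora_r32.safetensors --new_rank 32 --device cuda --save_precision fp16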
>>
>>100121651
>>100121663
>>100121595
based and I don't give a shit about 5 cents of storage space pilled

>>100121671
or I can simply not
>>
>>100121663
storing the loras isn't the issue. loras have to be loaded in VRAM. can quickly run out of room with 1gb loras.

>>100121671
Interesting. googling.
>>
>>100120965
thank you. trying this now
>>
>>100116832
>>100117355
>maximum details
>extreme hyperrealistic details
>trending on artstation
kekt
>>
>>100121676
>or I can simply not
it retains a lot of the quality without taking up so much space. there isn't a reason to keep them that big

>>100121686
>storing the loras isn't the issue. loras have to be loaded in VRAM. can quickly run out of room with 1gb loras.
this too
>>
>>100121722
>score_9
I bet this will start showing up forever in future models completely unrelated to pony
>>
>>100121740
everyone universally hates it so no
>>
File: 00212-3159343615.png (2.33 MB, 1072x1376)
>>
>>100121759
more like people will forever blindly copy prompts from images and a huge % of images over this time period will have that
>>
>>100121766
this
>>
File: 00166-TFT_26016766.png (3.53 MB, 2560x1536)
>>
File: dekm_00047_.png (2.81 MB, 1344x1728)
>>
>>100121740
>>100121759
I'd want to train a non-cucked SD3 model so people don't have to deal with Pony anymore, but it would need to not suck to compete. I don't mind spending some money renting A100s for the training but the dataset needs to be well done and that seems like a challenge.
>>
How about some chrome?
>>
>>100121793
if you have a budget what you do is literally hire people (probably Indian) to tag massive amounts of data for you.
It isn't tech that is the limitation for a great model, it's datasets
>>
>>100121793
It will need to be a coordinated group effort
>>
File: dekm_00045_.png (3.02 MB, 1344x1728)
>>100121804
me in the back (I'm an orb)
>>
>>100121766
to be fair, in the early days of base 1.5, there were some decently complex negative prompts floating around that worked much better than embeddings, and the "amazing quality, masterpiece, award-winning photography" prompts did make a decent difference in quality when trying to gen photoreal people
>>
>>100121793
we need a way to collaboratively put together datasets from all our lora training without retards shitting it up. the latter is the hard part
>>
File: 0.jpg (246 KB, 1024x1024)
>>
File: dena_00047_.png (2.04 MB, 1728x1344)
>>
>>100121846
The only way to really vet people and pay for the training of such a model very quickly begins to resemble something like a real company, except its employees don't get paid.
>>
>>100121887
>>100121846
which is why you just don't bother and pay up to the pajeets
>>
File: dena_00046_.png (2.35 MB, 1728x1344)
>>
>>100121894
>pay up to the pajeets
that's how you get LAION
>>
What I don't understand about pony model tags is:

> score_8_up
Does this mean score 8 and up? If that's the case it shouldn't even need a score_9? The advice I got when I first started using it, can't remember from where, said to use something like:

> score_9,score_8_up,score_7_up,score_6_up
But this seems redundant if "up" means what I think, so I assume I'm wrong. Also do you have to put a BREAK after the score stuff? At first I was doing that, but then I stopped partly because it was tedious to manage that in comfyUI and it didn't seem to make a whole lot of difference.
>>
>>100121906
>>100121887
>>100121894
alright let's see: to have a great model I need to just compete in the same space and style as the billion dollar companies, but do it for free with $2k of equipment instead of microsoft's $1 billion.

I think I will continue to wait for others to provide the models and focus on loras
>>
>>100121918
Check the PonyXL CivitAI page - the trainers literally admit there that they fucked up the training, so the score numbers are broken and don't function correctly/logically. Literally the only reason it's used is because there's no better alternative XL model. That's why I'm really hoping to either train something better or that someone else will, because as soon as there's something better that's not cucked there will be no reason to use PonyXL ever again.
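which is also why everyone just pastes the whole chain instead of a single tag, e.g. (the commonly copied template; double-check the civitai page for the current wording):

score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, source_anime, 1girl, ...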
>>
can i train on sdxl_vaefix or do I have to train on the model without the built in vae?
>>
>>100121840
You're highly reflective. Good.
>>
>>100121936
they use synthetic datasets like they say they use in their papers
>>
baker...
...baker?
b
a
k
e
r
>b
>a
>k
>e
>r
>>
>>100121887
>>100121846
>>100121838
>>100121812
>>100121968
>>100121936

The process I was thinking was to grab a lot of booru images since those are easy, use a Python script to clean/synchronize tags between different boorus, and then use an AI to convert the tags into natural language which should greatly improve prompting based on the ELLA research: https://ella-diffusion.github.io/

That dataset could be supplemented with more manually gathered images to cover characters/styles/concepts people want, and those images would need to be fed to an AI for captioning too.

I'm thinking I'd need some huge hard drives if I'm going to store the dataset locally, maybe pay for a GPT-4 subscription to caption safe images, and set up something local to caption explicit images.

For people with any training experience, does that seem reasonable?
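Rough sketch of the tag clean/sync step I have in mind (Python; the alias table is hypothetical and would really be built from each booru's tag alias dumps):

# map each site's spelling to one canonical tag (hypothetical entries)
ALIASES = {
    "long_hair": "long hair",
    "longhair": "long hair",
    "1girls": "1girl",
}

def clean_tags(raw: str) -> list[str]:
    tags = []
    for t in raw.split():
        t = t.strip().lower()
        t = ALIASES.get(t, t.replace("_", " "))
        if t and t not in tags:  # dedupe while keeping order
            tags.append(t)
    return tags

print(clean_tags("1girls long_hair smile smile"))  # ['1girl', 'long hair', 'smile']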
>>
>>100122115
would it improve prompting with tags or just make it better at understanding natural language though. I don't think you would get the result you are hoping for.
>>
>>100121714
>>100120965
ok this seems to work but turns out the thing i'm trying to run is not the thing i actually want to run lmao

>>100121862
(painting, traditional media)
>>
>>100122133
"1girl, apple" doesn't provide enough information to the AI to understand location, color, etc. If a language model can turn that into "1girl holding a red apple in her right hand" then the resulting model is leaps and bounds ahead in understanding prompts. That's what the ELLA research found; I was going to post one of the pics at https://ella-diffusion.github.io/ but we hit the image limit.
>>
it's actually over this time
>>
>please wait before making a thread
>>
>>100122188
4 u
>>
Next thread
>>100122230
>>100122230
>>100122230
>>
>>100116195
i like this
>>
>>100120089
more intelligible paneling than oda
>>
>>100120755
Try Worldly lora, DPM++ 3M SDE, and maybe perturbed attention guidance



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.