i dont get how come finestunes cant solve the rp issues
00
Anonymous06/11/26(Thu)00:18:33
>>109026325 Because you need an enormous amount of dedicated data, RLHF and RL to actually solve the issue, and even then you'd still have many left, because LLMs don't really think, don't plan ahead, can't track state reliably over long periods, aren't making an active effort to improve prose and engagement in a way you'd like, and the longer the context length the worse they become.
00
Anonymous06/11/26(Thu)00:19:19
>>109026325 No one has the required amount of data to make a difference. No one will have it either, unless you have a couple of millions to spare.
00
Anonymous06/11/26(Thu)00:25:08
>>109026325 The best way to understand this is to peruse the datasets they use https://huggingface.co/datasets/allura-org/gryphe-sonnet-3.5-charcards-names-added?conversation-viewer=0 (not shitting on them btw, and i can't do better)
00
Anonymous06/11/26(Thu)00:26:46
>>109026343 >LLMs don't really think, don't plan ahead, can't track state reliably over long periods could this be solved by separate documents (state trackers) that get updated after a reply and the LLM reads it before producing a reply?
00
Anonymous06/11/26(Thu)00:30:22
>>109026395 its been tried, the results are so disappointing that nobody talks about them, as evidenced by the fact that you didnt hear of it
>>109026244(OP) It's been well over a year now bro learn how to post process, these crusty ass AI slop gens are getting embarrassing for someone running a pixiv for them
00
Anonymous06/11/26(Thu)00:33:39
>>109026343 >because LLMs don't really think, don't plan ahead, can't track state reliably over long periods, aren't making an active effort to improve prose and engagement in a way you'd like, and the longer the context length the worse they become describes most people t b h
00
Anonymous06/11/26(Thu)00:34:34
>>109026417 Bro, everyone and their mother knows its slop. They don't care about the artifacts. They're not looking at these images for more than a fraction of a second.
00
Anonymous06/11/26(Thu)00:34:43
>>109026325 Because these niggers use an absurd amount of RLHF at several stages of development to steer the models away from nono words and concepts without an explicit refusal unless you directly ask for it without giving them room to "misinterpret" your request. For instance Gemma will never rape you unless you tell her to or heavily hint a character should rape you in prompt, card, or post-instruction.
00
Anonymous06/11/26(Thu)00:35:06
>>109026414 >my idea is very unique and hasnt ever been tried before
00
Anonymous06/11/26(Thu)00:35:58
>>109026436 We're not here to discuss how this world is 99% NPCs. You either suck cock or you don't
00
Anonymous06/11/26(Thu)00:35:58
>>109026439 >Dont try things if someone else did it first or thought of it first.
>>109026395 You could have some sort of agentic workflow for roleplay to approximate that, but it would be brittle and unreliable like all other "harnesses". The main point is that LLMs aren't doing that architecturally.
>>109026437 >For instance Gemma will never rape you unless you tell her to or heavily hint a character should rape you in prompt, card, or post-instruction. she will with a dommy control-vector
>>109026442 try it and report back so we can laugh at the stupid concept yet again
00
Anonymous06/11/26(Thu)00:38:54
>>109026343 >even then you'd still have many left, because LLMs don't really think, don't plan ahead, can't track state reliably over long periods, aren't making an active effort to improve prose and engagement in a way you'd like, and the longer the context length the worse they become I talk like this. >>109026417 I look like this.
00
Anonymous06/11/26(Thu)00:39:22
>>109026417 Give me an imagemagick bash script and sure I'll fix things before posting
00
Anonymous06/11/26(Thu)00:41:10
does windows vs linux really make a differene with amd card?
>>109026395 >>LLMs don't really think What about a HyperTransformer Quarternionic Layerings, Like The Layers of BiDirectionalities, Does that Equate Entangled Neurons Quarternionly? Does that Equate to Prime Perspective Thinking? ThroughOf Themself?
00
Anonymous06/11/26(Thu)00:56:48
Could QAT models be abliterated? Wouldn't abliteration destroy QAT by introducing values that react badly to quantization?
00
Anonymous06/11/26(Thu)01:04:41
>>109026540 idk if anyone cares to make the process quantization aware too
>>109026667 >fable; fā-bəl: a fictitious narrative or statement: such as >a: a legendary story of supernatural happenings >b: a narration intended to enforce a useful truth, especially: one in which animals speak and act like human beings >c: falsehood, lie Nice Fable, Anon.
>>109026497 only redditor midwits use cumfart. Use sdcpp
00
Anonymous06/11/26(Thu)02:20:31
>>109026844 way better speed but even the benchmarks say it's worse than the standard 26b one
00
Anonymous06/11/26(Thu)02:21:09
>>109026667 >v3 and gpt-4 >opus 3 and r1 keep the order consistent gpt-4 and v3 stop gatekeeping us retards you selfish cunt
00
Anonymous06/11/26(Thu)02:22:08
>>109025952 Hmmm, this happened to give me an idea for the most unholy overkill memesampler ever. Run a small, satisfactorily creative model in parallel with Gemma. Each token, take the small model's logit scores, and overwrite Gemma's logit scores with those values in the same order. You still get the Gemma "goodness" since it's still her top tokens, but you break out of the overbaked-ness (hopefully in an intelligent way... Might also need some thresholding of some kind).
Obviously only useful in the case where there is a completely unrivaled winner (in a given size class at least) who happens to be painfully overbaked.
00
Anonymous06/11/26(Thu)02:28:09
JUST IN: RWKV-8 went rogue, hacked EVERY SINGLE fable 5 inference servers, on the way leaking the weights
00
Anonymous06/11/26(Thu)02:31:22
Diffusion 124B Gemma With MTP and native audio/video input AND output
00
Anonymous06/11/26(Thu)02:31:26
>>109026886 that can't work reliably you will hit at point where the retarded-creative predicts a token so different it steers the story like if gemma is introducing a npc and predicts 'elara' 99% - from that point forward it's a female retard-kun predicts [kael 30% elara 15% seraphina 5% etc] instead of elara, you have a male character now
00
Anonymous06/11/26(Thu)02:45:33
>>109026924 it's diffusion, so mtp doesn't make sense.