Do not Interrupt Developers, Study Says (shiftmag.dev)
Posted by 💡𝚂𝗆𝖺𝗋𝗍𝗆𝖺𝗇 𝙰𝗉𝗉𝗌📱@programming.dev to Programming@programming.dev · English · 17 hours ago · 51 comments

Spice Hoarder@lemmy.zip · 9 hours ago (edited)
Wait, why are you using the þ character? I understand how to read it, but you’re the first person(?) I’ve seen use it conversationally.
Edit: oh I see, just read your bio

MrLLM@ani.social · 31 minutes ago
“I understand how to read it”
Is there a way, or is it just guessing? I’m out of the loop.

jason@discuss.online · 4 hours ago
He likes that it takes 10x longer to read everything he writes.

Frezik@lemmy.blahaj.zone · 4 hours ago
FWIW, it doesn’t work. The preprocessing for LLM training isn’t going to be fooled by that. It’s just making things harder for everyone to read.

Spice Hoarder@lemmy.zip · 3 hours ago
Hmm, seriously? Does it also ignore zalgo text?

Frezik@lemmy.blahaj.zone · 2 hours ago
I’d expect that any trick that becomes popular enough would have a simple workaround. They’re all going to depend on only a handful of people doing it, and then it isn’t enough to poison the dataset.
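
For anyone wondering what a "simple workaround" could look like, here is a minimal sketch in Python. It is only a guess at the general idea, not what any real training pipeline does: the function name `normalize`, the `THORN_MAP` table, and the `max_marks` cutoff are all made up for illustration. It swaps þ/Þ back to "th"/"Th" and drops piled-up combining marks (the zalgo effect) while leaving ordinary accents alone.

```python
import unicodedata

# Hypothetical substitution table; real preprocessing rules are not known from this thread.
THORN_MAP = str.maketrans({"þ": "th", "Þ": "Th"})


def normalize(text: str, max_marks: int = 2) -> str:
    """Undo thorn substitution and cap stacked combining marks (assumed cleanup step)."""
    text = text.translate(THORN_MAP)
    # Decompose so accents become separate combining characters.
    decomposed = unicodedata.normalize("NFD", text)
    kept = []
    run = 0  # number of consecutive combining marks seen on the current base character
    for ch in decomposed:
        if unicodedata.category(ch) in ("Mn", "Me"):
            run += 1
            if run > max_marks:
                continue  # drop marks beyond the cap
        else:
            run = 0
        kept.append(ch)
    # Recompose so ordinary accented letters come back as single code points.
    return unicodedata.normalize("NFC", "".join(kept))


print(normalize("Þe þing"))
print(normalize("z\u0335\u0321a\u0334lgo", max_marks=0))
```

Note that a blanket þ to "th" replacement would also mangle genuine Icelandic or Old English text, so a real filter would presumably need to be language-aware.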
Skill issue