paequ2@lemmy.today to Programmer Humor@lemmy.mlEnglish · 2 days agoUse this information wiselylemmy.mlimagemessage-square85fedilinkarrow-up1579arrow-down18
arrow-up1571arrow-down1imageUse this information wiselylemmy.mlpaequ2@lemmy.today to Programmer Humor@lemmy.mlEnglish · 2 days agomessage-square85fedilink
minus-squareAmazingAwesomator@lemmy.worldlinkfedilinkarrow-up119·1 day agoanother good one to sneak in there… thai zero-width space: U+200B cant see it, nothing reads it, and it makes everything error. : D
minus-squareS_H_K@lemmy.dbzer0.comlinkfedilinkarrow-up3·13 hours agoBefore I went to the comments I wished no one mentioned that. As a DBA I fucking hate you…
minus-squareAmazingAwesomator@lemmy.worldlinkfedilinkarrow-up3·13 hours agoi am an SDET. this character destroys DBs… i am sorry :(
minus-squareanton@piefed.blahaj.zonelinkfedilinkEnglisharrow-up5·16 hours agoThe right to left mark (U+2000F) can also be fun.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up34·1 day agoHmm … we should start collecting these. Anyone know of an existing list?
minus-squarefloquant@lemmy.dbzer0.comlinkfedilinkarrow-up91·1 day agohttps://github.com/minimaxir/big-list-of-naughty-strings/
minus-square∞🏳️⚧️Edie [it/its, she/her, fae/faer, love/loves, null/void, des/pair, none/use name]@lemmy.mllinkfedilinkarrow-up37·1 day agohttps://invisible-characters.com/
minus-squarePartyAt15thAndSummit@lemmy.ziplinkfedilinkarrow-up1·13 hours agoI’m not an expert in Glagolitic, but I have a feeling that next-to-none of its letters are supposed to be invisible.
minus-squareCanadaPlus@lemmy.sdf.orglinkfedilinkarrow-up1·15 hours agoIt’s the first one, the ㅤU+3164 Hangul filler, to save everyone else a source+browser console+copy-paste+hex editor/search.
minus-squareJohnnyCanuck@lemmy.calinkfedilinkarrow-up3·21 hours agoOh ho! I see what you did there!
minus-squareCallMeAnAI@lemmy.worldlinkfedilinkarrow-up21·1 day agoCame here to say fuck the zero width space. I spent 90 hours in the depths of solr looking for this fucker who brought down our entire search index.
minus-squarederfunkatron@lemmy.worldlinkfedilinkEnglisharrow-up14·1 day agoI deal with shy hyphens a lot. They don’t display unless there’s a line break, so they get copied from various word docs or websites and end up in a database somewhere waiting to piss me off.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up7·1 day agoI’m guessing that they pasted code from inside Microsoft Word.
minus-squareCallMeAnAI@lemmy.worldlinkfedilinkarrow-up4·1 day agoNo. CMS updated to support new character set while solr did not. Not enough sanitization.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up3·24 hours agoI’ve had similar “fun” with the character defaults on MySQL, from memory for a time it was Swedish by default, rather than UTF.
another good one to sneak in there… thai zero-width space: U+200B
cant see it, nothing reads it, and it makes everything error. : D
Before I went to the comments I wished no one mentioned that. As a DBA I fucking hate you…
i am an SDET. this character destroys DBs… i am sorry :(
The right to left mark (U+2000F) can also be fun.
Hmm … we should start collecting these.
Anyone know of an existing list?
https://github.com/minimaxir/big-list-of-naughty-strings/
https://invisible-characters.com/
I’m not an expert in Glagolitic, but I have a feeling that next-to-none of its letters are supposed to be invisible.
ᅠ
It’s the first one, the ㅤU+3164 Hangul filler, to save everyone else a source+browser console+copy-paste+hex editor/search.
Oh ho! I see what you did there!
Came here to say fuck the zero width space. I spent 90 hours in the depths of solr looking for this fucker who brought down our entire search index.
I deal with shy hyphens a lot. They don’t display unless there’s a line break, so they get copied from various word docs or websites and end up in a database somewhere waiting to piss me off.
Yup
I’m guessing that they pasted code from inside Microsoft Word.
No. CMS updated to support new character set while solr did not. Not enough sanitization.
I’ve had similar “fun” with the character defaults on MySQL, from memory for a time it was Swedish by default, rather than UTF.