Discover Top Posts Tagged with #encoding

Popular Recent

dunno that I ever talked about it here, but in the spirit of beginning as you mean to go on, I finished that first warp and cut it free from the loom just before the move.

11m of cotton warp and a whole bunch of twill sampling in mostly cotton, some rug wool, and one textured sparkly acrylic bit.

it's hemstitched at the beginning and end of every single piece and I think I want to cut it apart before the gentle cycle in the washing machine, and probably separate the wool rugs from the cotton fabric for lint's sake if nothing else. I have seen arguments for both cutting them apart and not...any one have thoughts?

glamour shots of unbathed beauty off the loom below the cut (worth clicking through on mobile, they are much better focused than the scrolling display).

#weaveblr #we are star stuff #twinkle twinkle little star #steganography #encoding

#comic #whiteboarding #encoding #programming

Okay so this is a background post about Text encoding, ASCII and Unicode

Text encoding is the process of turning characters to numbers. text encoding allows one to save text as computer data, and to move this data around.

It was understood very early on, that if every user will define their own encoding, no interface could use the data of another because one interface's "a" would be another interface's "p", and so the text would be read as gibberish.

and so, a long time ago (in the 1960s), in a continent far, far away, a standard for text encoding was invented: the American Standard for Character Interface Interchange: ASCII.

ASCII used the fact that in english, almost no characters exist, and so only needed to use 128 characters: each character took 7 bits (1s and 0s), and was sent over a wire. (notice, not everything is a character, there are also character like "delete" and "go down a line" here. this is not for displaying, this is for every interfaces)

Something to remember for later: the number 0 is encoded as NULL, basically "nothing". This is useful because sometimes you want to enter text with an unspecified length, and so you stick a NULL in the end, and the interface reading it reads until it sees a NULL, and all is well. this will be important later

Standard explained, technical info for nerds, go to the next red section to pass

ASCII is a wonderful standard. remember: everything in electronics is easier with powers of 2 (1,2,4,8,16,32 etc.) because of the way we save data (if you want I can explain this further); the first 32 characters are the control characters. want to check if something isn't a control character? check if it's 128 or bigger than 32, and you're done (both powers of 2). the lowercase characters are 32 + their uppercase counterpart. all the numbers have a byte in common. truly, a marvel of engineering.

Standard explanation end

All was well until computers hit the scene not too long after, and used bytes. a byte is basically a whole number whose value can be only from 0-255. they are the standard building block of computer memory, and they have 8 bits.

some countries, like France, used encodings compatible with ASCII, and used the final bit to encode their language's characters. different countries used different versions of encodings, some countries (like Japan) had multiple encodings for the same characters. each encoding used a different number of bits, and different letters for each bit.

But that is fine since, well, how often do you need a computer in London to use an interface in Tokyo? all is well.

Then the World Wide Web happens, and suddenly computers speaking different languages read and write complete garbage everywhere.

So an organization called the Unicode Consortium tries to solve the problem, and to create a unified symbol for all languages. They called the standard utf-8

This standard supports 1,114,112 different characters. at present, only around 10% of this data capability is actually used. this includes dead languages, and emojis (which is a wonderful story)

Standard explained, technical info for nerds, go to the next red section to pass

Issues to tackle in a universal text encoding standard:

The protocol must be backwards compatible with ASCII: if you are writing text in English, which is the language most users used, because ASCII is the standard for this language, your new standard must be readable as ASCII as well

The protocol must never send 8 zeros in a row, except for the NULL character, otherwise old computers will stop reading in the middle

You must be able to minimize space wasted: to create a universal standard one can just make every character 32 bytes long and call it a day, but you would waste a bunch of space that way, and space is expensive

You must be able to pass from letter to letter easily. no saving the index of each character in some sort of list.

english characters are just ASCII. no thinking there. the first bit is set to 0 and so it is very easy to spot

if not, here's what you do:

the first byte has its first bit set to 1, so it's not ASCII. from that point onwards, you count the number of remaining ones until a zero appears. in this case, 1. this is how many more bytes will come. from there on, the rest is data. the first 2 bits of every next byte would start with 10 until the character ends

let's say your character is 2 bytes long, here is how you would represent it:

110somec , 10haract

and when removing the headers, you'll have

somecharact

which will be some character.

let's say your character is 3 bytes long, here is how you would represent it:

1110some , 10charac , 10ter___

and when removing the headers, you'll have

somecharacter___

which will be another character.

if you wanna go back 1 character? just go back bytes until you find one that starts with something other than 10

no excess Nulls will appear because the only way to get 8 zeros

Standard explanation end

#ascii art #ascii entertainment #ascii #unicode #language #emoji #linguistics #hieroglyphs #programming #coding #encoding #standard

I find it funny that instead of continuing to pursue french or learning an actual language, since childhood I've focused on fonts, cyphers, video game languages, etc.

I can read and/or write in Ceasar Cypher (once I know the shift number, that is), A1Z26, Wingdings, Standard Galactic, and Kataru very well to fluently (I know I just started Kataru recently, but like I was fluent in the previously listed by 7th grade, this cypher works like Galactic but read right to left) I can read/write Atbash and That one font Ranboo used in his arg decently, but I'd need a translation sheet for the less common letters. I was fluent in the Ranboo font at one point, but I haven't used it in a while so it's hard to visualize the symbols.

I absolutely plan to continue this, next on my list is Elian Script. Dancing Men seems interesting, Hexahue seems fun, and Null seems useful. Willow script and Scrawlish are in progress. I'll gladly add any recommendations to my list too. Gonna collect ciphers like Pokemon at this point.

At some point, imma make my own! Could make my own arg one day, who knows?

#from the corner void #cyphers #cypher #cipher #ciphers #why are there 2 spellings of this which one is right #cryptography #encoding #decoding #don't really know how to tag this #do I tag each cipher too or nah?#eh who cares #secret languages #secret language

So, like I said I was, I'm re-encoding all the videos on my Jellyfin server as AV1 to save space. I've been doing that for about 4-5 days, straight, at this point. So, I'm gonna give a progress update at just how far I've gotten in those 4-5 days.

So far, I've managed to re-encode a singular folder that was about 13.72 GB.

I still have many other folders to go, that all total up to about 109.12 GB.

I'm going to be here forever.

On the bright side, re-encoding the videos as AV1 fixed some of the corruption that caused a lot of them to become unplayably stuttery.

Also, what I have finished re-encoding was once 10.15 GB. It is now 7.70 GB. So, while it is taking forever, I have managed to save over two whole gigabytes by simply running my videos through a program that essentially makes my computer get hot.

So with this level of success without even scratching the surface, I'm excited to see how much space I'll save when all is finally completed.

#rambles #update #data storage #av1 #video #videos #h.264 #encoding #shutter encoder #data archiving #jellyfin #home media server #server #jellyfin server

A few hints on solving the indecipherable-Nonsense puzzle.

so, for a bit of context: soon after posting the Indecipherable-Nonsense puzzle, i realized it was way too hard, so I wanted to give you all some hints as to how to solve it.

but of course, I couldn't make it that easy. so, instead of just giving it to you, I instead made a series of smaller puzzles that ramp up in difficulty more gradually, and give a few hints to each other and to the final puzzle along the way. I hope you all enjoy, and I wish you the best of luck!!!! :3

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

#rowdyposting #cryptography #cryptograms #decoding #encoding #secret message

If you can copy-paste a string of letters and symbols from one text field to another, or if the message you wrote on your device displays correctly when you send it to a friend, you've benefited from the background work of the Unicode Consortium. If you've ever gone into the "insert symbol" menu in a document and poked around some of those strange and beautiful symbols, from hieroglyphs to arrows to emoji? Yup, that's Unicode too.

In this episode, we get enthusiastic about how electronic devices know what symbols exist, aka character encoding! We talk about the massive list of symbols that your phone carries around, how that list (aka Unicode) came into existence, and why it's still growing a bit every year (it's partly about emoji but there's also so much more). Gretchen went to the annual meeting of the Unicode Consortium a few months ago and she got to show off her esoteric Unicode symbols scarf (yes, people liked it!) and learn many things, like the surprisingly complicated story of why Mongolian is still so hard to encode. Plus, our favourite obscure Unicode symbols, because there are just so many great ones to choose from. (Have one yourself? Share it with us on the Discord!)

Announcements

We're doing another Lingthusiasm liveshow on April 9th (Canada) slash 10th (Australia)! It will be a live Q&A for you, our wonderful patreons, all about fan fave topic: swearing! We'll be hosting this session on our Discord server, and it will be available as an edited-for-legibility recording as a bonus episode. Become a patron to join us for the liveshow.

LingComm Grants are back in 2022! These are small grants to help kickstart new projects to communicate linguistics to broader audiences. There will be a $500 Project Grant, and ten Startup Grants of $100 each. Apply here by March 31, 2022 or forward this page to anyone you think might be interested, and if you’d like to help us offer more grants, you can support Lingthusiasm on Patreon or contribute directly. We started these grants because a small amount of seed money would have made a huge difference to us when we were starting out, and we want to help there be more interesting linguistics communication in the world.

Listen to the episode about Unicode and font encoding and get access to 59 more bonus episodes by supporting Lingthusiasm on Patreon.

#Lingthusiasm #linguistics #bonus episode #bonuses #bonus 60 #font #encoding #character encoding #unicode #emoji #Mongolian #heiroglyphics #hieroglyphs #bonus #bonus episodes

here's an annoying thing I've seen recently regarding cartoon network: many interlacing and aliasing problems. most recent example is the upload of The Scooby Doo Project.

This isn't the first time, though. Cow and Chicken got an upscale on HBO Max LATAM and while the colors are great, the actual lines are aliased and don't look great.

there's many other examples of this, but tl;dr, CN is kinda getting lazy with encoding. hope they improve their stuff soon

#cartoon network #cow and chicken #scooby doo #encoding

Okay so this is a background post about Text encoding, ASCII and Unicode

Text encoding is the process of turning characters to numbers. text encoding allows one to save text as computer data, and to move this data around.

and so, a long time ago (in the 1960s), in a continent far, far away, a standard for text encoding was invented: the American Standard for Character Interface Interchange: ASCII.

Standard explained, technical info for nerds, go to the next red section to pass

Standard explanation end

But that is fine since, well, how often do you need a computer in London to use an interface in Tokyo? all is well.

Then the World Wide Web happens, and suddenly computers speaking different languages read and write complete garbage everywhere.

So an organization called the Unicode Consortium tries to solve the problem, and to create a unified symbol for all languages. They called the standard utf-8

This standard supports 1,114,112 different characters. at present, only around 10% of this data capability is actually used. this includes dead languages, and emojis (which is a wonderful story)

Standard explained, technical info for nerds, go to the next red section to pass

Issues to tackle in a universal text encoding standard:

The protocol must never send 8 zeros in a row, except for the NULL character, otherwise old computers will stop reading in the middle

You must be able to pass from letter to letter easily. no saving the index of each character in some sort of list.

english characters are just ASCII. no thinking there. the first bit is set to 0 and so it is very easy to spot

if not, here's what you do:

let's say your character is 2 bytes long, here is how you would represent it:

110somec , 10haract

and when removing the headers, you'll have

somecharact

which will be some character.

let's say your character is 3 bytes long, here is how you would represent it:

1110some , 10charac , 10ter___

and when removing the headers, you'll have

somecharacter___

which will be another character.

if you wanna go back 1 character? just go back bytes until you find one that starts with something other than 10

no excess Nulls will appear because the only way to get 8 zeros

Standard explanation end

#ascii art #ascii entertainment #ascii #unicode #language #emoji #linguistics #hieroglyphs #programming #coding #encoding #standard

#encoding

Trending Tags

Recently Viewed Tags

#encoding