Lawrence D’Oliveiro @ldo17 - Tumblr Blog

Fixing Python’s Greatest Mistake

Python is easily my favourite programming language right now. When I can use it, it lets me be massively more productive than I can be in other languages. Its powerful data-structuring and data-manipulation facilities, in particular, let me solve the same problems in much less code.

One specific example I can offer is this program which, given certain parameters for a display screen, will calculate the rest. The program is driven by a set of rules which give the formulas by which parameters are calculated from other parameters. The rules are kept in the dictionary named paramdefs. The definition of this dictionary, as of this writing, takes up 102 lines of the program.

Compare the Java version, written for Android. Ignore the fact that this app includes a GUI, whereas the Python version runs from the command line; just look at the Rules.java source file, specifically the creation of the ParamDefs class that corresponds to the Python dictionary: this part of the module alone is currently 578 lines—an expansion in code size of well over 5:1.

I could offer more examples, but that should already give you a good idea about the expressiveness of Python.

Of course, Python gets a few things wrong. With all its eschewing of C-style conventions that have infected most programming languages over the last few decades, it is surprising to see it use “=” for assignment and “==” for the equality comparison, instead of the older ALGOL-style convention of “:=” and “=” respectively, which would have kept it more consistent with usual mathematical notation.

But the worst thing wrong with Python is its use of indentation, instead of bracketing symbols, to delimit compound statements.

Yes, it is true that code should be indented anyway, since it helps immensely to keep the code readable. And Python’s convention for omitting semicolons at the ends of simple statements is actually quite intelligently thought out, unlike for example the corresponding rule in JavaScript. The net result is that my personal conventions for laying out code, painstakingly evolved over many years using several different languages, adapt well to the one language, not so well to the other.

The problem is that Python wants to do away with useful redundancy. In a language like C, I could write a construct like

for (int i = 0; i < 10; ++i) { if (is_prime(i)) { fprintf(stdout, "%d\n", i); } /*if*/ } /*for*/

Here the compiler pays no attention to the indentation whitespace, only to the actual symbols. The redundancy comes in having both: if there is a discrepancy between the two, the compiler may not pick it up, but there is at least a chance that a human reader could do so. Remember the saying: “many eyes make all bugs shallow”.

The way the corresponding Python version is usually written, there is no such redundancy:

for i in range(10) : if is_prime(i) : print(i)

Now, what happens if these pieces of code get posted online somewhere, say in a discussion forum which makes it hard, or even impossible to keep the correct formatting?

The C version might turn into this:

for (int i = 0; i < 10; ++i) { if (is_prime(i)) { fprintf(stdout, "%d\n", i); } /*if*/ } /*for*/

That’s harder to read, but at least it will still compile correctly, and with some work, the formatting can be recreated.

But the Python version turns into complete gibberish:

for i in range(10) : if is_prime(i) : print(i)

This is a simple enough example that you could take a guess on how it is supposed to be indented, and recover the original statement structure manually. But imagine trying to do that for just a few dozen lines of code...

For this reason, I like to put in “#end” comment lines to explicitly mark the ends of compound statements. For example, I would write the above as

for i in range(10) : if is_prime(i) : print(i) #end if #end for

Now, if the indentation were to be lost for any reason, you stand a much better chance of reconstructing it correctly. It could even be done automatically by some prettyprinting tool, just as with the C version.

Another problem with how Python code is usually written comes up when trying to edit the text of a program. For example, the Emacs editor provides commands (ctrl-shift-n and ctrl-shift-p) for jumping between matching opening and closing bracket symbols (“(” and “)”, “[” and “]”, “{” and “}”). While these can still be used within expressions in Python, they are useless for jumping around statements.

But because of my bracketing-comments convention, I was able to add commands to my custom Emacs definitions that provide corresponding functionality for Python statements. Specifically, ctrl-super-n jumps to the next line with the same indentation level as the current line, while ctrl-super-p jumps to the previous such line. So in the above example, it is easy enough to jump between the “for” and “#end for” lines, or between the “if” and “#end if” lines using these keystrokes.

In short, most other languages pay no attention to indentation whitespace, using bracketing symbols to delimit compound statements. But it is generally considered a good idea to add the indentation whitespace anyway, as a form of redundancy to help catch errors.

Conversely, Python pays no attention to these “#end” bracket comment lines, using only the indentation whitespace to delimit its compound statements. But I think it is a good idea to add the bracketing comment lines anyway, as a form of redundancy to help catch errors.

#python #programming #indentation #Emacs #prettyprinting #algol

Constructive Tools Versus Destructive Weapons

Guns are nasty, dangerous things, and should be banned. Yet when you try to say this to the gun-lovers, a common response is that cars and power tools can be dangerous, too, yet we don’t try to ban them. A nicely specious argument, which overlooks the fact that a car is a constructive tool, while a gun is a destructive weapon. Can a gun take your kids to school? Bring home the shopping? Rush a sick loved one to the hospital? Impress a date? A car can do all these things, and more. Whereas a gun can only blow holes in things. A power drill can make holes too, but those are precision holes, that can be used to put together a computer cabinet or a piece of furniture. Whereas all a gun can do is take them apart again, messily and noisily. Even in the worst-case situation, where a car hits someone, the modern vehicle is full of features to mitigate the seriousness of the impact: crumple zones and soft yielding parts on the outside, seat belts and airbags on the inside. And that’s not to mention the systems to make such a collision less likely in the first place. Modern power tools, too, are packed with safety features. Whereas when a gun causes damage, injury and death, it is working as designed. So please, gun lovers, stop with the irrelevant parallels, OK? Oh wait, you can’t. Because logic doesn’t apply in your world, does it?

#gun control #second amendment #firearms #guns

#JeSuisCharlie

Let there be no question about it: the shameful, brutal, murderous attack on Charlie Hebdo is an attack on all of us. It is an expression of violent hatred toward the principle of freedom of expression that all truly democratic nations hold dear. Inevitably, some are asking how these sorts of attacks could be prevented in future. The only realistic answer is, they can’t. But how are we supposed to be kept safe? To which the only rational answer is: is liberty not worth dying for? Many people fought and died to give us the freedoms we take for granted today. If we have to keep dying to hold on to those freedoms, well, can you think of something more important to die for? Remember what the aim of the terrorist is: killing people is not the end, it is only a means to the end for them. Their end is to terrorize: to make us live in fear. Passing more repressive laws and restricting everybody’s freedoms in the name of “public safety” is nothing more or less than an expression of that fear: it is an admission that the terrorists are achieving their aims. The only rational response to the terrorist is to point out that they are not worth fearing; that they cannot stop us from going about our ordinary lives, with our freedoms intact. When the terrorists realize that their attacks are not having the effect they were hoping for, it is they who will become demoralized, not us. The victims at Charlie Hebdo did nothing wrong. I would like to think that, if we could ask those murdered cartoonists whether they would do it again, they would say “yes”. Let us hope that this disgraceful act only spurs even more people to come forward and exercise their rights to free speech and satire. And yes, even blasphemy. Because blasphemy is not a crime in any civilized country. But what about religious tolerance? Of course we have to find a way for those with differing views to get along—but then, that is already the point of every free, democratic society, is it not? To which, let me add a point about religion that some may not like to hear: there can only be religious tolerance when we stop relying on religion as a basis for morality. It is time to recognize that morality is something common to all of us as human beings, not something that depends in any important way on what religion you might subscribe to.

#JeSuisCharlie #freedom #terrorism #freedom of expression #democracy #religion #morality #religious tolerance

To ... err ... is human; to ... umm ... divine.

CLI Versus GUI Deathmatch!

Which is the better way to operate a computer: via a command-line interface (CLI) or a graphical user interface (GUI)? Seems like the CLI has become unfashionable to many, while the GUI is the preferred way of doing things. Really, the optimum way to use a computer nowadays is with a combination of both.

But some people are reluctant to admit this: they talk about the GUI as being for “nontechnical” or “ordinary” users, and the CLI as more suited to “geeks” or “nerds” or other, less complimentary terms. The reality is, if you don’t know something of how to use a CLI, then you are not fully exploiting the potential of the computer to save you time in your work. Computers are good at doing boring, repetitive things, while humans are not. That’s why we invented computers! With a CLI, if you find yourself doing the same sequence of things as before, you can save the command sequence you used as a script, and reinvoke it with a single command. With a GUI, you have to repeat the entire set of steps you did before, yourself.

A CLI takes the burden of doing repetitive stuff from you; a GUI puts the burden on you.

GUIs pioneered the concept of copy and paste for moving data between documents and between applications: select some text or graphics or whatever in one window, copy it to the Clipboard, switch to another window, possibly belonging to a different application, and paste it from the Clipboard into there. CLIs never had this capability before GUIs came along.

Yet the irony is that you can use copy and paste to transfer command sequences between CLI windows (including text editor windows to save them to a file or read them from a file), yet you cannot use it to transfer GUI action sequences between windows. There are no such things as automatable GUIs.

People do try, though: every few years, somebody rediscovers the concept of “macro recorders” to record and play back mouse clicks and keystrokes, to allow automating GUI actions. And they also rediscover why the idea never took off: it is difficult to get right, liable to break with slight changes to the GUI layout when a new release of the application comes out, and just when you thought you got it working, it often misbehaves for subtle timing reasons that are very hard to reproduce. In short, it is the worst of both worlds: it requires you to think like a programmer, like a CLI does, yet it inflicts on you programming facilities that are more fiddly and horrible than that of any CLI that any right-minded person would choose to use.

GUIs are often characterized as “humanizing” the use of computers; they adapt the computer to more human ways of doing things, rather than making the human adapt to the computer’s way of doing things. This may be true as far as it goes, but you can’t escape the flipside of that: an interface designed specifically for humans to use requires a human to operate it, it cannot easily be automated by the computer itself.

There is something of a joke among computer scientists that “there is no computing problem that cannot be solved by adding another level of indirection”. For a joke, this has a lot of truth in it. We don’t (at least most of us don’t) use the bare computer hardware on its own. We add a software layer on top, which turns the original hardware machine into quite a different, higher-level “abstract machine” (made out of both hardware and software); this new machine may be less general and versatile than the original, but it is better suited to solving particular problems, with less work. By building on the lower layers, we make it easier (quicker, cheaper) to solve the problems relevant to us.

But it doesn’t need to stop at just one layer of software: we can build even more software layers on top. The operating system software provides certain fairly general services; we build application programs on top of that, specialized for doing certain tasks. Those programs are written in various higher-level languages, where we can take advantage of a multitude of libraries of prewritten code provided by others to save work and increase productivity.

If the application is a command-line tool, then it can be combined with other such tools in a custom shell script, which can be used as though it were a new command-line tool in its own right. And so the command line continues, even encourages, the layering of abstract machine on top of abstract machine.

But once we get to the GUI layer, all this comes to a screeching halt. There is no way to build new, higher-level machines on top of a GUI abstract machine; the GUI marks the end of the line. From that point on, it’s all manual human labour.

Another fun thing about GUIs is that there are so many of them to choose from. A GUI design starts out addressing a specific set of usage scenarios. But of course users’ needs grow with time. So the GUI has to adapt to new scenarios. But these were not taken into account in the original design, so they cannot be quite as well integrated as the original concepts. So the overall GUI design grows in complexity, and loses something of its original cohesiveness. At some point, the whole system just gets too unwieldy, and you have to scrap it all and start again. But this causes its own problems with users who were accustomed to the old way of doing things.

A rather high-profile case in point right now is the spectacular failure of Microsoft’s attempt to impose a one-size-fits-all GUI across its product line. Even as customers stay away from that, they enthusiastically embrace Android devices which have a UI completely different from the PCs that they were familiar with.

Compare the CLI. Compare a modern-day Linux shell prompt, with, say, command lines from the 1970s or even the 1960s. The modern command line offers powerful new capabilities which were unheard of back then (line editing, tab completion), yet those build on basic ideas that date from from those early roots. The modern CLI is an evolution, not a revolution, from its early beginnings.

GUIs may come and GUIs may go, but the CLI is the one true timeless computer interface.

#cli #gui #linux #commandline #point and click #scripting #automation

Is Linux Unix?

You will often hear Linux described as a “Unix-like” operating system, or a “Unix clone”, and “not real Unix”. Sometimes people will point to the BSD derivatives and claim that they are “real Unix”. Of course this is not true: the BSDs are no more (and no less) “Unix” than Linux is. Once upon a time, there was a real operating system called “Unix”. It was developed at (now-defunct) AT&T Bell Labs, and offered, with source code, for nonprofit use by Universities. This was because AT&T was not, at the time, allowed to go into the computer business. Later, this restriction was lifted. So AT&T licensed the Unix code (and name) to a bunch of commercial companies, including IBM, HP and several others long since gone. They then built their own incompatible variations into their Unix systems, leading to a prolonged period of “Unix wars” (thankfully now over). So the “Unix” name referred to two things: a bunch of proprietary code belonging to AT&T, and a trademark that certain companies were allowed to use. The systems called “BSD” originally grew out of code developed at the University of California at Berkeley, which was done as enhancements to the AT&T Unix code. But thanks to a lawsuit filed by AT&T in the early 1990s, every trace of AT&T Unix had to be scrubbed from this. And of course, you know the history of a Computer Science student called Linus Torvalds, who set out to create his own Unix workalike, that owed nothing to the AT&T code, and made no use of the “Unix” name. He wasn’t the first to achieve this (his dissatisfaction with the licence restrictions around an earlier effort, called Minix, was a key factor in the creation of Linux); AT&T thankfully didn’t claim any copyrights around the Unix APIs or other functional features (which have been codified in the standard known as “POSIX”, after all), so others have been free to do the same. But no one can dispute nowadays that Linux has become the most successful of them all. These days, “Unix” is basically just a trademark. In order to use it, you have to pass some compliance tests, and pay a hefty licensing fee. I believe Apple, for example, has done this; their OS X system is based on a BSD derivative, and they are allowed to call themselves “Unix”, even if none of the open-source BSD projects that they copy from are allowed to use the name. I don’t think they even use any AT&T code as such. In principle, then, a Linux vendor could go through the same licensing process to call their product “Unix”. But nobody can be bothered with this. My impression is, the “Unix” name just brings to mind those long-past times of warring proprietary systems that everybody is happy to have left behind. Some people use the term “*nix” to refer generically to Linux, the BSDs and anything else that might claim POSIX compatibility, but then, why not just use the term “POSIX” for that? And Linux itself has become so dominant, that if anybody were to try to offer a “Unix” system nowadays, I would want to ask them: Is your “Unix” Linux-compatible? Because that is very much the standard nowadays.

#unix #linux #bsd #at&t

worldssmallesttrampoline

coyohti-deactivated20181203

Also…so taking screenshots of Google Earth and printing them out is art now?

ldo17

You could always go back to arranging Cambell’s Soup cans...

Forking GitHub Repos

I’ve lost count of the number of times that other GitHub users have forked one of my repositories and then sat on their copy and did nothing. If you want your own personal copy of a repo, by all means use git clone to mirror it to your PC. Is there a need for your copy to be public on GitHub, when it contains nothing but a (gradually becoming obsolete) snapshot of the original? If you plan to do some of your own development on it, then my advice is: don’t do the fork until you are ready to push your first change to it. This is because, at the time you fork, the owner of the repo you fork from gets a notification, and that is the time I will go to have a look at your copy, to see if you’ve done anything interesting yet. If I see nothing, I will probably forget to go back at some later point to recheck, so you are unlikely to get a second chance at my attention. So use GitHub wisely.

#github #git #vcs #development #programming #version control

fuckyeahcomputerscience

the Alan Turing statue in Manchester decorated for his 100th birthday

If any techie argument in an online forum goes on for long enough, the probability that two disagreeing viewpoints will both claim “real-world” justification for their position approaches one.

Me, inspired by some of the reader comments to this article.

#real world #tech #objectivity

jecrcuniversity-blog

ldo17

jecrc:

The factorial should be outside the square-root sign.

The “Equation” That Changed My Life

At school, I was good at maths and science. But from an early age, I was fascinated by these things called “computers” that were portrayed on TV and in books (this was the early 1970s in a South-East-Asian country, so they were not exactly household objects). I read up everything I could about them, but all the popular accounts were frustratingly short on detail. I was very impressed when some older boys built a rudimentary one (just a few lights and switches) for a science fair at school.

Then some school friends and I got membership at the British Council library in town. They had more books than I had ever seen before, including a full set of tne Encyclopedia Britannica. Alone of all the encyclopedias I had seen up to that point, the Britannica article on “Computers” actually had examples of proper program code! (It was in FORTRAN, but, hey, that was still fantastic to me.) In among all the sample statements, there was this:

N = N + 1

As I said, I was good at maths, and I knew what an equation was—enough to realize that this made no sense as a mathematical equation—there was no value of N (at least, no finite value) which would satisfy it!

But the key point was, in FORTRAN, the “=” denotes, not equality, but assignment. The statement means, “take the current value in the location denoted by N, add 1 to it, and put it into the location denoted by N”.

Once I had grasped this concept, I understood a whole lot more about computers than I had before.

Other languages from around the same time, designed by Proper Computer Scientists, used “:=” to denote assignment, leaving “=” to represent something closer to its mathematical meaning of equality. But unfortunately, the later popularity of C, which uses FORTRAN-style “=” for assignment, and invented another operator, “==”, to denote equality comparison, has probably meant that whole new generations of maths-savvy teenagers will have to go through the confusion I did.

#programming #computing #mathematics

Sanitrights—A Modest Proposal

There’s a lot of hoo-hah over copyright piracy these days. “Content creators deserve to be paid for their hard work”, we are told. Do people who work hard deserve to be paid? Well, then—who are the hardest-working people in the world? How about those who clean our toilets and our sewers? Shouldn’t they be paid more than anybody else? And furthermore, shouldn’t these hygiene creators get royalties for their work, just like content creators do? After all, think about this: when you use a clean toilet, and dispose of your surplus excrement down a properly-functioning sewer, you are not just benefiting from the use of the facilities at that moment. No, you are also benefiting from the fact that you haven’t caught a nasty disease, like typhoid or cholera, from using unhygienic toilet facilities. Such diseases can really bugger up your life, not just kill you. So the fact that you are able to leave that toilet after using it in as good health as when you entered, and are able to go on living a productive life, means that you owe those who gave you that clean toilet. They have hygienic property rights over your life now (just like intellectual property rights, but closer to the opposite end of your body, if you know what I mean). In short, just like copyrights, they have sanitrights. And remember, hygienic property is like intellectual property—you don’t own your good health, you only license it from the sanitright-holders. The initial licence is for your personal, non-commercial use only. If you want to use your good health for anything more, you need to pay extra for the necessary licence. Any kind of work that involves being able to stay conscious, get out of bed and move around would incur royalties on any income you get. Suppose you want to become a doctor, who earns a living from imparting good health to others—such a flagrantly commercial use of your own good health would have to incur the second-highest licence fee of all. And the highest? That would be if you were to become a toilet cleaner or sewage worker yourself.

#copyright #intellectual property #sanitright #piracy

My Creative Commons Shit List

I previously discussed Creative Commons licensing, including which parts of it to use and which parts to avoid. In this posting I want to list some projects that use the non-Free parts—in other words, the parts should be avoided.

• Blend Swap is a very handy resource for those looking for models to use with the Free Blender 3D modelling program. And they are enlightened enough to require that all contributions be redistributable under the Free CC-BY-SA licence—with one glaring exception: All LEGO-related material must be licensed under the non-Free CC-BY-NC. Why is this? It seems to be based on a misunderstanding of trademark law.

After all, consider that the site also offers models related to other trademarked merchandise, such as game/movie characters, car models and so on, yet none of these are subject to the same restrictions—what’s so special about LEGO?

• Celestia is a wonderful program for those wanting to learn about astronomy, and the software itself is available under a Free software licence (the GPL). However, the Celestia Motherlode, that hosts additional data files for use with Celestia, makes its offerings available under a non-Free licence:

We believe that all files on CM, including their contents, are freely distributable for NON-commercial use.

It would have been much nicer if they had made it clear that all contributions were understood to be made available on a CC-BY or CC-BY-SA licence.

• XKCD is a cartoon that is a favourite of many geeks. However, the cartoons are officially licensed CC-BY-NC. Yet I have seen more than one cartoon published in a print magazine—surely if anything constitutes “commercial” use, that would be it? On his “clarification” page, he says he’s

also okay with people reprinting occasional comics (with clear attribution) in publications like books, blogs, newsletters, and presentations.

Interestingly, there is no mention of magazines in that list.

• the Free Universal Construction Kit is a wonderful use of nascent 3D-printing technology to promote interoperability between different toy-construction systems like Lego etc. Or it would be, if they hadn’t used the NC clause in their CC licence. Why do people keep making this mistake?

#creative commons #free culture #copyright

contemplatingstardust

Pi day is my excuse to just learn about math all day. The other 364 days I have no excuse.

archiemcphee

abirato

Static typing.

Trending Blogs

Recently Viewed Blogs

Lawrence D’Oliveiro