biochemistries @biochemistries - Tumblr Blog

Posts

And just like that my time’s up, and I’m leaving the University of Manchester after almost half a decade. Sentimental perhaps, not morose - above are some parting shots (I still haven’t taken my film camera down off the shelf).

I started Nonlinear Dynamics: Mathematical and Computational Approaches last night, from the fantastic Liz Bradley, at the Santa Fe Institute (notable hub for such research, which I first encountered through renowned complexity theorist Stuart Kauffman).

It’s really satisfying having received notions of “flow” rendered concrete within dynamical systems theory, giving a framework to trains of thought so ephemeral they begin to feel impossible to work into a scientific format.

A pair of papers (from Yogi Jaeger, whose work I've written about previously) recently talked in such terms, and reinvigorated my memories of studying biological oscillations, within which can be glimpsed this strange crossover of biology and fluid dynamics. As summarised by Anton Crombach:

Dynamics of body plan segmentation in long and short-germ insects more similar than previously thought: A damped oscillator imposes temporal order on posterior gap gene expression in Drosophila

Less simplification, more insight. Analysis of phase space in a data-driven model with time-dependent parameters: Dynamic Maternal Gradients Control Timing and Shift-Rates for Gap Gene Expression

There’s a good 50 year-long backstory to this field — for a time complexity science was the “hot topic” in biology (not necessarily in positive terms!). The hype I've seen of it in magazines in the 90s, becoming exploited for publicity with some canny approach to influencing stock markets no doubt, harks to ‘Big Data’ in recent years (though do note that there is valid and fascinating, biological analysis-relevant work on multidimensional data beneath Big Data's spiel).

I've been reading Complexity: Life at the Edge of Chaos by Roger Lewin, who gives a staggering account of the field for a lay audience, and nothing short of a tour of Hollywood for a biochemist who's bumped into these figures in dribs and drabs.

(I was disappointed to find that the reason I hadn’t heard of Lewin prior was likely to do with his conversion to pseudoscience in later life, in what would be his final published book)

In a 2015 talk, Marc Vidal (Harvard) — proponent of the ‘edgetic’ view of biological networks which I’ve written about previously — showed a map of the world of complexity science briefly - traced back to this site.

My notes on the first module are in a Wiki here, though a problem I’m currently finding is that their relevance is to some extent conditional on my own — increasingly specific — interests, using complexity theory to understand nonlinear dynamical systems, including the undeniably abstract notions of optical and attention processes in theories of mind, for application to neural network processing of knowledge (to put it as plainly as possible).

It’s not all pie in the sky however, for an idea of why these are relevant, take a look at this blog post: Attention and Augmented Recurrent Neural Networks, by Chris Olah and Shan Carter at Google Brain. To follow the research in this field, you need to take up background readings in psychology (as notions such as transfer learning will be dropped into technical discussion, direct analogies from human psychology).

In other news, the company I’m interning with changed its name to benevolent.ai this week… the new site is an order of magnitude more swish (and it was already pretty neat) - it really seems like a different venture all of a sudden. The former vice president of IBM Watson was announced to be joining the company on the same day: IBM’s AI guru leaps over to Brit biz benevolent.ai. It’s... a lot to take in.

Three talks I found by the man in question, Jérôme Pesenti, are all kinds of incredible:

Keynote discussion on IBM Watson and Big Data insight (2014)

"Cognitive computing | TEDxBermuda" (2014)

"New advances in cognitive computing" (2015), a talk at ENS Cachan (i.e. more intended for researchers than public relations, but not overly technical)

From Alison B. Lowndes’s Bachelors dissertation, Deep Learning with GPU Technology for Image & Feature Recognition:

Cognitive computing systems, such as IBM’s Watson, allow reasoning via probabilistic natural language processing for interacting more naturally with computers than ever before. The pace of progress is so great within cognitive deep learning that NVIDIA is now shipping a plug-n-play CNN “appliance” for the academic and developer community integrating 4 GPU’s and their DIGITS system, a powerful visualisation and configuration tool for neural networks, which I also demonstrate in this report.

benevolent.ai is situated in the ‘Knowledge Quarter’, neighbours with Facebook, Google and research institutes from the Crick to the Alan Turing Institute. It’s… a lot to take in.

Spot benevolent.ai [formerly known as Stratified Medical] right next to Euston station… via: knowledgequarter.london

The past few weeks have been pretty incommunicable, and fruitful in some ways, even if I had found the time to write much here. I’ve begun some new literature curation projects to channel and structure (to avoid overloading naivelocus.com):

thermodynamics and dynamical systems at spin.systems (for now site is just a placeholder, readings are being synced to Twitter @systems_spin)

machine learning, computer vision and optics/microscopy at @naiveoculus (no website)

There's a sense of being on the edge of understanding, and ‘good-enough’ to proceed (though I'm not an artificial intelligence expert, I think that's understood), having a sufficient grasp lets you carry on downstream. Chris Meiklejohn, one of the technical workers I’m currently idolising [for want of a better word] wrote recently about how this process (not quite the overbaked complaint of ‘impostor syndrome’) of riding out to overcome... incompleteness of your experience (or knowledge) is not so much fraudulent as an honorable task, and a necessary period for upstarts in interdisciplinary work.

Speaking of which, another paper recommendation: Greg Wilson, Jennifer Bryan, Karen Cranston et al. (2016) Good Enough Practices in Scientific Computing. arXiv: Software Engineering (cs.SE), 1609.00037

Coming back to the sense of ideas being rendered concrete, I feel a lot of what’s happened in the past few months won’t settle for a long time to come, however there are bubbles in which it seems to be useful in my workaday research. The mathematical readings I’m undertaking now are more out of necessity not to have certain trains of thought suffocate in my ignorance of lower level detail (I have seen computing researchers write - though not without rebuttal - that mathematics/physics ignorance is a barrier to where you can take your research in this domain). I wish I’d been given the support for these as an undergraduate, but the idea is barely on the table, and I don't really know who the relevant parties are to take the issue up with. MOOCs are great, but nothing can stand in for meaningful institutional support.

On that I’ll say no more — oh and if you are interested, I found out too late that the Royal Statistical Society and SIAM (Society for Industrial and Applied Mathematics) both offer free membership for undergraduate students. Such is life.

A couple of sites at which I see abstract mathematical notions being relevant to biochemical research:

"Traditional seq/struct alignment algos can't detect circular permutations between proteins" https://t.co/E4VQAue1gp pic.twitter.com/EOyxNlTYm3

— Louis Maddox (@biochemistries) 6 August 2016

An intriguing observation via a little Wikipedia serendipity (with thanks to the super handy @wiki Telegram bot)

Brilliant paper, #combinatorics-wise ≅ 'permutation interconnection networks' in computing 🔀 https://t.co/YxDRaZB4Tb pic.twitter.com/WOpWFErgJm

— Louis Maddox (@biochemistries) 7 September 2016

See: tweet, on this paper proposing "noncommutative biology" (nicely summarised in the conversation shown above)

I've not written about permutation interconnect networks but they're very easy to find out more about in the network computing literature: see for example this paper: "Fast subword permutation instructions based on butterfly networks" which describes how interconnection networks are used to solve permutation problems down at the machine hardware level of a computer.

The modern scientific programming language Julia gets you down to this level of abstraction (the magic words "code_native" let you see the 'machine code' rather than the human-readable code written by the programmer, such as a simple for loop) to repeatedly do something on a list of items (e.g. for a biology lab's set of genes, proteins, etc.).

On LLVM and Julia, see Working with LLVM

Julia: Reflection and introspection

Leah Hanson's blog: Julia introspects

All in all it doesn't seem too far-fetched to bring these two disparate fields of reading (optical/network computing and modelled biological regulation) together, and may help the latter go further. However, it can be difficult to explain or reason through in terms others share while the ideas are coming into focus.

Cats in trees, permutohedra, and other colourful associahedra

I’d never seen a geometrical representation of phylogenetics before (the ‘tree’ diagrams used to investigate heritage, similar to pedigree charts but where branch length indicates some measure of evolutionary divergence from an ancestor).

Need to read a bunch of papers in more detail, but discovering the word “permutohedron” has made my day — as well as the fact it has an alternate spelling, permutahedron (both show up in the literature).

As well as a mathematical concept, as in "permutations and combinations", a permutation in common usage is an anagram, or an alternative presentation of something. The dual spelling seems to be the sort of terrible deliberate meta-humour dearly beloved of mathematicians that works its way deep into tacit usage if you ask me... By way of example, statistical programmer extraordinnaire Hadley Wickham released another R package recently, forcats.

It's an anagram of factors, and concerned with the statistical concept of 'factor'. The R source code documentation notes:

(the terms ‘category’ and ‘enumerated type’ are also used for factors)

Hence, to the statistically-minded, Hadley's squeezing both cats (a nickname for 'categories of small categories', or 2-categories, in category theory) and permutation into a pun... or I may be reading too much into it ;-)

Some thought that sprung into my head at the time of using forcats for the first time, from its key concepts of factors, levels, and orderings, led me to [as I understand it] the formal geometric development of factors, in a 1982 publication I couldn't find online but mentioned in “Factor Space, the Theoretical Base of Data Science” and material on the subject.

Excerpt from The Basics of Factor Spaces, chapter 12 of Fuzzy Neural Intelligent Systems (2000):

The original definition of “factor spaces” was proposed by Peizhuang Wang [l]. He used factor spaces to explain the source of randomness and the essence of probability laws. In 1982, he gave an axiomatic definition of factor spaces [2]. Since then he has applied factor spaces to the study of artificial intelligence (Wang 1990, Kandel 1990). Several applications in the area of fuzzy information processing have been discussed (Li 1994). This chapter provides an introduction to the basic concepts and methods of applications of factor spaces.

To cut a large body of work short, Peizhuang Wang, who originally proposed 'factor space', is now writing about “Cognition Math Based on Factor Space”. Update - I'm not sure if I want to suggest reading this paper after feedback, see note below for an alternative.

Big data is the era we are faced with, the character of big data is I & I, Internet and Intelligence. Internet is the wing and intelligence is the soul of information revolution [1,2]. Big data era is not beyond, but belonged in the historical stage of information revolution, and the core of big data is intelligence still. However, big data is a new stage at the internet time. When all computers communicate each other on line, what is the kind of computer like? The role of the CPU of the computer has been marginalized and the data processing software plays the main role of AI. The entity of AI machine, so called the fifth or post-fifth generation computer, will be replaced by the man-machine cognition network [3]; the mode of AI will be changed from the bottom-top manual work to the combination of top-bottom and bottom-top network; which makes the man-machine cognition network intelligently huff ‘n’ puff big data from internet. Different from human intelligence, the subject of AI is not brain, but machine. How do machine emulate intelligence of brain? Is it possible to construct a brainlike machine? No matter how advanced science becomes, it is not possible to make a machine as a clone of brain. It is mystery that the insuperable barrier does not take away all belief from AI researchers. Even though the ebb of fifth generation computer in 1990s hints that computer must emulate from the structure of human brain, people still have confidence on AI facing the difficulty of structure-emulation. Indeed, we would cognize that brain is the cognition’s subject, but not the very cognition. Is there a cognition theory keeping a little independence from brain? It concerns with the relationship between cognition information and ontological information [3]. Even though brain has influence to ontology information, ontology information is independent from the subject of cognition essentially, and there exists inner cognition theory to guide artificial intelligence. There were theories arising in artificial intelligences, unfortunately, they are not deep and united but shallow and split [3,4]. There have been no deep and united artificial intelligence theory yet. No a strong theory, no substantial practice! Therefore, we are going to build a strong theory of artificial intelligence [5,6].

Artificial intelligence can’t be achieved without mathematics. Existent heterogeneity of intelligence theory is caused by the heterogeneity of mathematics. To build a united information theory, we must to build a united cognition math.

(None of the references were available through Google Scholar or online materials I could find, included below anyway)

1. Wang PZ (2014) Factor space, a mathematical preparing for the coming of big data tide (special talk), High-end Forum on Big Data. Chinese Academy of Sciences, Beijing 2. Huang CF (2015) A way to test if wisdom network can improve intelligence (Special talk). In: International conference of orient thinking and fuzzy logic: the 50th anniversary of fuzzy sets. Dalian, China 3. Zhong YX (2002) Information science principle. Beijing University in Posts and Telecommunication Press, Beijing 4. He HC, Ma YC (2008) Information, intelligence and logics. North-west University of Polytechnical Press, Xi’an 5. Zhong YX (2012) Higher principle of artificial intelligence, the idea, method, model and theory. Science Press, Beijing 6. He HC, Wang H, Liu YH, Wang YJ, Du YW (2001) Universal logics principle. Science Press, Beijing

Update: I mentioned Wang's recent paper on 'cognition math' to Hadley and he replied that "I would best summarise that paper as jolly obfuscatory"... between the lines of which I guess means it's not just a language barrier but er, work not worth my time reading... Going to leave the above here anyway (references not being on Google Scholar is probably the red flag to note in future). Instead, see: Garrett Grolemund and Hadley Wickham (2014) A Cognitive Interpretation of Data Analysis. International Journal of Statistics 82(2)

Hadley took up a post as Adjunct Professor of Statistics at the University of Auckland late last year, but continues his role as chief data scientist with RStudio (developers of a major code editor used with R, widely used by life sciences researchers).

Flowers and Bouquets, above, comes via Devadoss et al. (2013) Polyhedral Covers of Tree Space.

Trending Blogs

Recently Viewed Blogs

biochemistries