China’s Ideological Spectrum
We find that public preferences are weakly constrained, and the configuration of preferences is multidimensional, but the latent traits of these dimensions are highly correlated. Those who prefer authoritarian rule are more likely to support nationalism, state intervention in the economy, and traditional social values; those who prefer democratic institutions and values are more likely to support market reforms but less likely to be nationalistic and less likely to support traditional social values. This latter set of preferences appears more in provinces with higher levels of development and among wealthier and better-educated respondents.
march 2018
What Peter Thiel thinks about AI risk - Less Wrong
TL;DR: he thinks it's an issue but also feels AGI is very distant, and hence is less worried about it than Musk.

I recommend the rest of the lecture as well; it's a good summary of "Zero to One", with a good Q&A afterwards.

For context, in case anyone doesn't realize: Thiel has been MIRI's top donor throughout its history.

other stuff:
nice interview question: "thing you know is true that not everyone agrees on?"
"learning from failure overrated"
cleantech a huge market, hard to compete
software makes for easy monopolies (zero marginal costs, network effects, etc.)
for most of history inventors did not benefit much (continuous competition)
ethical behavior is a luxury of monopoly
february 2018
Gimbal lock - Wikipedia
Gimbal lock is the loss of one degree of freedom in a three-dimensional, three-gimbal mechanism that occurs when the axes of two of the three gimbals are driven into a parallel configuration, "locking" the system into rotation in a degenerate two-dimensional space.

The word lock is misleading: no gimbal is restrained. All three gimbals can still rotate freely about their respective axes of suspension. Nevertheless, because of the parallel orientation of two of the gimbals' axes there is no gimbal available to accommodate rotation along one axis.
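A minimal sketch of the degeneracy described above, assuming a yaw-pitch-roll convention of R = Rz(yaw) · Ry(pitch) · Rx(roll). The convention and the helper names are illustrative choices, not taken from the article. With pitch at 90 degrees, yaw and roll act about the same axis, so shifting both by the same amount leaves the orientation unchanged:

```python
import math

def rot_x(a):
    c, s = math.cos(a), math.sin(a)
    return [[1, 0, 0], [0, c, -s], [0, s, c]]

def rot_y(a):
    c, s = math.cos(a), math.sin(a)
    return [[c, 0, s], [0, 1, 0], [-s, 0, c]]

def rot_z(a):
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0], [s, c, 0], [0, 0, 1]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def euler_to_matrix(yaw, pitch, roll):
    # Assumed convention (hypothetical choice): R = Rz(yaw) * Ry(pitch) * Rx(roll)
    return matmul(rot_z(yaw), matmul(rot_y(pitch), rot_x(roll)))

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

# At pitch = 90 degrees the matrix depends only on (roll - yaw),
# so adding the same offset to yaw and roll changes nothing.
locked = math.pi / 2
lock_diff = max_abs_diff(
    euler_to_matrix(0.3, locked, 0.5),
    euler_to_matrix(0.3 + 0.2, locked, 0.5 + 0.2),
)

# Away from the singularity the same offsets give a different orientation.
free_diff = max_abs_diff(
    euler_to_matrix(0.3, 0.4, 0.5),
    euler_to_matrix(0.3 + 0.2, 0.4, 0.5 + 0.2),
)

print(lock_diff, free_diff)
```

Here `lock_diff` is zero up to floating-point rounding while `free_diff` is not: one degree of freedom has collapsed even though no axis is physically restrained.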

Now this is where most people stop thinking about the issue and move on with their lives. They just conclude that Euler angles are somehow broken. This is also where a lot of misunderstandings happen, so it's worth investigating the matter slightly further than what causes gimbal lock.

It is important to understand that this is only problematic if you interpolate in Euler angles! In a real physical gimbal this is a given: you have no other choice. In computer graphics you have many other choices, such as normalized matrix, axis-angle, or quaternion interpolation. Gimbal lock has much more dramatic implications for designing control systems than for 3D graphics, which is why a mechanical engineer, for example, will have a very different take on gimbal locking.

You don't have to give up using Euler angles to get rid of gimbal locking; just stop interpolating values in Euler angles. Of course, this means that you can no longer drive a rotation by directly manipulating one of the channels. But as long as you key the 3 angles simultaneously you have no problems, and you can internally convert your interpolation target to something that has fewer problems.
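One way to read "internally convert your interpolation target" is to turn each keyed Euler triple into a unit quaternion and interpolate there. The sketch below assumes the same Rz(yaw) · Ry(pitch) · Rx(roll) convention and uses spherical linear interpolation (slerp). Function names are hypothetical, and this is only one of the alternatives the text lists:

```python
import math

def quat_mul(a, b):
    # Hamilton product of two quaternions given as (w, x, y, z).
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )

def euler_to_quat(yaw, pitch, roll):
    # Assumed convention: q = qz(yaw) * qy(pitch) * qx(roll)
    qz = (math.cos(yaw / 2), 0.0, 0.0, math.sin(yaw / 2))
    qy = (math.cos(pitch / 2), 0.0, math.sin(pitch / 2), 0.0)
    qx = (math.cos(roll / 2), math.sin(roll / 2), 0.0, 0.0)
    return quat_mul(quat_mul(qz, qy), qx)

def slerp(q0, q1, t):
    dot = sum(a * b for a, b in zip(q0, q1))
    if dot < 0.0:
        # q and -q encode the same rotation; flip to take the shorter arc.
        q1 = tuple(-c for c in q1)
        dot = -dot
    dot = min(dot, 1.0)
    if dot > 0.9995:
        # Nearly identical orientations: fall back to a linear blend.
        mixed = tuple(a + t * (b - a) for a, b in zip(q0, q1))
    else:
        theta = math.acos(dot)
        w0 = math.sin((1.0 - t) * theta) / math.sin(theta)
        w1 = math.sin(t * theta) / math.sin(theta)
        mixed = tuple(w0 * a + w1 * b for a, b in zip(q0, q1))
    norm = math.sqrt(sum(c * c for c in mixed))
    return tuple(c / norm for c in mixed)

# Keys are still authored as Euler angles; only the in-between is quaternion-based.
start = euler_to_quat(0.0, 0.0, 0.0)
end = euler_to_quat(math.pi / 2, 0.0, 0.0)
halfway = slerp(start, end, 0.5)
```

Because slerp always takes the shorter arc between two orientations, it cannot represent a rotation of more than half a turn between keys.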

Using Euler angles is simply more intuitive to think in for most cases. And indeed Euler never claimed they were good for interpolating, just that they can model all possible orientations in space. So Euler angles are just fine for setting orientations, as they were meant to do. Incidentally, Euler angles also have the benefit of being able to model multi-turn rotations, which will not happen sanely with the other representations.
september 2017
Fear and Loathing in Psychology - The Unz Review
Warne and Astle looked at 29 best-selling undergraduate textbooks, which is where psychology students learn about intelligence, because less than 10% of graduate courses offer an intelligence option.

3.3% of textbook space is dedicated to intelligence. Given its influence, this is not very much.

The most common topics start well, with IQ and Spearman’s g, but then go on not to the best-validated, evidence-led Cattell-Horn-Carroll meta-analytic summary but to a side-stream, the speculative triarchic theory from Sternberg, and to a highly speculative and non-specific sketch of an idea about multiple intelligences from Gardner. The last is a particular puzzle, since it really is a whimsical notion that motor skill is no different from analytical problem solving. All must have prizes.
Commonly, environmental influences are discussed, genetic ones rarely.

What Do Undergraduates Learn About Human Intelligence? An Analysis of Introductory Psychology Textbooks: https://drive.google.com/file/d/0B3c4TxciNeJZOTl3clpiX0JKckk/view

Education or Indoctrination? The Accuracy of Introductory Psychology Textbooks in Covering Controversial Topics and Urban Legends About Psychology: http://sci-hub.la/https://link.springer.com/article/10.1007/s12144-016-9539-7

Twenty-four leading introductory psychology textbooks were surveyed for their coverage of a number of controversial topics (e.g., media violence, narcissism epidemic, multiple intelligences) and scientific urban legends (e.g., Kitty Genovese, Mozart Effect) for their factual accuracy. Results indicated numerous errors of factual reporting across textbooks, particularly related to failing to inform students of the controversial nature of some research fields and repeating some scientific urban legends as if true. Recommendations are made for improving the accuracy of introductory textbooks.

Mapping the scale of the narcissism epidemic: Increases in narcissism 2002–2007 within ethnic groups: https://www.sciencedirect.com/science/article/pii/S0092656608000949

The increasing numbers of Asian-Americans at the UCs over time may have masked changes in narcissism, as Asian-Americans score lower on the NPI. When examined within ethnic groups, Trzesniewski et al.’s data show that NPI scores increased significantly between 2002 and 2007 at twice the rate of the yearly change found over 24 years in Twenge et al. (2008a). The overall means also show a significant increase 2002–2007. Thus the available evidence suggests that college students are endorsing progressively more narcissistic personality traits over the generations.

Birth Cohort Increases in Narcissistic Personality Traits Among American College Students, 1982–2009: http://journals.sagepub.com/doi/abs/10.1177/1948550609355719

Both studies demonstrate significant increases in narcissism over time (Study 1 d = .37, 1982–2008, when campus is controlled; Study 2 d = .37, 1994–2009). These results support a generational differences model of individual personality traits reflecting changes in culture.

could this just be a selection effect (more people attending)?
july 2017
Overcoming Bias : High Dimensional Societes?
I’ve seen many “spatial” models in social science. Such as models where voters and politicians sit at points in a space of policies. Or where customers and firms sit at points in a space of products. But I’ve never seen a discussion of how one should expect such models to change in high dimensions, such as when there are more dimensions than points.

In small dimensional spaces, the distances between points vary greatly; neighboring points are much closer to each other than are distant points. However, in high dimensional spaces, distances between points vary much less; all points are about the same distance from all other points. When points are distributed randomly, however, these distances do vary somewhat, allowing us to define the few points closest to each point as that point’s “neighbors”. “Hubs” are closest neighbors to many more points than average, while “anti-hubs” are closest neighbors to many fewer points than average. It turns out that in higher dimensions a larger fraction of points are hubs and anti-hubs (Zimek et al. 2012).
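The claim that distances "vary much less" in high dimensions can be checked with a small simulation. This is a sketch under stated assumptions: points are drawn uniformly from the unit cube, and the point count, dimensions, and seed are arbitrary illustrative choices. Spread is measured as the standard deviation of pairwise distances divided by their mean:

```python
import math
import random

def pairwise_distance_spread(num_points, dim, seed=0):
    """Relative spread (std / mean) of all pairwise Euclidean distances."""
    rng = random.Random(seed)
    points = [[rng.random() for _ in range(dim)] for _ in range(num_points)]
    dists = [
        math.dist(points[i], points[j])
        for i in range(num_points)
        for j in range(i + 1, num_points)
    ]
    mean = sum(dists) / len(dists)
    variance = sum((d - mean) ** 2 for d in dists) / len(dists)
    return math.sqrt(variance) / mean

# Illustrative settings only: 50 random points in 2 versus 200 dimensions.
spread_low = pairwise_distance_spread(50, 2)
spread_high = pairwise_distance_spread(50, 200)
print(spread_low, spread_high)
```

The relative spread comes out much smaller in 200 dimensions than in 2, which is the sense in which all points end up about the same distance apart. The sketch does not measure hubness itself.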

If we think of people or organizations as such points, is being a hub or anti-hub associated with any distinct social behavior? Does it contribute substantially to being popular or unpopular? Or does the fact that real people and organizations are in fact distributed in real space overwhelm such things, which only happen in a truly high dimensional social world?
july 2017
A combined analysis of genetically correlated traits identifies 107 loci associated with intelligence | bioRxiv
We apply MTAG to three large GWAS: Sniekers et al. (2017) on intelligence, Okbay et al. (2016) on educational attainment, and Hill et al. (2016) on household income. By combining these three samples our functional sample size increased from 78 308 participants to 147 194. We found 107 independent loci associated with intelligence, implicating 233 genes, using both SNP-based and gene-based GWAS. We find evidence that neurogenesis may explain some of the biological differences in intelligence as well as genes expressed in the synapse and those involved in the regulation of the nervous system.


Finally, using an independent sample of 6 844 individuals we were able to predict 7% of intelligence using SNP data alone.
july 2017
Dadly adaptations | West Hunter
If we understood how this works, we might find that individuals and populations vary in their propensity to show paternal care (for genetic reasons). I would guess that paternal care was ancestral in modern humans, but it’s easy enough to lose something like this when selective pressures no longer favor it. Wolves have paternal care, but dogs have lost it.

This could have something to do with better health in married men. High testosterone levels aren’t cost-free.

It’s possible that various modern environmental factors interfere with the triggers for dadliness. That would hardly be surprising, since we don’t really know how they work.

All this has a number of interesting social implications. Let’s see how many of them you guys can spot.

Poles in the Tent: https://westhunt.wordpress.com/2013/07/09/poles-in-the-tent/
I’m considering a different question: what was the impact of men’s contribution on their children’s survival and fitness? That’s not quite the same as the number of calories contributed. Food is not a single undifferentiated quantity: it’s a category, including a number of different kinds that can’t be freely substituted for each other. Proteins, fats, and carbohydrates can all serve as fuel, but you need protein to build tissue. And amino acids, the building blocks of proteins, are not all fungible. Some we can’t synthesize (essential amino acids); others can only be synthesized from a limited set of precursors, etc. Edible plants often have suboptimal mixes of amino acids (too many Qs, not enough Us), but I’ve never heard of this being a problem with meat. Then you have to consider essential fatty acids, vitamins, and trace elements.

In principle, if high-quality protein were the long pole in the tent, male provisioning of meat, which we see in chimpanzees, might matter quite a bit more than you would think from the number of calories alone. I’m not saying that is necessarily the case, but it might be, and it’s worth checking out.

Sexual selection vs job specialization: https://westhunt.wordpress.com/2017/10/02/sexual-selection-vs-job-specialization/
Pretty much every species is subject to sexual selection: heritable characteristics that lead to more mates or better mates can be favored by natural selection. Typically, sexual selection favors different strategies in males and females. Generally, males can gain fitness with increased mating opportunities, while females gain more from high-quality mates or mates that confer resources. Since the variance in reproduction is usually greater in males than females, sexual selection is usually stronger in males, although it exists and is significant in both sexes.

Usually, though, males and females of a given species have very similar ways of making a living. A male deer and a female deer both eat grass or arugula or whatever. Sexual selection may drive them to evolve in different directions, but finding something to eat mostly drives them in the same direction.

Humans are an exception. In the long past, men hunted and women gathered. The mix varied: in Arctic regions, men produced almost all the food (while women made and repaired gear, as well as raising children). In groups like the Bushmen, women produced most of the calories, but done rightly you would count more than calories: if most of the local plants had low protein or low-quality protein (wrong amino acid mix), meat from hunting could be important out of proportion to its caloric value.

This has been going on for a long time, so there must have been selection for traits that aided provisioning ability in each sex. Those job-related selective pressures probably changed with time. For example, male strength may have become less valuable when the Bushmen developed poison arrows.

I was looking for an intelligent discussion of this question – but I ran into this and couldn’t force myself to read further: “It should not simply be assumed that the exclusion of women from hunting rests upon ‘natural’ physiological differences.”

God give me strength.

What does Greg think about the “plows vs hoes” theory? (As seen here, although Sarah Constantin didn’t invent it.)

The claim is that some societies adopted farming (Europe, the Middle East, Asia) while some societies adopted horticulture (Oceania, sub-Saharan Africa, various primitive peoples) and that this had an effect on gender relations.

Basically: farming is backbreaking work, which favours males, giving them a lot of social capital. You end up with a patriarchal kind of society, where the men do stuff and the women are mostly valuable for raising offspring.


It’s kinda true, in places. There is a connection I haven’t seen explicated: the “hoe culture” has to have some factor keeping population density low, so that labor is scarcer than land. Tropical diseases like malaria might be part of that. Then again, crops like yams don’t store well; better to keep them in the ground until eating. That means it’s hard to tax people – easy with grain bins. No taxes -> no State -> high local violence. At times, VD may also help limit density, cf. Africa’s ‘sterility belt’.

I am not a Moron: https://westhunt.wordpress.com/2017/11/03/i-am-not-a-moron/
So said Agustin Fuentes on Twitter, a few days ago. He’s the same guy that said “Genes don’t do anything by themselves; epigenetics and complex metabolic and developmental systems are at play in how bodies work. The roundworm C. elegans has about 20,000 genes while humans have about 23,000 genes, yet it is pretty obvious that humans are more than 15-percent more complex than roundworms. So while genes matter, they are only a small part of the whole evolutionary picture. Focusing just on DNA won’t get you anywhere.”

Fuentes was claiming that we don’t really know that, back in prehistory, men did most of the hunting while women gathered.


Someone (Will@Evolving _Moloch) criticized this as a good candidate for the most misleading paragraph ever written. The folly of youth! When you’ve been around as long as I have, sonny, you will realize how hard it is to set records for stupidity.

Fuentes’s para is multidimensional crap, of course. People used to hunt animals like red deer, or bison, or eland: sometimes mammoths or rhinos. Big animals. Back in the day, our ancestors used stabbing spears, which go back at least half a million years. Stand-off weapons like atlatls, or bows, or JSOW, are relatively recent. Hunters took big risks & suffered frequent injuries. Men are almost twice as strong as women, particularly in upper-body strength, which is what matters in spear-chucking. They’re also faster, which can be very important when your ambush fails.
So men did the hunting. This isn’t complicated.

Which contemporary hunter-gatherer societies followed this pattern, with men doing almost all of the big-game hunting? All of them.


Look, feminists aren’t happy with human nature, the one that actually exists and is the product of long-term evolutionary pressures. Too bad for them. When they say stuff like “It should not simply be assumed that the exclusion of women from hunting rests upon ‘natural’ physiological differences”, they just sound like fools. “Natural physiological differences” exist. They’re as obvious as a punch in the kisser.

Suppose you wanted to construct a society with effective sexual equality – which is probably just a mistake, but suppose it. The most effective approach would surely entail knowing and taking into account how the world actually ticks. You’d be better off understanding that about 6,000 genes (out of 20,000) show significant expression differences between the sexes, than by pretending that we’re all the same. You would have to make it so: by hook or by crook, by state force and genetic engineering.

Similarly, if you want to minimize war, pretending that people aren’t warlike is a poor start – about as sensible as fighting forest fires by pretending that trees aren’t flammable.

My advice to Agustin Fuentes, about not being a moron: show, don’t tell.

Since DNA is the enduring part, the part that gets transmitted from one generation to the next, the part that contains the instructions/program that determine development and specify everything – he’s wrong. Stupid, like you. Well, to be fair, ignorant as well: there are technical aspects of genetics that Agustin Fuentes is unlikely to know anything about, things that are almost never covered in the typical education of an anthropologist. I doubt if he knows what a Fisher wave is, or anything about selfish genetic elements, or coalescent theory, or for that matter the breeder’s equation.

There are a number of complex technical subjects, things that at least some people understand: those people can do stuff that the man in the street can’t. In most cases, amateurs don’t jump in and pretend to know what’s going on. For example you don’t hear many amateur opinions concerning detonation physics or group theory. But they’re happy to have opinions about natural selection, even though they know fuck-all about it.

"Significantly fewer females are present at hunts than males...females tend to appear at the hunting site once the capture has been made..."

“Women in Tech”: https://bloodyshovel.wordpress.com/2017/10/26/women-in-tech/
june 2017
If there are 3 space dimensions and one time dimension, is it theoretically possible to have multiple time demensions and if so how would it work? : askscience
Yes, we can consider spacetimes with any number of temporal or spatial dimensions. The theory is set up essentially the same. Spacetime is modeled as a smooth n-dimensional manifold with a pseudo-Riemannian metric, and the metric satisfies the Einstein field equations (Einstein tensor = stress tensor).
A pseudo-Riemannian metric is characterized by its signature, i.e., the number of negative terms and the number of positive terms in its diagonalized quadratic form. The coordinates with negative terms correspond to temporal dimensions. (This is a convention that is fixed from the start.) In general relativity, spacetime is 4-dimensional, and the signature is (1,3), so there is 1 temporal dimension and 3 spatial dimensions.
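As a concrete illustration of signature, here are flat line elements for the familiar (1,3) case and for a hypothetical (2,3) case with two temporal coordinates. The coordinate names $t_1, t_2$ and the restriction to flat space are assumptions made for the example. They are not part of the original answer.

```latex
% Signature (1,3): one negative term, three positive terms (flat Minkowski spacetime)
ds^2 = -c^2\,dt^2 + dx^2 + dy^2 + dz^2

% Signature (2,3): two negative terms, three positive terms (hypothetical flat example)
ds^2 = -c^2\,dt_1^2 - c^2\,dt_2^2 + dx^2 + dy^2 + dz^2
```

Counting the minus signs gives the number of temporal dimensions under the sign convention stated above.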
Okay, so that's a lot of math, but it all basically means that, yes, it makes sense to ask questions like "what does a universe with 2 time dimensions and 3 spatial dimensions look like?" It turns out that spacetimes with more than 1 temporal dimension are very pathological. For one, initial value problems do not generally have unique solutions. There is also generally no canonical way to pick out 1 of the infinitely many solutions to the equations of physics. This means that predictability is impossible (e.g., how do you know which solution is the correct one?). Essentially, there is no meaningful physics in a spacetime with more than 1 temporal dimension.
june 2017
Genomic analysis of family data reveals additional genetic effects on intelligence and personality | bioRxiv
Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1003520
Pedigree- and SNP-Associated Genetics and Recent Environment are the Major Contributors to Anthropometric and Cardiometabolic Trait Variation: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1005804

Missing Heritability – found?: https://westhunt.wordpress.com/2017/02/09/missing-heritability-found/
There is an interesting new paper out on genetics and IQ. The claim is that they have found the missing heritability – in rare variants, generally different in each family.

Some of the variants, the ones we find with GWAS, are fairly common and fitness-neutral: the variant that slightly increases IQ confers the same fitness (or very close to the same) as the one that slightly decreases IQ – presumably because of other effects it has. If this weren’t the case, it would be impossible for both of the variants to remain common.

The rare variants that affect IQ will generally decrease IQ – and since pleiotropy is the norm, usually they’ll be deleterious in other ways as well. Genetic load.

Happy families are all alike; every unhappy family is unhappy in its own way.: https://westhunt.wordpress.com/2017/06/06/happy-families-are-all-alike-every-unhappy-family-is-unhappy-in-its-own-way/
It now looks as if the majority of the genetic variance in IQ is the product of mutational load, and the same may be true for many psychological traits. To the extent this is the case, a lot of human psychological variation must be non-adaptive. Maybe some personality variation fulfills an evolutionary function, but a lot does not. Being a dumb asshole may be a bug, rather than a feature. More generally, this kind of analysis could show us whether particular low-fitness syndromes, like autism, were ever strategies – I suspect not.

It’s bad news for medicine and psychiatry, though. It would suggest that what we call a given type of mental illness, like schizophrenia, is really a grab-bag of many different syndromes. The ultimate causes are extremely varied: at best, there may be shared intermediate causal factors. Not good news for drug development: individualized medicine is a threat, not a promise.

see also comment at: https://pinboard.in/u:nhaliday/b:a6ab4034b0d0

So the big implication here is that it's better than I had dared hope - like Yang/Visscher/Hsu have argued, the old GCTA estimate of ~0.3 is indeed a rather loose lower bound on additive genetic variants, and the rest of the missing heritability is just the relatively uncommon additive variants (ie <1% frequency), and so, like Yang demonstrated with height, using much more comprehensive imputation of SNP scores or using whole-genomes will be able to explain almost all of the genetic contribution. In other words, with better imputation panels, we can go back and squeeze out better polygenic scores from old GWASes, new GWASes will be able to reach and break the 0.3 upper bound, and eventually we can feasibly predict 0.5-0.8. Between the expanding sample sizes from biobanks, the still-falling price of whole genomes, the gradual development of better regression methods (informative priors, biological annotation information, networks, genetic correlations), and better imputation, the future of GWAS polygenic scores is bright. Which obviously will be extremely helpful for embryo selection/genome synthesis.

The argument that this supports mutation-selection balance is weaker but plausible. I hope that it's true, because if that's why there is so much genetic variation in intelligence, then that strongly encourages genetic engineering - there is no good reason or Chesterton fence for intelligence variants being non-fixed, it's just that evolution is too slow to purge the constantly-accumulating bad variants. And we can do better.

The surprising implications of familial association in disease risk: https://arxiv.org/abs/1707.00014
As Greg Cochran has pointed out, this probably isn’t going to work. There are a few genes like BRCA1 (which makes you more likely to get breast and ovarian cancer) that we can detect and might affect treatment, but an awful lot of disease turns out to be just the result of random chance and deleterious mutation. This means that you can’t easily tailor disease treatment to people’s genes, because everybody is fucked up in their own special way. If Johnny is schizophrenic because of 100 random errors in the genes that code for his neurons, and Jack is schizophrenic because of 100 other random errors, there’s very little way to test a drug to work for either of them: each is, most likely, the only one in the world with that specific pattern of errors. This is presumably why the incidence of schizophrenia and autism rises in populations when dads get older: more random errors in sperm formation mean more random errors in the baby’s genes, and more things that go wrong down the line.

The looming crisis in human genetics: http://www.economist.com/node/14742737
Some awkward news ahead
- Geoffrey Miller

Human geneticists have reached a private crisis of conscience, and it will become public knowledge in 2010. The crisis has depressing health implications and alarming political ones. In a nutshell: the new genetics will reveal much less than hoped about how to cure disease, and much more than feared about human evolution and inequality, including genetic differences between classes, ethnicities and races.

june 2017
Backwardness | West Hunter
Back around the time I was born, anthropologists sometimes talked about some cultures being more advanced than others. This was before they decided that all cultures are equal, except that some are more equal than others.


I’ve been trying to estimate the gap between Eurasian and Amerindian civilization. The Conquistadors were, in a sense, invaders from the future: but just how far in the future? What point in the history of the Middle East is most similar to the state of the Amerindian civilizations of 1500 AD ?

I would argue that the Amerindian civilizations were less advanced than the Akkadian Empire, circa 2300 BC. The Mayans had writing, but were latecomers in metallurgy. The Inca had tin and arsenical bronze, but didn’t have written records. The Akkadians had both – as well as draft animals and the wheel. You can maybe push the time as far back as 2600 BC, since Sumerian cuneiform was in pretty full swing by then. So the Amerindians were around four thousand years behind.

Apart from the use of iron, sub-Saharan Africa (excepting Ethiopia) was well behind the most advanced Amerindian civilizations circa 1492. I am right now resisting the temptation to get into a hammer-and-tongs discussion of Isandlwana, Rorke’s Drift, Blood River, etc. – and we would all be better off if I continued to do so.

The Battle of Blood River (Afrikaans: Slag van Bloedrivier; Zulu: iMpi yaseNcome) is the name given for the battle fought between _470 Voortrekkers_ ("Pioneers"), led by Andries Pretorius, and _an estimated 80,000 Zulu attackers_ on the bank of the Ncome River on 16 December 1838, in what is today KwaZulu-Natal, South Africa. Casualties amounted to over 3,000 of king Dingane's soldiers dead, including two Zulu princes competing with Prince Mpande for the Zulu throne. _Three Pioneers commando members were lightly wounded_, including Pretorius himself.


In the morning of Tuesday, June 15, while we sat at Dr. Adams's, we talked of a printed letter from the Reverend Herbert Croft, to a young gentleman who had been his pupil, in which he advised him to read to the end of whatever books he should begin to read. JOHNSON. 'This is surely a strange advice; you may as well resolve that whatever men you happen to get acquainted with, you are to keep to them for life. A book may be good for nothing; or there may be only one thing in it worth knowing; are we to read it all through? These Voyages, (pointing to the three large volumes of Voyages to the South Sea, which were just come out) WHO will read them through? A man had better work his way before the mast, than read them through; they will be eaten by rats and mice, before they are read through. There can be little entertainment in such books; one set of Savages is like another.' BOSWELL. 'I do not think the people of Otaheite can be reckoned Savages.' JOHNSON. 'Don't cant in defence of Savages.' BOSWELL. 'They have the art of navigation.' JOHNSON. 'A dog or a cat can swim.' BOSWELL. 'They carve very ingeniously.' JOHNSON. 'A cat can scratch, and a child with a nail can scratch.' I perceived this was none of the mollia tempora fandi; so desisted.

Déjà Vu all over again: America and Europe: https://westhunt.wordpress.com/2014/11/12/deja-vu-all-over-again-america-and-europe/
In terms of social organization and technology, it seems to me that Mesolithic Europeans (around 10,000 years ago) were like archaic Amerindians before agriculture. Many Amerindians on the west coast were still like that when Europeans arrived – foragers with bows and dugout canoes.

On the other hand, the farmers of Old Europe were in important ways a lot like English settlers: the pioneers planted wheat, raised pigs and cows and sheep, hunted deer, expanded and pushed aside the previous peoples, without much intermarriage. Sure, Anglo pioneers were literate, had guns and iron, were part of a state, all of which gave them a much bigger edge over the Amerindians than Old Europe ever had over the Mesolithic hunter-gatherers and made the replacement about ten times faster – but in some ways it was similar. Some of this similarity was the product of historical accidents: the local Amerindians were thin on the ground, like Europe’s Mesolithic hunters – but not so much because farming hadn’t arrived (it had in most of the United States), more because of an ongoing population crash from European diseases.

On the gripping hand, the Indo-Europeans seem to have been something like the Plains Indians: sure, they raised cattle rather than living off abundant wild buffalo, but they too were transformed into troublemakers by the advent of the horse. Both still did a bit of farming. They were also alike in that neither of them really knew what they were doing: neither were the perfected product of thousands of years of horse nomadry. The Indo-Europeans were the first raiders on horseback, and the Plains Indians had only been at it for a century, without any opportunity to learn state-of-the-art tricks from Eurasian horse nomads.

The biggest difference is that the Indo-Europeans won, while the Plains Indians were corralled into crappy reservations.

Quantitative historical analysis uncovers a single dimension of complexity that structures global variation in human social organization: http://www.pnas.org/content/early/2017/12/20/1708800115.full
Do human societies from around the world exhibit similarities in the way that they are structured, and show commonalities in the ways that they have evolved? These are long-standing questions that have proven difficult to answer. To test between competing hypotheses, we constructed a massive repository of historical and archaeological information known as “Seshat: Global History Databank.” We systematically coded data on 414 societies from 30 regions around the world spanning the last 10,000 years. We were able to capture information on 51 variables reflecting nine characteristics of human societies, such as social scale, economy, features of governance, and information systems. Our analyses revealed that these different characteristics show strong relationships with each other and that a single principal component captures around three-quarters of the observed variation. Furthermore, we found that different characteristics of social complexity are highly predictable across different world regions. These results suggest that key aspects of social organization are functionally related and do indeed coevolve in predictable ways. Our findings highlight the power of the sciences and humanities working together to rigorously test hypotheses about general rules that may have shaped human history.
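To make "a single principal component captures around three-quarters of the observed variation" concrete, the sketch below computes the share of total variance carried by the first principal component of a small table. It is not the paper's pipeline: the rows are invented toy values (not Seshat data), the raw covariance matrix is used without any standardization, and the top eigenvalue is found by plain power iteration.

```python
import math

def covariance_matrix(rows):
    n, m = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(m)]
    centered = [[r[j] - means[j] for j in range(m)] for r in rows]
    return [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(m)]
        for a in range(m)
    ]

def top_eigenvalue(matrix, iterations=500):
    # Power iteration. Toy limitation: it fails if the start vector happens
    # to be orthogonal to the leading eigenvector, or if the matrix is zero.
    m = len(matrix)
    vec = [1.0 + 0.1 * i for i in range(m)]
    for _ in range(iterations):
        nxt = [sum(matrix[i][j] * vec[j] for j in range(m)) for i in range(m)]
        norm = math.sqrt(sum(x * x for x in nxt))
        vec = [x / norm for x in nxt]
    mv = [sum(matrix[i][j] * vec[j] for j in range(m)) for i in range(m)]
    return sum(v * w for v, w in zip(vec, mv))  # Rayleigh quotient

def first_component_share(rows):
    """Fraction of total variance explained by the first principal component."""
    cov = covariance_matrix(rows)
    total = sum(cov[i][i] for i in range(len(cov)))
    return top_eigenvalue(cov) / total

# Invented toy rows: three loosely co-varying "complexity" scores per society.
toy_rows = [
    [1.0, 2.0, 1.0],
    [2.0, 3.0, 3.0],
    [3.0, 5.0, 4.0],
    [4.0, 6.0, 6.0],
    [5.0, 8.0, 7.0],
]
share = first_component_share(toy_rows)
print(share)
```

A share near 1 means the columns move almost in lockstep, and a share near 1/m (for m columns of equal variance) means they are unrelated. The reported figure of about three-quarters sits between those extremes.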

The General Social Complexity Factor Is A Thing: https://www.gnxp.com/WordPress/2017/12/21/the-general-social-complexity-factor-is-a-thing/
may 2017
There can only be one! | West Hunter
Dynasties decay.   The founder generally has a lot on the ball – tough, a natural leader, and canny campaigner – but his son is unlikely to be so exceptional.  Partly this is a manifestation of regression to the mean, and partly because his mother was probably chosen for something other than her talents as a warlord. By the fourth or fifth generation, it can be hard to believe that the useless poet on the throne is truly a member of the Golden Family.

This decay is a fundamental historical fact – an inevitable consequence of  biology and primogeniture.  It’s one of the important weaknesses of dynastic rule. The Ottomans, however, found a way around it for some centuries – the law of fratricide.  Upon the death of the Sultan, all of his sons were theoretically eligible for the succession (not just the oldest).  Since the Sultan had a harem, there were a lot of them. Whoever first seized power then had all his brothers and half-brothers executed by ritual strangulation. Incompetents didn’t win out in this struggle.

Dynastic decline through regression seems to mean you go from a guy like Stalin to an imperfect copy of Stalin who is more like the average person (less competent, but also lacking the other traits that make Stalin, Stalin). That might not always be “bad” for most people’s status, depending on the Great Man’s balance of competence against the traits you don’t really want in a ruler but which helped him seize power anyway.

Speculation, but might inbreeding be a way to hang on to more of the founder’s right stuff?

Hence why constitutional monarchy and similar arrangements might tend to emerge in more outbreeding populations: the successors to the big Mafia that runs the state, and to all the little Mafias that run everything else down from there, tend to retain less ancestral competence and less ancestral desire for domination and status, so alternative arrangements are found.

list of evaluations for presidents+
"Competence, of course, is not the same thing as acting in the best interests of the common people. But it is absolutely essential for the continuation of a royal lineage or dynasty."

If everyone in a country was absolutely obsessed with the idea that the human pinkie finger was a symbol of a leader’s connection to god and was required for him to be a good ruler, this whole problem would be solved. The oldest brother could simply have his younger siblings’ pinkie fingers cut off, and let them live.

This would mean that ruling families would be much more free to have big families and expand their power even more.
The Byzantines did something similar with deposed emperors: the disfigured were ineligible for the role of God’s regent, or whatever the emperors were theologically. So, off with the nose! Justinian II ruined this humane custom by wearing a silver nose after his restoration, à la the Lee Marvin character in Cat Ballou.

Here is my take on a solid alternative to democracy in the West: adoptive monarchy. We are acquainted with the Five Good Emperors of Rome. Before an Emperor died, he would adopt his most capable, wise and just subordinate as his son, who would become Emperor after his adoptive father died. The Roman Empire was perhaps at its most stable and prosperous point during this time period, and Edward Gibbon esteemed it as the high point of human history up through the time he was writing his tomes about Roman history, at least for the common Roman citizen.

If the American electorate continues to decline, ever falling into idiocy, factionalism and dementia, then I think it might be prudent to put adoptive monarchy into play, and have our monarchs rule the country in the spirit of the Constitution while temporarily holding it in suspension. The monarchs would have 40 to 80 years to pursue enlightened demographic policies, which would ensure that the American people would be ready for democratic rule again. There of course would be a Congress and courts. Both of these branches of government would at first be impotent, but would gradually regain their former power and prestige. At some point, the monarchy would be abolished, with an elected president regaining command of the executive branch, having about as much power as, say, Eisenhower did.
may 2017
The Distance Between Mars and Venus: Measuring Global Sex Differences in Personality
something other than Big Five

In an email, Del Giudice explained his approach to me with an analogy. “Gender differences in personality are very much like gender differences in facial appearance,” he said. “Each individual trait (nose length, eye size, etc) shows small differences between men and women, but once you put them all together... differences become clear and you can distinguish between male and female faces with more than 95% accuracy.”
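
A rough numerical sketch of the face analogy, under assumptions that are not taken from the paper: k traits that are uncorrelated, normally distributed with equal variance in both groups, and each separated by the same standardized difference d. In that idealized case the combined (Mahalanobis) distance is D = d * sqrt(k), and the best equal-prior classifier is correct with probability Phi(D / 2). Real personality traits are correlated, so actual figures depend on the correlation matrix; the function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def combined_distance(d, k):
    """Mahalanobis D for k uncorrelated traits, each with standardized gap d.

    ASSUMPTION: independence and equal gaps are illustrative simplifications.
    """
    return d * sqrt(k)

def classification_accuracy(d, k):
    """Accuracy of the optimal equal-prior classifier under the normal model: Phi(D / 2)."""
    return NormalDist().cdf(combined_distance(d, k) / 2)

# Small per-trait gaps (d = 0.3 is a made-up value) add up as traits are pooled.
for k in (1, 5, 15, 50, 121):
    print(k, round(combined_distance(0.3, k), 2), round(classification_accuracy(0.3, k), 3))
```

Under these assumptions a single trait with d = 0.3 barely beats a coin flip, while pooling on the order of a hundred such traits would be needed to pass 95%; correlated traits would change that count.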

Gender Differences in Personality across the Ten Aspects of the Big Five: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3149680/
Replicating previous findings, women reported higher Big Five Extraversion, Agreeableness, and Neuroticism scores than men. However, more extensive gender differences were found at the level of the aspects, with significant gender differences appearing in both aspects of every Big Five trait. For Extraversion, Openness, and Conscientiousness, the gender differences were found to diverge at the aspect level, rendering them either small or undetectable at the Big Five level.

some moderation by ethnicity and aging
april 2017
Trust, Trolleys and Social Dilemmas: A Replication Study
Overall, the present studies clearly confirmed the main finding of Everett et al., that deontologists are more trusted than consequentialists in social dilemma games. Study 1 replicates Everett et al.’s effect in the context of trust games. Study 2 generalizes the effect to public goods games, thus demonstrating that it is not specific to the type of social dilemma game used in Everett et al. Finally, both studies build on these results by demonstrating that the increased trust in deontologists may sometimes, but not always, be warranted: deontologists displayed increased cooperation rates but only in the public goods game and not in trust games.

The Adaptive Utility of Deontology: Deontological Moral Decision-Making Fosters Perceptions of Trust and Likeability: https://sci-hub.tw/http://link.springer.com/article/10.1007/s40806-016-0080-6
Consistent with previous research, participants liked and trusted targets whose decisions were consistent with deontological motives more than targets whose decisions were more consistent with utilitarian motives; this effect was stronger for perceptions of trust. Additionally, women reported greater dislike for targets whose decisions were consistent with utilitarianism than men. Results suggest that deontological moral reasoning evolved, in part, to facilitate positive relations among conspecifics and aid group living and that women may be particularly sensitive to the implications of the various motives underlying moral decision-making.

Inference of Trustworthiness From Intuitive Moral Judgments: https://sci-hub.tw/10.1037/xge0000165

Exposure to moral relativism compromises moral behavior: https://sci-hub.tw/http://www.sciencedirect.com/science/article/pii/S0022103113001339

Is utilitarian sacrifice becoming more morally permissible?: http://cushmanlab.fas.harvard.edu/docs/Hannikainanetal_2017.pdf

Disgust and Deontology: http://journals.sagepub.com/doi/abs/10.1177/1948550617732609
Trait Sensitivity to Contamination Promotes a Preference for Order, Hierarchy, and Rule-Based Moral Judgment

We suggest that a synthesis of these two literatures points to one specific emotion (disgust) that reliably predicts one specific type of moral judgment (deontological). In all three studies, we found that trait disgust sensitivity predicted more extreme deontological judgment.

The Influence of (Dis)belief in Free Will on Immoral Behavior: https://www.frontiersin.org/articles/10.3389/fpsyg.2017.00020/full

Beyond Sacrificial Harm: A Two-Dimensional Model of Utilitarian Psychology.: http://psycnet.apa.org/record/2017-57422-001
Recent research has relied on trolley-type sacrificial moral dilemmas to study utilitarian versus nonutilitarian modes of moral decision-making. This research has generated important insights into people’s attitudes toward instrumental harm—that is, the sacrifice of an individual to save a greater number. But this approach also has serious limitations. Most notably, it ignores the positive, altruistic core of utilitarianism, which is characterized by impartial concern for the well-being of everyone, whether near or far. Here, we develop, refine, and validate a new scale—the Oxford Utilitarianism Scale—to dissociate individual differences in the ‘negative’ (permissive attitude toward instrumental harm) and ‘positive’ (impartial concern for the greater good) dimensions of utilitarian thinking as manifested in the general population. We show that these are two independent dimensions of proto-utilitarian tendencies in the lay population, each exhibiting a distinct psychological profile. Empathic concern, identification with the whole of humanity, and concern for future generations were positively associated with impartial beneficence but negatively associated with instrumental harm; and although instrumental harm was associated with subclinical psychopathy, impartial beneficence was associated with higher religiosity. Importantly, although these two dimensions were independent in the lay population, they were closely associated in a sample of moral philosophers. Acknowledging this dissociation between the instrumental harm and impartial beneficence components of utilitarian thinking in ordinary people can clarify existing debates about the nature of moral psychology and its relation to moral philosophy as well as generate fruitful avenues for further research. (PsycINFO Database Record (c) 2017 APA, all rights reserved)

A breakthrough in moral psychology: https://nintil.com/2017/12/28/a-breakthrough-in-moral-psychology/

Gender Differences in Responses to Moral Dilemmas: A Process Dissociation Analysis: https://www.ncbi.nlm.nih.gov/pubmed/25840987
The principle of deontology states that the morality of an action depends on its consistency with moral norms; the principle of utilitarianism implies that the morality of an action depends on its consequences. Previous research suggests that deontological judgments are shaped by affective processes, whereas utilitarian judgments are guided by cognitive processes. The current research used process dissociation (PD) to independently assess deontological and utilitarian inclinations in women and men. A meta-analytic re-analysis of 40 studies with 6,100 participants indicated that men showed a stronger preference for utilitarian over deontological judgments than women when the two principles implied conflicting decisions (d = 0.52). PD further revealed that women exhibited stronger deontological inclinations than men (d = 0.57), while men exhibited only slightly stronger utilitarian inclinations than women (d = 0.10). The findings suggest that gender differences in moral dilemma judgments are due to differences in affective responses to harm rather than cognitive evaluations of outcomes.
march 2017
How Universal Is the Big Five? Testing the Five-Factor Model of Personality Variation Among Forager–Farmers in the Bolivian Amazon
We failed to find robust support for the FFM, based on tests of (a) internal consistency of items expected to segregate into the Big Five factors, (b) response stability of the Big Five, (c) external validity of the Big Five with respect to observed behavior, (d) factor structure according to exploratory and confirmatory factor analysis, and (e) similarity with a U.S. target structure based on Procrustes rotation analysis.


We argue that Tsimane personality variation displays 2 principal factors that may reflect socioecological characteristics common to small-scale societies. We offer evolutionary perspectives on why the structure of personality variation may not be invariant across human societies.
pdf  study  psychology  cog-psych  society  embedded-cognition  personality  metrics  generalization  methodology  farmers-and-foragers  latin-america  context  homo-hetero  info-dynamics  water  psychometrics  exploratory  things  phalanges  dimensionality  anthropology  universalism-particularism  applicability-prereqs 
february 2017
interpretation - How to understand degrees of freedom? - Cross Validated
From Wikipedia, there are three interpretations of the degrees of freedom of a statistic:

In statistics, the number of degrees of freedom is the number of values in the final calculation of a statistic that are free to vary.

Estimates of statistical parameters can be based upon different amounts of information or data. The number of independent pieces of information that go into the estimate of a parameter is called the degrees of freedom (df). In general, the degrees of freedom of an estimate of a parameter is equal to the number of independent scores that go into the estimate minus the number of parameters used as intermediate steps in the estimation of the parameter itself (which, in sample variance, is one, since the sample mean is the only intermediate step).

Mathematically, degrees of freedom is the dimension of the domain of a random vector, or essentially the number of 'free' components: how many components need to be known before the vector is fully determined.


This is a subtle question. It takes a thoughtful person not to understand those quotations! Although they are suggestive, it turns out that none of them is exactly or generally correct. I haven't the time (and there isn't the space here) to give a full exposition, but I would like to share one approach and an insight that it suggests.

Where does the concept of degrees of freedom (DF) arise? The contexts in which it's found in elementary treatments are:

- The Student t-test and its variants such as the Welch or Satterthwaite solutions to the Behrens-Fisher problem (where two populations have different variances).
- The Chi-squared distribution (defined as a sum of squares of independent standard Normals), which is implicated in the sampling distribution of the variance.
- The F-test (of ratios of estimated variances).
- The Chi-squared test, comprising its uses in (a) testing for independence in contingency tables and (b) testing for goodness of fit of distributional estimates.

In spirit, these tests run a gamut from being exact (the Student t-test and F-test for Normal variates) to being good approximations (the Student t-test and the Welch/Satterthwaite tests for not-too-badly-skewed data) to being based on asymptotic approximations (the Chi-squared test). An interesting aspect of some of these is the appearance of non-integral "degrees of freedom" (the Welch/Satterthwaite tests and, as we will see, the Chi-squared test). This is of especial interest because it is the first hint that DF is not any of the things claimed of it.


Having been alerted by these potential ambiguities, let's hold up the Chi-squared goodness of fit test for examination, because (a) it's simple, (b) it's one of the common situations where people really do need to know about DF to get the p-value right and (c) it's often used incorrectly. Here's a brief synopsis of the least controversial application of this test:


This, many authorities tell us, should have (to a very close approximation) a Chi-squared distribution. But there's a whole family of such distributions. They are differentiated by a parameter ν often referred to as the "degrees of freedom." The standard reasoning about how to determine ν goes like this:

I have k counts. That's k pieces of data. But there are (functional) relationships among them. To start with, I know in advance that the sum of the counts must equal n. That's one relationship. I estimated two (or p, generally) parameters from the data. That's two (or p) additional relationships, giving p+1 total relationships. Presuming they (the parameters) are all (functionally) independent, that leaves only k−p−1 (functionally) independent "degrees of freedom": that's the value to use for ν.

The problem with this reasoning (which is the sort of calculation the quotations in the question are hinting at) is that it's wrong except when some special additional conditions hold. Moreover, those conditions have nothing to do with independence (functional or statistical), with numbers of "components" of the data, with the numbers of parameters, nor with anything else referred to in the original question.


Things went wrong because I violated two requirements of the Chi-squared test:

1. You must use the Maximum Likelihood estimate of the parameters. (This requirement can, in practice, be slightly violated.)
2. You must base that estimate on the counts, not on the actual data! (This is crucial.)


The point of this comparison--which I hope you have seen coming--is that the correct DF to use for computing the p-values depends on many things other than dimensions of manifolds, counts of functional relationships, or the geometry of Normal variates. There is a subtle, delicate interaction between certain functional dependencies, as found in mathematical relationships among quantities, and distributions of the data, their statistics, and the estimators formed from them. Accordingly, it cannot be the case that DF is adequately explainable in terms of the geometry of multivariate normal distributions, or in terms of functional independence, or as counts of parameters, or anything else of this nature.

We are led to see, then, that "degrees of freedom" is merely a heuristic that suggests what the sampling distribution of a (t, Chi-squared, or F) statistic ought to be, but it is not dispositive. Belief that it is dispositive leads to egregious errors. (For instance, the top hit on Google when searching "chi squared goodness of fit" is a Web page from an Ivy League university that gets most of this completely wrong! In particular, a simulation based on its instructions shows that the chi-squared value it recommends as having 7 DF actually has 9 DF.)
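
As a baseline for the discussion above, here is a minimal simulation sketch of the one case where the counting rule is unproblematic: the bin probabilities are fully specified in advance and no parameters are estimated (p = 0). The expected value of the Pearson statistic is then exactly k − 1. The bin probabilities, sample size, and function names are hypothetical choices for illustration. The sketch does not reproduce the answer's own experiment, where parameters are estimated from the raw data and the rule breaks down.

```python
import random

def pearson_chi2(counts, probs):
    """Pearson statistic: sum over bins of (observed - expected)^2 / expected."""
    n = sum(counts)
    return sum((o - n * p) ** 2 / (n * p) for o, p in zip(counts, probs))

def simulate_mean_chi2(probs, n, trials, seed=0):
    """Average Pearson statistic when the bin probabilities are known exactly."""
    rng = random.Random(seed)
    k = len(probs)
    total = 0.0
    for _ in range(trials):
        counts = [0] * k
        # Draw n observations and tally which bin each one lands in.
        for bin_index in rng.choices(range(k), weights=probs, k=n):
            counts[bin_index] += 1
        total += pearson_chi2(counts, probs)
    return total / trials

probs = [0.1, 0.2, 0.3, 0.25, 0.15]  # hypothetical bin probabilities, k = 5
mean_stat = simulate_mean_chi2(probs, n=200, trials=2000)
print(f"mean statistic = {mean_stat:.2f}, k - 1 = {len(probs) - 1}")
```

A Chi-squared variable with ν degrees of freedom has mean ν, so comparing the simulated mean to a candidate ν is a cheap sanity check of the kind the answer recommends.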
january 2017
teaching - Intuitive explanation for dividing by $n-1$ when calculating standard deviation? - Cross Validated
The standard deviation calculated with a divisor of n-1 is a standard deviation calculated from the sample as an estimate of the standard deviation of the population from which the sample was drawn. Because the observed values fall, on average, closer to the sample mean than to the population mean, the standard deviation which is calculated using deviations from the sample mean underestimates the desired standard deviation of the population. Using n-1 instead of n as the divisor corrects for that by making the result a little bit bigger.

Note that the correction has a larger proportional effect when n is small than when it is large, which is what we want because when n is larger the sample mean is likely to be a good estimator of the population mean.


A common one is that the definition of variance (of a distribution) is the second moment recentered around a known, definite mean, whereas the estimator uses an estimated mean. This loss of a degree of freedom (given the mean, you can reconstitute the dataset with knowledge of just n−1 of the data values) requires the use of n−1 rather than n to "adjust" the result.
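
A short simulation sketch of the underestimation described above, assuming standard normal data and a small sample size (n = 5, chosen only for illustration). With divisor n the average variance estimate comes out near (n − 1)/n of the true variance; with divisor n − 1 it comes out near the true variance. The correction makes the variance estimate unbiased, whereas the standard deviation obtained by taking its square root is still slightly biased.

```python
import random
from statistics import pvariance, variance

def average_variance_estimates(n, trials, seed=0):
    """Average the divisor-n and divisor-(n - 1) variance estimates over many samples."""
    rng = random.Random(seed)
    sum_div_n = 0.0
    sum_div_n_minus_1 = 0.0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]  # true variance is 1
        sum_div_n += pvariance(sample)         # deviations from sample mean, divided by n
        sum_div_n_minus_1 += variance(sample)  # same deviations, divided by n - 1
    return sum_div_n / trials, sum_div_n_minus_1 / trials

biased, unbiased = average_variance_estimates(n=5, trials=10000)
print(f"divisor n: {biased:.3f}   divisor n-1: {unbiased:.3f}   true variance: 1.000")
```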
january 2017
Dvoretzky's theorem - Wikipedia
In mathematics, Dvoretzky's theorem is an important structural theorem about normed vector spaces proved by Aryeh Dvoretzky in the early 1960s, answering a question of Alexander Grothendieck. In essence, it says that every sufficiently high-dimensional normed vector space will have low-dimensional subspaces that are approximately Euclidean. Equivalently, every high-dimensional bounded symmetric convex set has low-dimensional sections that are approximately ellipsoids.

january 2017
Information Processing: Oppenheimer on Bohr (1964 UCLA)
I find it strange that psychometricians usually define "verbal ability" over a vocabulary set that excludes words from mathematics and other scientific areas. A person's verbal score is enhanced by knowing many (increasingly obscure) words for the same concept, as opposed to knowing words which describe new concepts beyond those which appear in ordinary language.
hsu  scitariat  thinking  metabuch  language  neurons  psychometrics  dimensionality  concept  list  critique  conceptual-vocab  quotes  giants  discussion  wordlessness  novelty  tricki 
january 2017
soft question - Thinking and Explaining - MathOverflow
- good question from Bill Thurston
- great answers by Terry Tao, fedja, Minhyong Kim, gowers, etc.

Terry Tao:
- symmetry as blurring/vibrating/wobbling, scale invariance
- anthropomorphization, adversarial perspective for estimates/inequalities/quantifiers, spending/economy

fedja walks through his thought process from another answer

Minhyong Kim: anthropology of mathematical philosophizing

Per Vognsen: normality as isotropy
comment: conjugate subgroup gHg^-1 ~ "H but somewhere else in G"

gowers: hidden things in basic mathematics/arithmetic
comment by Ryan Budney: x sin(x) via x -> (x, sin(x)), (x, y) -> xy
I kinda get what he's talking about but needed to use Mathematica to get the initial visualization down.
To remind myself later:
- xy can be easily visualized by juxtaposing the two parabolae x^2 and -x^2 diagonally
- x sin(x) can be visualized along that surface by moving your finger along the line (x, 0) but adding some oscillations in y direction according to sin(x)
january 2017 by nhaliday
Ethnic fractionalization and growth | Dietrich Vollrath
Garett Jones did a podcast with The Economics Detective recently on the costs of ethnic diversity. It is particularly worth listening to given that racial identity has re-emerged as a salient element of politics. A quick summary - and the link above includes a nice write-up of relevant sources - would be that diversity within workplaces does not appear to improve outcomes (however those outcomes are measured).

At the same time, there is a parallel literature, touched on in the podcast, about ethnic diversity (or fractionalization, as it is termed in that literature) and economic growth. But one has to be careful drawing a bright line between the two literatures. It does not follow that the results for workplace diversity imply the results regarding economic growth. And this is because the growth results, to the extent that you believe they are robust, all operate through political systems.

So here let me walk through some of the core empirical relationships that have been found regarding ethnic fractionalization and economic growth, and then talk about why you need to take care with over-interpreting them. This is not a thorough literature review, and I realize there are other papers in the same vein. What I’m after is characterizing the essential results.


- objection about sensitivity of measure to definition of clusters seems dumb to me (point is to fix definitions then compare different polities. as long as direction and strength of correlation is fairly robust to changes in clustering, this is a stupid critique)
- also, could probably define a less arbitrary notion of fractionalization (w/o fixed clustering or # of clusters) if using points in a metric/vector/euclidean space (eg, genomes)
- eg, A Generalized Index of Ethno-Linguistic Fractionalization: http://www-3.unipv.it/webdept/prin/workpv02.pdf
So like -E_{A, B ~ X} d(A, B). Or maybe -E_{A, B ~ X} f(d(A, B)) for f an increasing function (in particular, f(x) = x^2).

Note that E ||A - B|| = Θ(E ||E[A] - A||), and E ||A - B||^2 = 2Var A,
for A, B ~ X, so this is just quantifying deviation from mean for Euclidean spaces.

In the case that you have a bunch of different clusters w/ centers equidistant (so n+1 in R^n), measures p_i, and internal variances σ_i^2, you get -E ||A - B||^2 = -2∑_i p_i^2σ_i^2 - ∑_{i≠j} p_ip_j(1 + σ_i^2 + σ_j^2) = -(1 - ∑_i p_i^2) - 2∑_i p_i^2σ_i^2 - ∑_i 2p_i(1-p_i)σ_i^2 = -(1 - ∑_i p_i^2) - 2∑_i p_iσ_i^2
(inter-center distance scaled to 1 wlog; sums over i≠j run over ordered pairs). So it's the usual 1 - ∑_i p_i^2 fractionalization index plus twice the average within-cluster variance, with a minus sign.
(in general, if you allow _approximate_ equidistance, you can pack in exp(O(n)) clusters via JL lemma)
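
A quick numerical check of the two Euclidean identities in this note, as a sketch with made-up inputs. It verifies E ||A - B||^2 = 2 Var A exactly on a small point set. It then samples from clusters whose centers sit at mutual distance 1 and compares the result with (1 - ∑_i p_i^2) + 2∑_i p_iσ_i^2, which is what E ||A - B||^2 works out to in that setup when σ_i^2 is read as the total (trace) variance of cluster i. Assumptions: isotropic Gaussian clusters, centers placed at e_i/sqrt(2) in R^k for convenience rather than in the minimal R^(k-1), and hypothetical weights and variances.

```python
import math
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_sq_pairwise_distance(points):
    """Exact E||A - B||^2 for A, B drawn independently and uniformly from `points`."""
    n = len(points)
    return sum(sq_dist(a, b) for a in points for b in points) / (n * n)

def total_variance(points):
    """Trace of the population covariance matrix of `points`."""
    n, dim = len(points), len(points[0])
    mean = [sum(p[k] for p in points) / n for k in range(dim)]
    return sum(sq_dist(p, mean) for p in points) / n

def cluster_formula(p, sigma2):
    """(1 - sum p_i^2) + 2 sum p_i sigma_i^2, for centers at mutual distance 1."""
    return (1 - sum(pi * pi for pi in p)) + 2 * sum(pi * s for pi, s in zip(p, sigma2))

def sample_clusters(p, sigma2, n_points, seed=0):
    """Isotropic Gaussian clusters centered at e_i / sqrt(2), so centers are 1 apart."""
    rng = random.Random(seed)
    k = len(p)
    points = []
    for i, (pi, s) in enumerate(zip(p, sigma2)):
        sd = math.sqrt(s / k)  # total variance s split evenly over k coordinates
        for _ in range(round(pi * n_points)):
            points.append([(1 / math.sqrt(2) if j == i else 0.0) + rng.gauss(0.0, sd)
                           for j in range(k)])
    return points

p = [0.5, 0.3, 0.2]           # hypothetical cluster weights
sigma2 = [0.04, 0.09, 0.01]   # hypothetical within-cluster total variances
points = sample_clusters(p, sigma2, n_points=2000)
print(2 * total_variance(points), cluster_formula(p, sigma2))
```

The index in the note is the negative of the printed quantity; the sign flip does not affect the comparison.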
december 2016
Information Processing: Thought vectors and the dimensionality of the space of concepts
If we trained a deep net to translate sentences about Physics from Martian to English, we could (roughly) estimate the "conceptual depth" of the subject. We could even compare two different subjects, such as Physics versus Art History.
hsu  ai  deep-learning  google  speculation  commentary  news  language  embeddings  neurons  thinking  papers  summary  scitariat  dimensionality  conceptual-vocab  vague  nlp  nibble  state-of-art  features 
december 2016
Science Policy | West Hunter
If my 23andme profile revealed that I was the last of the Plantagenets (as some suspect), and therefore rightfully King of the United States and Defender of Mexico, and I asked you for a general view of the right approach to science and technology – where the most promise is, what should be done, etc – what would you say?

genetically personalized medicine: https://westhunt.wordpress.com/2016/12/08/science-policy/#comment-85698
I have no idea how personalized medicine is supposed to work. Suppose that we sequence your entire genome, and then we intend to tailor a therapeutic approach to your genome.

How do we test it? By trying it on a bunch of genetically similar people? The more genetic details we take into account, the smaller that class is. It could easily become so small that it would be difficult to recruit enough people for a reasonable statistical trial. Second, the more details we take into account, the smaller the class that benefits from the whole testing process – which as far as I can see, is just as expensive as conventional Phase I/II etc. trials.

What am I missing?

Now if you are a forethoughtful trillionaire, sure: you manufacture lots of clones just to test therapies you might someday need, and cost is no object.

I think I can see ways you could make it work tho [edit: what did I mean by this?...damnit]
december 2016
gt.geometric topology - Intuitive crutches for higher dimensional thinking - MathOverflow
Terry Tao:
I can't help you much with high-dimensional topology - it's not my field, and I've not picked up the various tricks topologists use to get a grip on the subject - but when dealing with the geometry of high-dimensional (or infinite-dimensional) vector spaces such as R^n, there are plenty of ways to conceptualise these spaces that do not require visualising more than three dimensions directly.

For instance, one can view a high-dimensional vector space as a state space for a system with many degrees of freedom. A megapixel image, for instance, is a point in a million-dimensional vector space; by varying the image, one can explore the space, and various subsets of this space correspond to various classes of images.

One can similarly interpret sound waves, a box of gases, an ecosystem, a voting population, a stream of digital data, trials of random variables, the results of a statistical survey, a probabilistic strategy in a two-player game, and many other concrete objects as states in a high-dimensional vector space, and various basic concepts such as convexity, distance, linearity, change of variables, orthogonality, or inner product can have very natural meanings in some of these models (though not in all).

It can take a bit of both theory and practice to merge one's intuition for these things with one's spatial intuition for vectors and vector spaces, but it can be done eventually (much as after one has enough exposure to measure theory, one can start merging one's intuition regarding cardinality, mass, length, volume, probability, cost, charge, and any number of other "real-life" measures).

For instance, the fact that most of the mass of a unit ball in high dimensions lurks near the boundary of the ball can be interpreted as a manifestation of the law of large numbers, using the interpretation of a high-dimensional vector space as the state space for a large number of trials of a random variable.
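
A small worked computation of the boundary-concentration fact, as a sketch. Since the volume of a ball of radius r in R^n scales as r^n, the fraction of the unit ball within distance eps of its boundary is 1 - (1 - eps)^n. The function name and the chosen values of n and eps are illustrative.

```python
def shell_fraction(n, eps):
    """Fraction of the unit n-ball's volume lying within eps of the boundary."""
    return 1 - (1 - eps) ** n

# A thin 1% shell holds almost nothing in 3 dimensions and almost everything in 1000.
for n in (3, 10, 100, 1000):
    print(n, round(shell_fraction(n, 0.01), 5))
```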

More generally, many facts about low-dimensional projections or slices of high-dimensional objects can be viewed from a probabilistic, statistical, or signal processing perspective.

Scott Aaronson:
Here are some of the crutches I've relied on. (Admittedly, my crutches are probably much more useful for theoretical computer science, combinatorics, and probability than they are for geometry, topology, or physics. On a related note, I personally have a much easier time thinking about R^n than about, say, R^4 or R^5!)

1. If you're trying to visualize some 4D phenomenon P, first think of a related 3D phenomenon P', and then imagine yourself as a 2D being who's trying to visualize P'. The advantage is that, unlike with the 4D vs. 3D case, you yourself can easily switch between the 3D and 2D perspectives, and can therefore get a sense of exactly what information is being lost when you drop a dimension. (You could call this the "Flatland trick," after the most famous literary work to rely on it.)
2. As someone else mentioned, discretize! Instead of thinking about R^n, think about the Boolean hypercube {0,1}^n, which is finite and usually easier to get intuition about. (When working on problems, I often find myself drawing {0,1}^4 on a sheet of paper by drawing two copies of {0,1}^3 and then connecting the corresponding vertices.)
3. Instead of thinking about a subset S⊆R^n, think about its characteristic function f:R^n→{0,1}. I don't know why that trivial perspective switch makes such a big difference, but it does ... maybe because it shifts your attention to the process of computing f, and makes you forget about the hopeless task of visualizing S!
4. One of the central facts about R^n is that, while it has "room" for only n orthogonal vectors, it has room for exp(n) almost-orthogonal vectors. Internalize that one fact, and so many other properties of R^n (for example, that the n-sphere resembles a "ball with spikes sticking out," as someone mentioned before) will suddenly seem non-mysterious. In turn, one way to internalize the fact that R^n has so many almost-orthogonal vectors is to internalize Shannon's theorem that there exist good error-correcting codes.
5. To get a feel for some high-dimensional object, ask questions about the behavior of a process that takes place on that object. For example: if I drop a ball here, which local minimum will it settle into? How long does this random walk on {0,1}^n take to mix?
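
Two of the crutches above lend themselves to a short sketch. The first function builds the edges of {0,1}^n the way item 2 describes drawing it: two copies of {0,1}^(n-1) with corresponding vertices joined. The second illustrates item 4 empirically by drawing random ±1 vectors and reporting the largest absolute cosine between any pair. Random sign vectors are an assumed, convenient construction, not something the answer specifies, and the sketch shows near-orthogonality for a modest number of vectors rather than proving the exp(n) count.

```python
import random
from itertools import combinations

def hypercube_edges(n):
    """Edges of {0,1}^n: two copies of {0,1}^(n-1) plus links between twin vertices."""
    if n == 0:
        return []
    lower = hypercube_edges(n - 1)
    vertices = [tuple((i >> b) & 1 for b in range(n - 1)) for i in range(2 ** (n - 1))]
    edges = [((0,) + u, (0,) + v) for u, v in lower]   # first copy
    edges += [((1,) + u, (1,) + v) for u, v in lower]  # second copy
    edges += [((0,) + v, (1,) + v) for v in vertices]  # connect corresponding vertices
    return edges

def max_abs_cosine(dim, count, seed=0):
    """Largest |cosine| between any two of `count` random sign vectors in R^dim."""
    rng = random.Random(seed)
    vectors = [[rng.choice((-1, 1)) for _ in range(dim)] for _ in range(count)]
    # Every sign vector has squared norm dim, so cosine = dot product / dim.
    return max(abs(sum(x * y for x, y in zip(u, v))) / dim
               for u, v in combinations(vectors, 2))

print(len(hypercube_edges(4)))             # number of edges of the 4-cube
print(max_abs_cosine(dim=1000, count=30))  # small: all 30 vectors nearly orthogonal
```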

Gil Kalai:
This is a slightly different point, but Vitali Milman, who works in high-dimensional convexity, likes to draw high-dimensional convex bodies in a non-convex way. This is to convey the point that if you take the convex hull of a few points on the unit sphere of R^n, then for large n very little of the measure of the convex body is anywhere near the corners, so in a certain sense the body is a bit like a small sphere with long thin "spikes".
december 2016
Degenerate Neanderthals | West Hunter
Both papers talk about the likely genetic burden that Eurasians picked up from that Neanderthal admixture. Since East Asians have a somewhat higher level of Neanderthal admixture than people in Europe or the Middle East (~20% more) then they must have even more toxic Neanderthal genes, and Africans the least. This echoes earlier papers that have argued that population history (out-of-Africa bottleneck, Neanderthal admixture, etc) must have increased genetic load in Eurasians.
Evidently extra genetic load has anti-intuitive effects.

interesting: https://westhunt.wordpress.com/2015/11/03/degenerate-neanderthals/#comment-73074


The model allows us to investigate compensatory mutations, which restore fitness losses incurred by other mutations, in a context-dependent manner. We have conducted a moment analysis of the model, supplemented by the numerical results of computer simulations. The mean reduction of fitness (i.e., expected load) scaled to one is approximately n/(n + 2Ne), where Ne is the effective population size. The reciprocal relationship between the load and Ne implies that the fixation of deleterious mutations is unlikely to cause extinction when there is a broad scope for compensatory mutations, except in very small populations. Furthermore, the dependence of load on n implies that pleiotropy plays a large role in determining the extinction risk of small populations.
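
To get a feel for the n/(n + 2Ne) approximation quoted above, here is a small sketch that just evaluates it. The function name and the sample values are illustrative assumptions; reading n as the number of trait dimensions (the pleiotropy parameter) is my interpretation of the excerpt, not something it states outright.

```python
def expected_load(n, ne):
    """Approximate expected load n / (n + 2*Ne) from the quoted abstract.

    n  : model parameter the excerpt ties to pleiotropy (assumed: trait dimensions)
    ne : effective population size
    """
    if n <= 0 or ne <= 0:
        raise ValueError("n and ne must be positive")
    return n / (n + 2 * ne)

if __name__ == "__main__":
    # Load falls off roughly like 1/Ne once 2*Ne dominates n,
    # and rises with n at fixed Ne.
    for ne in (5, 50, 500, 5000):
        print(ne, [round(expected_load(n, ne), 4) for n in (1, 10, 100)])
```

At n = 10 the load is exactly one half when Ne = 5 and about 1% when Ne = 500, which matches the excerpt's claim that only very small populations are at risk.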
west-hunter  sapiens  genetics  genetic-load  archaics  speculation  methodology  competition  population-genetics  gene-flow  europe  critique  evolution  mutation  pop-structure  multi  study  links  commentary  discussion  context  dimensionality  scitariat  stylized-facts  poast  gene-drift  population  magnitude  street-fighting  nibble  aphorism  pop-diff  africa  antiquity  comparison  troll  lol  stereotypes  alien-character  speaking 
november 2016
Overcoming Bias : Why Men Are Bad At “Feelings”
Mating in mammals has a basic asymmetry – females must invest more in each child than males. This can lead to an equilibrium where males focus on impressing and having sex with as many females as possible, while females do most of the child-rearing and choose impressive males.

Since human kids require extra child-rearing, human foragers developed pair-bonding, wherein for a few years a male gave substantial resource support to help raising a kid in trade for credible signs that the kid was his. Farmers strengthened such bonds into “marriage” — while both lived, the man gave resources sufficient to raise kids, and the woman only had sex with him. Such strong pair-bonds were held together not only by threats of social punishment, but also by strong feelings of attachment.

Such bonds can break, however. And because they are asymmetric, their betrayal is also asymmetric. Women betray bonds more by temporarily having fertile sex with other men, while men betray bonds more by directing resources more permanently to other women. So when farmer husbands and wives watch for signs of betrayal, they watch for different things. Husbands watch wives more for signs of a temporary inclination toward short-term mating with other men, while wives watch husbands more for signs of an inclination to shift toward a long-term resource-giving bond with other women. (Of course they both watch for both sorts of inclinations; the issue is emphasis.)

Emotionally, Men Are Far, Women Near: http://www.overcomingbias.com/2011/08/emotional-men-are-far-women-near.html
Now add two more assumptions:
1. Each gender is more emotional about the topic area (short vs. long term mating) where its feelings are more complex, layered, and opaque.
2. Long term mating thoughts tend to be in far mode, while short term mating thoughts tend to be in near mode. (Love is far, sex is near.)

Given these assumptions we should expect emotional men to be more in far mode, and emotional women to be more in near mode. (At least if mating-related emotions are a big part of emotions overall.) And since far modes tend to have a more positive mood, we should expect men to have more positive emotions, and women more negative.

In fact, even though overall men and women are just as emotional, men report more positive and fewer negative emotions than women. Also, after listening to an emotional story, male hormones help one remember its far-mode-abstract gist, while female hormones help one remember its near-mode-concrete details. (Supporting study quotes below.)

I’ve been wondering for a while why we don’t see a general correlation between near vs. far and emotionality, and I guess this explains it – the correlation is there but it flips between genders. This also helps explain common patterns in when the genders see each other as overly or underly emotional. Women are more emotional about details (e.g., his smell, that song), while men are more emotional about generalities (e.g., patriotism, fairness). Now for those study quotes:

Love Is An Interpretation: http://www.overcomingbias.com/2013/10/love-is-an-interpretation.html
What does it mean to feel loved: http://journals.sagepub.com/doi/abs/10.1177/0265407517724600
Cultural consensus and individual differences in felt love

We examined different romantic and nonromantic scenarios that occur in daily life and asked people if they perceived those scenarios as loving signals and if they aligned with the cultural agreement... More specifically, we found that male participants show less knowledge of the consensus on felt love than female participants... Men are more likely to think about sexual commitment and the pleasure of intercourse when thinking about love, whereas women are more prone to thinking about love as emotional commitment and security... In terms of relationship status, we also found that people in relationships know more about the consensus on felt love than people who are single... Our results also demonstrated personality differences in people’s ability to know the consensus on felt love. Based on our findings, people who were higher in agreeableness and/or higher in neuroticism showed more knowledge about the consensus on felt love... The finding that neuroticism is related to more knowledge of the consensus on felt love is surprising when considering the literature which typically links neuroticism to problematic relationship outcomes, such as divorce, low relationship satisfaction, marital instability, and shorter relationships... Results indicated that in this U.S. sample Black people showed less knowledge about the consensus on felt love than other racial and ethnic groups. This finding is expected because the majority of the U.S. sample recruited is of White racial/ethnic background and thus this majority (White) mostly influences the consensus on the indicators of love.

Lost For Words, On Purpose: https://www.overcomingbias.com/2014/07/lost-for-words-on-purpose.html
But consider the two cases of food and love/sex (which I’m lumping together here). It seems to me that while these topics are of comparable importance, we have a lot more ways to clearly express distinctions on foods than on love/sex. So when people want to express feelings on love/sex, they often retreat to awkward analogies and suggestive poetry.
october 2016
Overcoming Bias : Beware General Visible Prey
So, bottom line, the future great filter scenario that most concerns me is one where our solar-system-bound descendants have killed most of nature, can’t yet colonize other stars, are general predators and prey of each other, and have fallen into a short-term-predatory-focus equilibrium where predators can easily see and travel to most all prey. Yes there are about a hundred billion comets way out there circling the sun, but even that seems a small enough number for predators to carefully map and track all of them.
hanson  risk  prediction  futurism  speculation  pessimism  war  ratty  space  big-picture  fermi  threat-modeling  equilibrium  slippery-slope  anthropic  chart  deep-materialism  new-religion  ideas  bio  nature  plots  expansionism  malthus  marginal  convexity-curvature  humanity  farmers-and-foragers  diversity  entropy-like  homo-hetero  existence  volo-avolo  technology  frontier  intel  travel  time-preference  communication  civilization  egalitarianism-hierarchy  peace-violence  ecology  cooperate-defect  dimensionality  whole-partial-many  temperance  patience  thinking  long-short-run  prepping  offense-defense 
october 2016
CS229T/STATS231: Statistical Learning Theory
Course by Percy Liang covers a mix of statistics, computational learning theory, and some online learning. Also surveys the state-of-the-art in theoretical understanding of deep learning (not much to cover unfortunately).
yoga  stanford  course  machine-learning  stats  👳  lecture-notes  acm  kernels  learning-theory  deep-learning  frontier  init  ground-up  unit  dimensionality  vc-dimension  entropy-like  extrema  moments  online-learning  bandits  p:***  explore-exploit 
june 2016
Talagrand’s concentration inequality | What's new
Proposition 1 follows easily from the following statement, that asserts that if a convex set A ⊂ R^n occupies a non-trivial fraction of the cube {-1,+1}^n, then the neighbourhood A_t := {x ∈ R^n : dist(x,A) ≤ t} will occupy almost all of the cube for t ≫ 1:
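
One special case of this statement can be checked exactly, as a rough sketch. Take A to be the halfspace {x : x_1 + ... + x_n ≤ 0}, which is convex and contains at least half of the cube; the halfspace, the function name, and the sample values are my choices for illustration and do not come from the post. For a cube point x, dist(x,A) = max(0, x_1 + ... + x_n)/sqrt(n), so the fraction of the cube inside A_t is a binomial tail.

```python
import math

def halfspace_neighbourhood_fraction(n, t):
    """Fraction of {-1,+1}^n within Euclidean distance t of {x : sum(x) <= 0}.

    A cube point with p coordinates equal to +1 has sum(x) = 2p - n and lies
    in the t-neighbourhood exactly when 2p - n <= t * sqrt(n).
    """
    if t < 0:
        raise ValueError("t must be non-negative")
    limit = min(n, math.floor((n + t * math.sqrt(n)) / 2))
    favourable = sum(math.comb(n, p) for p in range(limit + 1))
    return favourable / 2 ** n

if __name__ == "__main__":
    # The fraction climbs from about one half toward 1 as t grows,
    # at a rate that does not depend much on the dimension n.
    for n in (100, 400, 1600):
        print(n, [round(halfspace_neighbourhood_fraction(n, t), 4) for t in (0, 1, 2, 3)])
```

Note that t is measured in absolute Euclidean distance while the cube has diameter 2*sqrt(n), so a bounded enlargement already captures almost every vertex.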
exposition  math.CA  math  gowers  concentration-of-measure  mathtariat  random-matrices  levers  estimate  probability  math.MG  geometry  boolean-analysis  nibble  org:bleg  high-dimension  p:whenever  dimensionality  curvature  convexity-curvature 
may 2016
