Internal R&D Project #1: Netflix-O-Matic | Good, Form & Spectacle
I’ve planted a flag in the sand with Good, Form & Spectacle. It’s declaring my territory of interest in the world of cultural heritage. To me, that’s at once a strong direction (libraries, museums, archives), and a broad remit (support open data, encourage exploratory interfaces, promote open access, develop interested community). It might be slightly odd to call Netflix-O-Matic a cultural heritage project, but let me explain why I think it is.

Netflix is a repository of objects that happen to be movies
Each movie has a bunch of metadata about it (directors, actors, genres etc)
Movies are collected together in genres and their variants (Slapstick Comedies; Feel-Good Slapstick Comedies). And Alexis Madrigal wrote an interesting article in The Atlantic Monthly about How Netflix Reverse-Engineered Hollywood, well worth a read.

This sounds a bit like a library. It’s a library.
glo  netflix  library  visualization 
6 hours ago
literary machines - digital libraries, books, mountains
For a long time the only free (i'm unaware of commercial ones) implementation of a web archival replay software has been the Wayback Machine (now Openwayback). It's a stable and mature software, with a strong community behind.
To use it you need to be confident with the deploy of a java web application; not so difficult, and documentation is exaustive.
But there is a new player in the game, pywb, developed by Ilya Kramer, a former Internet Archive developer.
Built in python, relatively simpler than wayback, and now used in a pro archiving project at Rhizome.
waybackmachine  archive  internetarchive  historyboxes  warc  python 
6 hours ago
Escape from Microsoft Word by Edward Mendelson | NYRblog | The New York Review of Books
A friend at Microsoft, speaking not for attribution, solved the mystery. Word, it seems, obeys the following rule: when a “style” is applied to text that is more than 50 percent “direct-formatted” (like the italics I applied to the magazine titles), then the “style” removes the direct formatting. So The New York Review of Books (with the three-letter month May) lost its italics. When less than 50 percent of the text is “direct-formatted,” as in the example with The New Yorker (with the nine-letter month September), the direct-formatting is retained.

No writer has ever thought about the exact percentage of italics in a line of type, but Word is reduced to this kind of arbitrary principle because its Platonic model—like all Platonic models—is magnificent in its inner coherence but mostly irrelevant to the real world. In order to make a connection between heavenly ideas and tangible realities, Plato himself was reduced to inventing something he called the Demiurge, an intermediate being who translates the ideal forms in heaven into something tangible in the world. The Demiurge is an early instance of what programmers call a kludge—a clumsy and illogical expedient for dealing with a problem that seems too intractable to solve more elegantly. Word’s 50-percent rule for applying styles is a descendent of the Demiurge, and just as much of a kludge.
microsoft  publishing  motive 
pywb is a python implementation of web archival replay tools, sometimes also known as 'Wayback Machine'.

pywb allows high-quality replay (browsing) of archived web data stored in standardized ARC and WARC.

pywb can be used as a traditional web application or an HTTP or HTTPS proxy server.
python  archive  internetarchive  warc 
thisisaaronland 2014 draft : Aaron Straup Cope : Free Download & Streaming : Internet Archive
it is october which means that I have posted a draft of all the blog posts from 2014 to the internet /
from twitter
3 days ago
Chef Solo tutorial: Managing a single server with Chef
“The Chef documentation assumes you have an entire server farm to manage, so it hits you with a lot of complexity. If all you want is to set up and maintain a single VM, this tutorial will help.

We will be creating “throwaway” VMs that we can recreate with a single call. If you need to have a single server system persist for years across all changes, you should perhaps check out Puppet for a less cloudy solution.

Using Amazon EC2?

Any fresh instance of an Ubuntu AMI will do, for example ami-06ad526f (11.04, 32-bit EBS in us-east-1).

Hint: The EC2 API tools are (by nature) complex and error-prone, so I recommend you stay with the point-and-click web interface for as long as possible, instead of trying to automate launching new instances.

Hint 2: Create a separate EBS volume as a data partition to hold your databases, etc.

Hint 3: Use Elastic IPs for all your instances.

All you need is

a laptop with Bash and SSH (no need to even install Chef), and

any vanilla Linux server that you can SSH into.

Why do we run against vanilla servers? In my opinion, the system your deployment scripts run against should be as minimal possible, essentially straight from the vendor. So instead of manually installing packages like Chef and then creating a snapshot to instantiate our VMs from, we will use a freshly installed system and make the entire process scripted. On the downside, we need some bootstrapping code (~30 LOC), but on the upside we get more repeatability and less maintenance overhead, and we stay independent from any specific cloud or virtualization provider.”
chef  tutorial  devops  linux  sysadmin  aws  ec2  via:migurski 
3 days ago
[this is aaronland] interpretation roomba
Brilliant work. Inspiring. RT : (Littlenets references this talk: )”
from twitter_favs
5 days ago
Ethermap makes heavy use of WebSockets to synchronize the map editor between all clients. socket.io as a library has been chosen for the automatic fallback to Long-Polling.

Within Leaflet, the L.stamp function has been adapted to provide unique layer id's which are required to synchronize the map content between different clients. Based on the Leaflet.Draw events, all changes are sent to the server, where they are stored in CouchDB and distributed again via WebSockets. Leaflet.Draw has however been modified to allow only one feature to be edited at a time. In addition to that, every change will be transmitted. Usually Leaflet.Draw only applies changes after hitting "Save". The feature properties views are adapted from the JSON categories of the iD editor to provide OSM like feature properties. Changes to the feature properties will also be directly transferred. Features are transferred as GeoJSON objects.

To watch users or show there current workarea, all movements (panning/zooming) are also transferred with the current bounding box of the map window. To prevent feedback, custom event listeners have been used instead of the existing Leaflet events.
maps  geo  community 
5 days ago
RT : Aaron talks about the future, and large pens. @ Makeshift Society Brooklyn
from twitter
5 days ago
The Ins and Outs of the Yahoo Flickr Creative Commons 100 Million Dataset | code.flickr.com
"Some photos and videos (3,350,768 to be exact) carry machine tags. Noteworthy machine tags are those having the “siwild” namespace, referring to photos uploaded by scientists of the Smithsonian, and the “taxonomy” namespace, which refers to photos in which flora and fauna have been carefully classified. The most frequently occurring namespace, “uploaded,” refers to the applications used to share the photos on Flickr, which are principally the Flickr and Instagram iOS apps. Other interesting machine tags are those referring to the different filters that can be applied to a photo, or roughly 750,000 photos. Overall, most machine tags are related to food and drink, events, camera and application metadata, as well as locations."
flickr  machinetags 
7 days ago
What Does the US Military's New Space Plane Really Do? | VICE News
But there is one area in the Air Force that is getting money. It's the new hot and sexy trend for those in love with the technological bleeding edge: hypersonics. As other countries chip away at the stealth advantage, there's been more and more interest in going faster and faster. If stealth technology is based on the idea that you can't hit what you can't see, hypersonics go with the idea that you can't hit what you can't catch. A hypersonic missile could reach out and hit a target very quickly, not only evading defenses but nailing elusive targets before they have a chance to get away.
hypersonices  voicefromabove  whosonfirst  space 
7 days ago
The Ins and Outs of the Yahoo Flickr Creative Commons 100 Million Dataset | code.flickr.com
1.17M machine tags have the "siwild" namespace referring to photos uploaded by scientists of the
from twitter
7 days ago
Electric Objects - Artist Interview Series - Jenny-odell
RT : We’re excited to announce Jenny Odell as the EO x Net Artist in Residence.
from twitter
9 days ago
Make geometries editable in Leaflet.
geo  maps  leaflet  javascript 
9 days ago
the astronauts turned the computers off and guided the moon lander to the surface by touch and feel
aa:idx=index  aa:id=pixelshag  aa:post=pixelshag  aa:year=2012  2012  aa:ima=post 
12 days ago
[this is aaronland] blog posts for 2012
finally all the blog posts since 2006 have proper YMD indexes / for example or and so on
from twitter
12 days ago
[this is aaronland] blog posts for 2007
finally all the blog posts since 2006 have proper YMD indexes / for example or and so on
from twitter
12 days ago
So You Want To Use A Metro Extract
RT : I made one of those big-picture blog posts for newbs searching for information on working with OSM data:
osm  mapzen  from twitter
13 days ago
Fast forward 20 years: Your typical “cloud” Unix server, designed in the 1970s to be a very social place, is today a ghost town with one or two factories still clanking in the town square—factories that receive our email, or accept our Instagram photos and store them, and manage our data. But there’s no one walking around and chatting downtown. So when people talk about “cloud computing” they are talking about millions of tiny ghost towns. Ironic, because what do people build on these ghost towns but social networks.

In the last decade, social networks like Twitter, LinkedIn, Facebook—even Google Plus—appeared to bring all those lonesome folks back together, at huge space. Not just a few dozen people on one computer but millions, even a billion people all sharing one giant meta-computer.

Many of those services make very heavy use of Unix under the hood. So: We collectively took a very social computing platform, papered over its social parts, and used it to build a social computing platform.

Purely for kicks, I decided to turn the social part back on and throw a nerd party.
aarontalk  eyebeam  network  aa:idx=interpretation  aa:id=brick  aa:post=brick  aa:year=2014  2014  aa:ima=post  ftrain 
14 days ago
[this is aaronland] links for interpretation roomba
in other news all the blog posts since 2006 should have their own dedicated “links.html” page now / for example
from twitter
14 days ago
« earlier      
/ 2012 2014 3d aa:id=brick aa:idx=index aa:idx=interpretation aa:idx=things aa:ima=link aa:ima=post aa:post=agency aa:post=airportcity aa:post=barcode aa:post=beta aa:post=brick aa:post=bwr aa:post=dconstruct aa:post=delmaps_02 aa:post=dogeared aa:post=donut-hole aa:post=dragons aa:post=enplacify aa:post=face aa:post=filtr03 aa:post=hills aa:post=historybox aa:post=mw09 aa:post=mw2010 aa:post=mw2011 aa:post=mw2012 aa:post=nearby aa:post=objects aa:post=otaku aa:post=question aa:post=spacemountain aa:post=things aa:post=turkishmmap aa:post=woelr aa:post=ws-decode aa:post=ws-modestmaps aa:post=youarehere aa:post=zomg aa:year=2005 aa:year=2006 aa:year=2007 aa:year=2008 aa:year=2009 aa:year=2010 aa:year=2011 aa:year=2012 aa:year=2013 aa:year=2014 aaronland aarontalk airplanes airports americaland android api appengine architecture archive arduino artisyourfriend aws barcode bonus bwr camera capacityplanning cats cli cloudcastles clustr colour communitiesofsuggestion community computervision cooperhewitt crouching crypto css database delicious design devel dotspotting drones dt:day=05 dt:day=07 dt:day=13 dt:day=19 dt:day=20 dt:day=21 dt:day=26 dt:day=28 dt:day=30 dt:month=01 dt:month=02 dt:month=03 dt:month=04 dt:month=05 dt:month=06 dt:month=07 dt:month=08 dt:month=09 dt:month=10 dt:month=11 dt:month=12 dt:year=2013 dt:year=2014 ec2 email etsy eyebeam facebook finance flickr food foursquare from:dogeared from:instapaper galleries geo git github golang google hardware hidden highlights history historyboxes http instagram ios iphone java javascript jquery keyboard language law london machinetags magicwords maps mobile montreal motive mozilla museum museums-and-the-network must namingthings network newaesthetic nodejs notifications nsa nyc ohyeahthat osm osx painting papernet perl photography php pinboard play prettymaps privacy publicspaces publictransportation pubsub python raspberrypi readme redacted redis roboteyes robots search security semweb sfba solr spacetime stamen stml surveillance sysadmin tagging twitter typos unfinishedhistory unicode unix video visualization voicefromabove websockets whosonfirst wifi wikipedia yahoo

Copy this bookmark: