The Case for Learned Index Structures – Arxiv Vanity
https://arxiv.org/pdf/1712.01208v1.pdf

Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model to indicate if a data record exists or not. In this exploratory research paper, we start from this premise and posit that all existing index structures can be replaced with other types of models, including deep-learning models, which we term learned indexes. The key idea is that a model can learn the sort order or structure of lookup keys and use this signal to effectively predict the position or existence of records. We theoretically analyze under which conditions learned indexes outperform traditional index structures and describe the main challenges in designing learned index structures. Our initial results show, that by using neural nets we are able to outperform cache-optimized B-Trees by up to 70% in speed while saving an order-of-magnitude in memory over several real-world data sets. More importantly though, we believe that the idea of replacing core components of a data management system through learned models has far reaching implications for future systems designs and that this work just provides a glimpse of what might be possible.
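A minimal sketch of the key idea in the abstract above, assuming a single least-squares line stands in for the model (the paper itself talks about neural nets and other model types). The class name `LinearLearnedIndex` and its methods are hypothetical, and the key set is assumed to be non-empty and numeric. The line predicts a key's position in a sorted array, and a binary search bounded by the model's worst observed error corrects the guess.

```python
import bisect


class LinearLearnedIndex:
    """Toy learned index (illustrative only, not the paper's implementation)."""

    def __init__(self, keys):
        # Assumption: keys is a non-empty collection of numbers.
        self.keys = sorted(set(keys))
        n = len(self.keys)
        mean_k = sum(self.keys) / n
        mean_p = (n - 1) / 2
        var = sum((k - mean_k) ** 2 for k in self.keys)
        cov = sum((k - mean_k) * (i - mean_p) for i, k in enumerate(self.keys))
        # Fit position ~ slope * key + intercept by ordinary least squares.
        self.slope = cov / var if var else 0.0
        self.intercept = mean_p - self.slope * mean_k
        # Worst-case prediction error over the stored keys bounds the search.
        self.max_err = max(
            abs(i - self._predict(k)) for i, k in enumerate(self.keys)
        )

    def _predict(self, key):
        pos = int(round(self.slope * key + self.intercept))
        return min(max(pos, 0), len(self.keys) - 1)

    def lookup(self, key):
        """Return the position of key in the sorted array, or None if absent."""
        guess = self._predict(key)
        lo = max(guess - self.max_err, 0)
        hi = min(guess + self.max_err + 1, len(self.keys))
        i = bisect.bisect_left(self.keys, key, lo, hi)
        if i < len(self.keys) and self.keys[i] == key:
            return i
        return None


index = LinearLearnedIndex([3, 8, 15, 16, 23, 42, 108])
print(index.lookup(23))  # position of 23 in the sorted array
print(index.lookup(7))   # None, since 7 is not stored
```

The closer the keys are to evenly spaced, the smaller `max_err` becomes and the less searching is needed after the prediction.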
#technology  #database  #machine.learning 
2 days ago
Twitter
♫ My Top 5 artists: Queens of the Stone Age (17), The National (13), Margo Price (10), The Flatliners (6) &…
lastfm  from twitter
5 days ago
Twitter
"When scientists say bears are going extinct, I want people to realize what it looks like," says photographer Paul…
from twitter_favs
7 days ago
Miguel Grinberg's Tutorials - The Flask Mega-Tutorial
The Flask Mega-Tutorial is an overarching tutorial for Python beginner and intermediate developers that teaches web development with the Flask framework. The tutorial has been thoroughly revised and expanded in 2017, now containing 23 chapters. The concepts that are covered go well beyond Flask, including a wide range of topics Python web developers need to know when writing their own applications. You can preview this tutorial on Miguel Grinberg's blog.

Order Ebook + Video $60

Order Ebook $15
#technology  #python  #tutorial 
7 days ago
Counting Page Views in PostgreSQL < SQL | The Art of Web
This is an example of a seemingly simple task that becomes more complicated as it fails to adapt to increases in database size and traffic.
#technology  #data  #database  #postgresql  #public.good 
9 days ago
Faster PostgreSQL Counting
Everybody counts, but not always quickly. This article is a close look into how PostgreSQL optimizes counting. If you know the tricks there are ways to count rows orders of magnitude faster than you do already.

The problem is actually underdescribed – there are several variations of counting, each with its own methods. First think whether you need an exact count or whether an estimate suffices. Next, are you counting duplicates or just distinct values? Finally do you want a lump count of an entire table or will you want to count only those rows matching extra criteria?

We’ll analyze the techniques available for each situation and compare their speed and resource consumption. After learning about techniques for a single database we’ll use Citus to demonstrate how to parallelize counts in a distributed database.
#technology  #postgresql  #optimization  #data  #database  #public.good 
9 days ago
Import.io | Extract data from the web
Web scraping service, data extraction, etc.
#technology  #api 
15 days ago
gopass - the team password manager
gopass is a simple but powerful password manager for your terminal.
#technology  #security  #golang 
16 days ago
Rick Rubin on Crashing Kanye’s Album in 15 Days
wisdom

There’s a tremendous power in using the least amount of information to get a point across.
#music  #philosophy 
16 days ago
Twitter
RT : No community is immune.

Support the solution: .

GivingTuesday2017  from twitter
18 days ago
Twitter
RT : Did you know that our upcoming show The Ears Have Walls spans two galleries? Join us for the Vernissage at
from twitter
18 days ago
Twitter
Really good thread with lots of specific suggestions on how the NYT's "wow Nazis are just like us!" piece could hav…
from twitter_favs
19 days ago
She helps connect people with causes at Public Good - Chicago Tribune
Public Good is in the business of making it easier for people to do good through technology. Our co-founders were leading technologists at the Obama campaign in 2012, and prior to that, they had experience in the world of e-commerce.
#technology  #public.good 
24 days ago
Twitter
It is worth reflecting on just how wildly unsupported by the public and wrong the FCC is on its effort to end an Op…
from twitter_favs
25 days ago
Twitter
RT : Engaging discussion on Data Science for Good panel moderated by w/ Frances Zlotnick from Ha…
from twitter_favs
26 days ago
Twitter
"Portland is a city that pushes black people out of neighborhoods and replaces them with 'Black Lives Matter' signs…
from twitter_favs
4 weeks ago
Public Benefit Corporation Practice Note
This Practice Note provides an overview of public benefit corporations organized in Delaware. It highlights the special provisions that apply to this type of corporation, including how public benefit is defined, duties of directors, notice and reporting requirements, and the conversion process.
#entrepreneurship  #startup  #public.good 
4 weeks ago
ferd.ca -> Simhashing (hopefully) made simple
When it comes to figuring out how similar various pieces of data are from one another (and which is the closest matching one in a large group of candidates), simhashing is one of my favourite algorithms. It's somewhat simple, brilliant in its approach, but still not obvious enough for most people (myself included) to come up with it on their own. Readers may be familiar with hashing algorithms such as MD5 or SHA, which aim to very quickly create a unique signature (hash) of the data. These functions are built so that identical files or blobs of data share the same hash, so you can rapidly see whether two blobs are identical or not, or if a blob still has the same signature after transmission to see if it was corrupted or not. Then different blobs, even if mostly the same, get an entirely different signature.
While simhashes still aim to have unique signatures for documents, they also attempt to make sure that documents that look the same get very similar hashes. That way, you can look for similar hashes to figure out if the documents are closely related, without needing to compare them bit by bit. It's a statistical tool to help us find near-duplicates faster.
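A rough sketch of the idea described above, assuming whitespace-separated tokens as features, equal weights, and MD5 as the per-token hash. These choices and the function names `simhash` and `hamming_distance` are illustrative assumptions, not taken from the linked article. Each token votes on every bit of the fingerprint, and similar documents should end up a small Hamming distance apart.

```python
import hashlib


def simhash(text, bits=64):
    """Return a simhash fingerprint of text as an integer.

    Assumption: bits is a multiple of 8 and at most 128 (the MD5 digest size).
    """
    counts = [0] * bits
    for token in text.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[: bits // 8], "big")
        for b in range(bits):
            # Each token votes +1 or -1 on every bit position.
            counts[b] += 1 if (h >> b) & 1 else -1
    fingerprint = 0
    for b in range(bits):
        if counts[b] > 0:
            fingerprint |= 1 << b
    return fingerprint


def hamming_distance(a, b):
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox jumped over the lazy dog"
print(hamming_distance(simhash(doc_a), simhash(doc_b)))
```

Comparing fingerprints by Hamming distance is what lets near-duplicates be found without comparing the documents themselves bit by bit.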
#technology  #algorithms  #public.good  #simhash 
5 weeks ago
The Boomtown That Shouldn’t Exist - POLITICO Magazine
Cape Coral, Florida, was built on total lies. One big storm could wipe it off the map. Oh, and it’s also the fastest-growing city in the United States.
#technology  #real.estate 
6 weeks ago
[1710.08522] Automating, Operationalizing and Productizing Journalistic Article Analysis
Public Good Software's products match journalistic articles and other narrative content to relevant charitable causes and nonprofit organizations so that readers can take action on the issues raised by the articles' publishers. Previously an expensive and labor-intensive process, application of machine learning and other automated textual analyses now allow us to scale this matching process to the volume of content produced daily by multiple large national media outlets. This paper describes the development of a layered system of tactics working across a general news model that minimizes the need for human curation while maintaining the particular focus of concern for each individual publication. We present a number of general strategies for categorizing heterogenous texts, and suggest editorial and operational tactics for publishers to make their publications and individual content items more efficiently analyzed by automated systems.
#technology  #public.good  #machine.learning 
6 weeks ago
Building High-level Features Using Large Scale Unsupervised Learning
We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200x200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting with these learned features, we trained our network to obtain 15.8% accuracy in recognizing 22,000 object categories from ImageNet, a leap of 70% relative improvement over the previous state-of-the-art.
#technology  #machine.learning 
6 weeks ago
HTTP Status Codes
This page is created from HTTP status code information found at ietf.org and Wikipedia. Click on the category heading or the status code link to read more.
#technology  #http  #rest  #api 
11 weeks ago
Documenting Hate News Index | ProPublica
This page lists media reports, collected by Google News, about hate crimes and bias incidents. The “keywords” column contains names and places found in the news reports in the “articles” column. The larger the word, the more prevalent it is in those stories. Select one of the words to see a list of news stories that contain it. Download the data.
#technology  #data  #analytics  #machine.learning  #public.good 
12 weeks ago
45648.pdf
Using Native Plants in the Garden Flyer
#landscaping 
12 weeks ago
books&webIL.pdf
Illinois Native Landscaping Book Recommendations
#landscaping 
12 weeks ago
Common Crawl
We build and maintain an open repository of web crawl data that can be accessed and analyzed by anyone.
#technology  #data  #data.science  #analytics 
12 weeks ago
Twitter
Study shows mainstream press condemns Nazis and anti-racist activists at comparable rates
from twitter_favs
12 weeks ago
Wild Stonecrop (Sedum ternatum)
Comments: Both the flowers and the foliage are quite attractive; this native species should be grown in flower gardens more often. Wild Stonecrop is the only Sedum sp. that is native to central and northern Illinois; other Sedum spp. that have naturalized in this portion of the state have been introduced from abroad for horticultural purposes.
#landscaping 
september 2017
NestWatch
Whether you’re buying or building, this infographic will take you on a tour of a state of the art birdhouse.
#landscaping 
august 2017
Gandalf Decision Engine · Apiary
Gandalf is an Open-Source Decision Engine for Big-Data.
#technology  #database  #public.good 
august 2017
Why Product Market Fit Isn't Enough — Brian Balfour's Coelevate
One of my main observations is that there are certain companies where growth seems to come easily, like guiding a boulder down hill. These companies grow despite having organizational chaos, not executing the “best” growth practices, and missing low hanging fruit. I refer to these companies as Smooth Sailers - a little effort for lots of speed.

In other companies, growth feels much harder. It feels like pushing a boulder up hill. Despite executing the best growth practices, picking the low hanging fruit, and having a great team, they struggle to grow. I refer to these companies as Tugboats - a lot of effort for little speed.

What is the difference between these two types of companies? This is a question I’ve pondered for a long time and have pieced together a framework to explain the difference. The framework has many implications for how you seek out growth and build a company. 
#technology  #entrepreneurship  #startup  #business 
august 2017
A Few Useful Things to Know About Machine Learning
Machine learning algorithms can figure out how to perform important tasks by generalizing from examples. This is often feasible and cost-effective where manual programming is not. As more data becomes available, more ambitious problems can be tackled. As a result, machine learning is widely used in computer science and other fields. However, developing successful machine learning applications requires a substantial amount of “black art” that is hard to find in textbooks. This article summarizes twelve key lessons that machine learning researchers and practitioners have learned. These include pitfalls to avoid, important issues to focus on, and answers to common questions.
#technology  #data.science  #machine.learning  #public.good 
july 2017
Deep Learning Project
An end to end implementation of a Machine Learning pipeline
#technology  #data.science  #machine.learning  #public.good 
july 2017
Twitter
Guess what, folks? I've seen hackathons that're only 8 hours per day & people go home & sleep

If yours isn't like…
from twitter_favs
july 2017
KPI dashboard software for businesses | Geckoboard
LIVE TV DASHBOARD SOFTWARE FOR BUSINESSES
Geckoboard helps everyone in your team focus on work that improves the metrics that matter to your business.
#technology  #metrics  #visualization  #management 
july 2017
AP computer science doubles, led by female, black, Latino students
"AP computer science doubles, led by female, black, Latino students..."
from twitter_favs
july 2017
(429) https://twitter.com/voxdotcom/status/884430174703558656/video/1
I am completely convinced by this argument to move major government agencies from DC to the Midwest:
from twitter_favs
july 2017
Norway Maple (Acer platanoides)
This is the tree on the parkway in our front yard.
#landscaping  #botany 
july 2017
White Oak (Quercus alba)
The native White Oak is a common tree that is found in every county of Illinois (see Distribution Map). It is the state tree of Illinois.
#landscaping  #botany 
july 2017
Common Chicory (Cichorium intybus)
This plant originated from Eurasia, and was present in Illinois by the late 19th century. It has been spreading steadily ever since and has become common, particularly in disturbed open areas. Typical habitats include pastures, abandoned fields, areas along roadsides and railroads, grassy areas that are not mowed regularly
#landscaping  #botany 
july 2017
Ravens ignore a treat in favor of a useful tool for the future | Ars Technica
RT : Corvids are so amazing! Ravens choose a tool over a treat, demonstrating ability to plan for the future
from twitter_favs
july 2017
Twitter
RT : We're so happy to see the Kickstarter for the VGA Gallery running strong. If you value games and art as culture, su…
from twitter
july 2017
The Rise of the Thought Leader | New Republic
But intellectual intervention alone will never be enough. The same conditions that gave us the Ideas Industry overwhelmingly favor concentrated economic and political power. Even as we cast a critical light on the connections between one percenters and thought leaders, we must organize in the physical and social world where the “ideas” of the economic elite have their most pernicious effects. The new energy behind the unionization of the academy and the media are an excellent start, but only a start. What intellectuals need is the same as what everyone else needs: a society that prioritizes human flourishing over private profit, and strong political networks that guard public goods against the prophets of an atomized, high-tech future. However difficult that society may be to achieve, one thing about the present gives hope. We are finally getting clear about who its enemies are.
#politics 
july 2017
Friends IRL
We're all experts at something.
CHICAGO WEBFRIENDS is a monthly gathering of designers, developers, and friends from all over the city of Chicago. Join us each month as a member of our group shares the insights, knowledge, and skills they're passionate about!
#technology  AList  #networking 
july 2017
Twitter
RT : The 's Ryan Smith gets the scoop on VGA's new home, some damn fine reading right there
from twitter_favs
july 2017
Twitter
RT : Pls RT! I'm starting a weekly mailing list with diversity & inclusion tips for tech companies employees and open so…
from twitter
july 2017
Twitter
RT : so y'all are just gonna skip the whole slavery and native genocide part
from twitter_favs
july 2017
Give Me Urban Prairie, Give Me My Home | Center for Humans & Nature
We tend to think wildness doesn’t exist in cities, that this incredibly altered environment for the purpose and pleasure of one species is devoid of ecology. And yet, we know this is not true when we see weeds in cracks or slam on brakes for birds crossing the road. Earlier this year a prairie chicken was found in the entertainment district of my city, stuck in a storm drain, then caught and released into a nearby tallgrass prairie remnant. There’s irony in this act of compassion—moving a bird from her former home to an echo of her home that likely also can’t sustain her.
#landscaping  #nature 
june 2017
How to Stop Worrying and Start Loving Clay Soil
There are a few basic soil types: loam, silt, sand and clay, with variations between. We’ve been taught to want a loam or silt-loam soil, with its high organic matter and nice drainage, but it’s OK if you don’t already have that magical concoction. I’m here to tell you that you can still have the flower garden of your dreams in clay without having to buy amendments by the truckload or till it all in — disrupting soil structure and life as you till.
#landscaping  #kenilworth 
june 2017
Twitter
After yesterday's tweets, I put this in my front yard today.
from twitter_favs
june 2017
A story of a woman in the tech industry – Karolina Szczur – Medium
RT : I’ve been in tech for 12 years—here’s my story. Warning: you won’t like what reality women face in the industry.
from twitter
june 2017
Untitled (https://www.wired.com/2015/02/air-traffic-control/)
On a small scale, sure. On a large scale and dealing with regulations and bureaucracy, you get this:
from twitter
june 2017
Twitter
Real-world analogies are hard, but depends on the complexity and size of the system. Similarly you c…
from twitter
june 2017
Twitter
Sounds like software. Success --> More devs and features --> increased times to maintain an…
from twitter
june 2017