plaxx + distributed   75

Apache NiFi
open source data flow software suite created by the NSA
distributed  data  etl  nifi  software  apache 
january 2019 by plaxx
Upspin · Upspin
Upspin is an experimental project to build a framework for naming and sharing files and other data securely, uniformly, and globally: a global name system of sorts.
distributed  filesharing  sharing  google  golang  file  naming 
april 2017 by plaxx
Matrix is an open standard for interoperable, decentralised, real-time communication over IP. It can be used to power Instant Messaging, VoIP/WebRTC signalling, Internet of Things communication - or anywhere you need a standard HTTP API for publishing and subscribing to data whilst tracking the conversation history.
im  chat  communication  distributed  voip  internet  protocol 
march 2016 by plaxx
TensorFlow open source machine learning library
TensorFlow™ is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them.
data  graph  opensource  library  c++  python  machine-learning  ai  research  neural  processing  distributed  cpu  gpu 
december 2015 by plaxx
coreos/etcd · GitHub
etcd is a distributed, consistent key-value store for shared configuration and service discovery, with a focus on being:

Simple: curl'able user facing API (HTTP+JSON) Secure: optional SSL client cert authentication Fast: benchmarked 1000s of writes/s per instance Reliable: properly distributed using Raft
distributed  configuration  sysadmin  etcd  lock  high-availability  architecture  golang 
november 2015 by plaxx
Apache ZooKeeper - Home
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications.
configuration  deployment  distributed  zookeeper  java  opensource  lock  high-availability 
november 2015 by plaxx
Presto | Distributed SQL Query Engine for Big Data
Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes.
database  sql  big-data  hadoop  presto  distributed  data  analytics  db 
december 2014 by plaxx
CoreOS is Linux for Massive Server Deployments
CoreOS enables warehouse-scale computing on top of a minimal, modern operating system.
linux  distro  cloud  deployment  devops  coreos  server  sysadmin  distributed 
october 2014 by plaxx
Celery: Distributed Task Queue
Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well. The execution units, called tasks, are executed concurrently on a single or more worker servers using multiprocessing, Eventlet, or gevent. Tasks can execute asynchronously (in the background) or synchronously (wait until ready).
distributed  python  task  queue  multi-process  daemon  worker  scalability 
september 2014 by plaxx
hydra - distributed data processing and storage system
It ingests streams of data (think log files) and builds trees that are aggregates, summaries, or transformations of the data. These trees can be used by humans to explore (tiny queries), as part of a machine learning pipeline (big queries), or to support live consoles on websites (lots of queries).
distributed  cluster  streaming  data  processing  framework  big-data 
april 2014 by plaxx
Welcome to Apache Flume — Apache Flume
Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.
apache  logging  sysadmin  distributed  search 
february 2014 by plaxx
Serf is a service discovery and orchestration tool that is decentralized, highly available, and fault tolerant. Serf runs on every major platform: Linux, Mac OS X, and Windows. It is extremely lightweight: it uses 5 to 10 MB of resident memory and primarily communicates using infrequent UDP messages.
discovery  service  sysadmin  configuration  cloud  scalability  high-availability  distributed 
january 2014 by plaxx
gearman [Gearman Job Server]
Gearman provides a generic application framework to farm out work to other machines or processes that are better suited to do the work. It allows you to do work in parallel, to load balance processing, and to call functions between languages.
distributed  gearman  scalability  cluster  performance  queue  server  high-availability 
november 2013 by plaxx

related tags

ai  algorithm  analysis  analytics  apache  architecture  article  backup  big-data  bittorrent  bugs  bugtracking  bzr  c++  caching  cep  cfengine  chat  citusdb  cli  cloud  cluster  clustering  collaboration  communication  computation  computer-science  computing  configuration  corba  coreos  cowrie  cpu  cracking  crypto  cryptography  daemon  data  database  datacenter  dataformat  db  dbus  decentralized  deploy  deployment  desktop  dev  development  devops  discovery  disk  distributed  distro  drbd  dvcs  dynomite  eclipse  egit  elk  embedded  encryption  engine  erlang  etcd  etl  exchange  extension  fedora  file  filesharing  filesystem  format  framework  func  functional  gearman  git  golang  google  gpfs  gpl  gpu  graph  grid  gui  ha  hadoop  high-availability  honeypot  hosting  how-to  http  im  infrastructure  init  internet  interview  ipython  issue-tracking  java  job  jvm  key-value  keynote  kibana  language  library  linux  lock  logging  logstash  lustre  machine-learning  management  math  memory  message  messaging  monitoring  monotone  mpi  mtn  multi-platform  multi-process  naming  netflix  neural  nifi  nosql  opensource  os  p2p  packages  Packaging  parallel  parity  password  performance  persistence  plan9  plugin  poc  postgres  postgresql  presentations  presto  processing  programming  protocol  proxy  pub-sub  python  QA  queue  raft  raid  realtime  redhat  reference  replication  research  rest  rocks  ruby  rust  sas  scala  scalability  scalable  science  scm  search  security  serialization  server  service  shard  sharing  shell  soa  software  sql  sqlite  ssh  storage  stream  streaming  sun  sysadmin  systemd  task  technology  testing  tool  tracker  tutorial  unicode  vcs  version-control  virtualization  voip  vpn  web  webdev  webservices  windows  worker  workload  world  wrapper  xml  zookeeper 

Copy this bookmark: