cs   31100

« earlier    

Explore - LeetCode
Curated topic based curricula for CS.
education  cs  free  leetcode 
14 hours ago by nebkor
Deduplicating files in Public Git Archive · source{d} blog
This summer, we announced the release of Public Git Archive, a dataset with 3TB of Git data from the most starred repositories on GitHub. Now it’s time to tell how we tried to deduplicate files in the latest revision of the repositories in PGA using our research project for code deduplication, src-d/apollo. Before diving deep, let’s quickly see why we created it. To the best of our knowledge, the only efforts to detect code clones at massive scale have been made by Lopes et. al., who leveraged a huge corpus of over 428 million files in 4 languages to map code clones on GitHub (DéjàVu project). They relied on syntactic features, i.e. identifiers (my_list, your_list, …) and literals (if, for, …), to compute the similarity between a pair of files. PGA has fewer files in the latest (HEAD) revision - 54 million, and we did not want to give our readers a DéjàVu by repeating the same analysis. So we aimed at something different: not only copy-paste between files, but also involuntary rewrites of the same abstractions. Thus we extracted and used semantic features from Universal Abstract Syntax Trees.
cs  git  github  source  dedupe 
2 days ago by euler
Functional Programming Principles in Scala | Coursera
Functional Programming Principles in Scala from École Polytechnique Fédérale de Lausanne. Functional programming is becoming increasingly widespread in industry. This trend is driven by the adoption of Scala as the main programming language for ...
cs  course  coursera  mooc 
2 days ago by yoci642
Decision Tables • Hillel Wayne
A decision table is a means of concisely representing branching and conditional computations. In the most basic form, you have some columns that represent the “inputs” as booleans and some columns that represent outputs and effects. It looks like this:
programming  software-engineering  cs 
3 days ago by hellsten

« earlier    

related tags

3d  ai  algorithms  automated  badscience  bigdata  blog  blogpost  blogs  book  books  btree  buffer-overflow  buy  c-lang  c  cache  careers  cheatsheet  class  code  codefixer  codesniffer  college  communication  compiler-internals  compilers  compsci  computer  computergraphics  computers  computerscience  computing  couchdb  course  coursera  cpp  cpu  ct  data  database  databases  dataflow  datalog  datastructures  debugging  dedupe  deep-wizardry  distributedsystems  ds  edu  education  error  errorcorrection  errorhandling  fec  flash  fp  free  git  github  graph  graphics  guides-tutorials-courses  hacking  hackme  hettinger  history  how-to  http  ideje  ifttt  import  inspiration  internet  interviews  izzivi  javascript  kernel  lang  latency  learn  leetcode  linux  lisp-lang  lisp  list  lists  map-files  masters  math  maths  mba  mccarthy  memory  metrics  micro:bit  mišljenje  ml  monad  mooc  naloge  nand  ocap  operating  operatingsystems  optimization  os-internals  os  papers  php  phpcodefixer  phpcodesniffer  pl-experimental  pl  pocket  programiranje  programming  projekti  prolog  protocol  psr  psr1  psr2  pycon  python  računalniško  računalništvo  re  rendering  research  reverse_engineering  rust  setup  simulation  sketch  software-engineering  source  stack  standards  statements  statistics  stopwatch  structures  sw-illustrated  symbols  systems  talk  textbook  theory  tool  toread  tutorial  ui  use  usestatements  vid  video  winapi  wince  windows 

Copy this bookmark:



description:


tags: