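Preface: This personal blog was originally meant to cover three themes: Industry, Research, and Market. However, since leaving my job and going back to school, I have drifted further and further from industry and business, and most of the blog's content has become paper reading notes. I recently saw that PingCAP's account on Bilibili had started covering open source and business, and figured I could use it to round out my own knowledge while adding some Market-related content to the site.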

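A recent interpretation of the "Opinions on Regulating the Application and Development of Open Source Technology in the Financial Industry" (《关于规范金融业开源技术应用与发展的意见》)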

Read more »

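Preface: This paper was published by a neighboring team from my time at ByteDance; they mainly work on innovative DB components. The team's core members came from TerarkDB, which ByteDance acquired. I noticed the paper mainly because of some "debate" about it on Zhihu, which also mentioned my advisor's Hailstorm; the paper can be seen as a further extension of remote compaction. Since I am also somewhat interested in TerarkDB and FaaS, I decided to give it a read.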

TL;DR version

Read more »

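Preface: This is a paper my own advisor took part in, published at ASPLOS '20. What I may work on next is related to this line of thinking, so it is worth reading. Separating storage from compute is the broad trend for addressing scalability: in an era where network bandwidth is no longer the bottleneck, it allows storage and compute to be scaled independently. The paper focuses on LSM, treats compaction as the compute part, and designs remote compaction around it.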

TL;DR version

Read more »

This is a 'Paper Reading' post for Course ECE1759. The topic is 'Big Data'. The paper list is as follows:

  • Jeffrey Dean and Sanjay Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, OSDI 2004
  • Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall and Werner Vogels, Dynamo: Amazon's Highly Available Key-value Store, SOSP 2007
  • Kirk Rodrigues, Yu Luo, and Ding Yuan, CLP: Efficient and Scalable Search on Compressed Text Logs, In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21), July 14-16, 2021, pp. 183-198.

MapReduce: Simplified Data Processing on Large Clusters

Read more »

This is a 'Paper Reading' post for Course ECE1759. The topic is 'Bugs'. The paper list is as follows:

  • Dawson Engler, Benjamin Chelf, Andy Chou, and Seth Hallem, Checking System Rules Using System-Specific, Programmer-Written Compiler Extensions, In Proceedings of the 4th Symposium on Operating System Design & Implementation, October 2000, San Diego, California, pp. 1-16.
  • Stefan Savage, Michael Burrows, Greg Nelson, Patrick Sobalvarro, Thomas Anderson, Eraser: a dynamic data race detector for multithreaded programs, ACM Transactions on Computer Systems (TOCS), Nov. 1997, Pages 391-411
  • Zhenmin Li, Shan Lu, Suvda Myagmar, Yuanyuan Zhou, CP-Miner: a tool for finding copy-paste and related bugs in operating system code, Proceedings of the 6th conference on Symposium on Operating Systems Design and Implementation (OSDI), 2004

Checking System Rules Using System-Specific, Programmer-Written Compiler Extensions

Read more »

This is a 'Paper Reading' post for Course ECE1759. The topic is 'Virtualization'. The paper list is as follows:

  • Edouard Bugnion, Scott Devine, and Mendel Rosenblum, Disco: Running Commodity Operating Systems on Scalable Multiprocessors, Proceedings of the Sixteenth ACM Symposium on Operating Systems Principles (SOSP), October 1997, Saint Malo, France.
  • Carl A. Waldspurger, Memory Resource Management in VMware ESX Server, In Proceedings of 5th Symposium on Operating Systems Design and Implementation (OSDI), Dec. 2002
  • Keith Adams and Ole Agesen, A Comparison of Software and Hardware Techniques for x86 Virtualization, In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, October 2006.
  • [Optional reading] P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield, Xen and the Art of Virtualization, In Proceedings of the 19th Symposium on Operating System Principles, October, 2003.

Disco: Running Commodity Operating Systems on Scalable Multiprocessors

Read more »

Introduction

The paper proposes a new design for highly concurrent Internet services, intended to support massive concurrency demands and simplify the construction of well-conditioned services. It keeps services well-conditioned under load, preventing resources from being overcommitted when demand exceeds service capacity.

Read more »

This is a 'Paper Reading' post for Course ECE1759. The topic is 'File System'. The paper list is as follows:

  • Marshall K. McKusick, William N. Joy, Samuel J. Leffler, and Robert S. Fabry, "A Fast File System for Unix," ACM Transactions on Computer Systems, 2(3), August 1984, pp. 181-197.
  • Mendel Rosenblum and John K. Ousterhout, "The Design and Implementation of a Log-Structured File System," Proceedings of the 13th ACM Symposium on Operating Systems Principles, December 1991.
  • Gregory R. Ganger, Marshall Kirk McKusick, Craig A.N. Soules, and Yale N. Patt. "Soft Updates: A Solution to the Metadata Update Problem in File Systems," ACM Transactions on Computer Systems, Vol. 18, No. 2, May 2000, Pages 127-153.

A Fast File System for Unix

Read more »

This is a 'Paper Reading' post for Course ECE1759. The topic is 'Distribution'. The paper list is as follows:

  • Leslie Lamport, Time, Clocks, and the Ordering of Events in a Distributed System, Communications of the ACM 21(7):558-565, July 1978.
  • D. R. Cheriton and W. Zwaenepoel, The Distributed V Kernel and its Performance for Diskless Workstations, Proceedings of the 9th Symposium on Operating Systems Principles, pp. 129-140, November 1983.
  • Cary G. Gray and David R. Cheriton, Leases: An Efficient Fault-Tolerant Mechanism for Distributed File Cache Consistency, Proceedings of the Twelfth ACM Symposium on Operating Systems Principles (SOSP), December 1989, Litchfield Park, AZ, USA.

Time, Clocks, and the Ordering of Events in a Distributed System

Read more »