My name is Wei Zhang.
I am a Ph.D student in the Department of Computer Science
at Texas Tech University.
My research interest broadly lies in:
High Performance Computing
Particularly focusing on:
Graph Data Management
Metadata Indexing Technology
Please contact me by Email: X-Spirit [dot] zhang [at] ttu.edu
Find Me on Linked-In :
Find Me on GitHub :
Aug/2014 - Present
PhD Program in Computer Science Texas Tech University, United States
Sep/2003 - Jul/2007
Bachelor Degree in Computer Science Hebei University of Science and Technology, China
Thesis: “Feed-Based Online Socializing”
W.Zhang, S.Byna, H.Tang, B. Williams, Y.Chen. MIQS: Metadata Indexing and Querying Service for Self-describing File Formats. Accepted to appear in The Proceedings of The 31st ACM/IEEE Supercomputing Conference (SC’19), Denver, CO, 2019. (first-around acceptance rate: 72/344=21%, another 15 papers being asked for major revisions per SC’19)
W.Zhang, H.J. Tang, S. Byna, Y. Chen. DART: Distributed Adaptive Radix Tree for Efficient Affix-based Keyword Search on HPC Systems. In the Proceedings of The 27th International Conference on Parallel Architectures and Compilation Techniques (PACT '18), 2018. (acceptance rate: 36/126=28.6%)[Paper][Slides][Teaser Video][BibTex]
W. Zhang, Y. Chen and D. Dai. AKIN: A Streaming Graph Partitioning Algorithm for Distributed Graph Storage Systems. Accepted to appear in The Proceedings of The 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2018. (acceptance rate: 20.8%) [Paper] [Slides][Presentation] [Poster][BibTex]
D.Dai, W.Zhang, Y.Chen. IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases, In the Proc. of The 26th ACM International Symposium on High Performance Parallel and Distributed Computing (HPDC'17), 2017. (acceptance rate: 19%)
D.Dai, W.Zhang, Y.Chen. IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases[POSTER], PPoPP 2017
D.Dai, Y.Chen, P.Carns, J.Jenkins, W.Zhang, R.Ross. GraphMeta: Managing HPC Rich Metadata in Graphs, in Proceedings of the IEEE International Conference on Cluster Computing, (Cluster'16), 2016. (acceptance rate: 39/162=24.1%)
W.Zhang, Y.Chen. Analysis on Graph Partitioning for Distributed Graph-centric Computing [PDF]
May/2017 - Aug/2017
Research Intern at Scientific Data Management Group, Lawrence Berkeley National Laboratory
Investigated scientific data management solutions in HPC environment
Investigated data indexing technologies for improving metadata search efficiency
Baseline evaluation on existing metadata management platform.
Proposed new system architecture for indexing metadata in object-centric storage
Proposed new distributed indexing technology for prefix search and suffix search
Implemented the proposed metadata indexing and conducted the evaluation.
Aug/2016 - May/2017
Teaching Assistant at Computer Science Department, Texas Tech University
Lab Instructor and Course Grader
Data Structure Lab: Conducted Lab sessions for the students.
Grader and Q&A session host for the following Courses:
Object Oriented Programming
Advanced Operating System
Computer Organization and Assembly Language
Research Assistant at STARLab, Texas Tech University
Geo-spatial data mining and analysis on social network.
Data mining platform on top of Spark, HBase and Hadoop.
Optimization were performed, especially data compression on the entire technical stack.
Geo-spatial visualization on social media user distribution.
Implemented both NodeJS version and Python version of the data processing and visualization platform, along with Redis serving as the data storage.
Twitter sentiment analysis
Implemented with both Naive Bayes model and Stanford CoreNLP package.
Geo-spatial demographic and political information analysis based on social media users.
Implemented residential location recognition using unsupervised clustering technique, conducted demographic information analysis on the basis of residential location analysis and Naive Bayes analysis on the statistical data of name, age and gender. Also conducted political preference analysis using the result of twitter sentimental analysis on the topic of 2012 U.S. presidential election.
Jan/2016 - Dec/2016
Jan/2015 - May/2015
Student Assistant at DISCL, Texas Tech University
Implementation improvement on research project for better code quality as well as evaluation and experiments.
GraphFS: Project re-factoring on the implementation of GraphFS, makes it more mod- ularized.
Control Experiments on Meshwork(In comparison with GraphMeta implementation).)
Control Experiments on Titan(In comparison with GraphMeta), the related code implementation and performance tuning.
Feb/2014 - Jul/2014
Senior System R&D Engineer at Beijing Serious Technology Co., Ltd , China
Designed and developed server-side application which provides distributed data service with great performance, high availability, flexible scalability for Enjoy!.
PCVF: Parameter constraining and validating framework for RESTful Web Service APIs.(Secretive project.)
DevOps practice involving Maven, Jenkins, Unit Testing and customized document generator for RESTful Web Service APIs(Compatible with PCVF.)
BrookSide: Message processing framework for AMQP (specifically RabbitMQ)
Meshwork: Graph-like data access API for both MySQL and Redis
Commons: Several utilities including redis access API providing HA capability for spring-data-redis
Webshot-rest-amqp-service: A NodeJS project for capturing the snapshot for any specified website according to the message received from amqp implementation like rabbitmq.
2010 - 2013
System R&D Engineer at Sina.com Technology (China) Co.,Ltd.
Design and development of several critical server-side applications providing distributed data services for open platform of weibo.com (the Chinese version of twitter), including URL shorten service T.cn and user & profile data service for Weibo.com.
REST API optimization for boosting the friendliness of Weibo REST API, several fruitful work was done, including the development of BDD Testing Tool, specification for Weibo Open API documentation, specification for Weibo Open API implementation.
T.cn: A url shorten service and its related url-hits counting program.
In charge of user data service for Weibo Open API which is the critical data access path of almost every single REST API for weibo.com, which requires high performance and high availability as well as the flexibility to change in terms of the functionality.
User service v2.0 for Weibo Open API. In charge of the data migration, service migration as well as the development of the critical distributed data service and message processing system.
Cache service optimization for user service v2.0. Reduce the total Memcache resource usage based on the intensive analysis to the system cache usage.
Visualized service monitoring system from which one can easily watch the running status of the user service in terms of the cache hit ratio, throughput of MySQL instances as well as user-related critical services such as the Relationship service and the Feed service.
Senior Software Developer at Beijing JustMusic Co.,Ltd., China
Designed and developed business data management system for JustMusic! Co.,Ltd.
Development of business data management system.
Simple batch processing framework.
Software Developer at Beijing Datuu.com Technology Co.,Ltd., China
Developed data service for business management system.
Development of operation management system, including routine feature development and data maintenance as well as generating business report.
Java, NodeJS, Scala, Python, C/C++, RESTful WebService, Bash, .Net, Ruby.
AWS experience, Docker
MySQL, Oracle, SQL Server, NoSQL Databases: Memcache, Redis, Neo4J, HBase
Linux, Unix, Windows
Design Patterns, UML, Continuous Integration.