¡î
Mitigating Scalability Walls of Container Networks
|
✎ RDMA-offloaded container networks (RCNs) provide
high-performance data communications among containers. But this
is subject to scale¡ªwhen there are millions of containers
running in a data center, the performance drops sharply. In
particular, we observe that most performance issues are related
to RDMA NICs (RNICs), whose defects might constitute the
¡°scalability wall¡± of the RCN. Hence, we conduct combinatorial
causal testing to efficiently reason about an RNIC's
architecture model, effectively approximate its performance
model, and proactively optimize the offloading schedule. We
embody the design into a practical system dubbed
ScalaCN, which can
increase the bandwidth by 1.4¡Á and decrease the latency by 31%.
♨
Paper in
NSDI'25 |
|
¡î
Mobile Cloud Gaming with Low Interactive Latency
|
✎ Via cloud-side computing & rendering, mobile cloud
gaming (MCG) should deliver high-quality gaming experience to
budget mobile devices. But our study on mainstream MCG platforms
reveals that even under good network conditions, all platforms
exhibit high interactive latency of 112-403 ms, from a
user-input action to its display response. To address this, we
conduct the first in-depth analysis on interactive latency, and
identify VSync (the synchronization primitive of Android
graphics pipeline) to be a key contributor to the excessive
interactive latency. Thus, we design a whole-loop regulator (LoopTailor) to
minimize VSync events, reducing the latency to
stably below 100 ms.
♨
Paper in
NSDI'25 |
|
¡î
Efficient Virtual System-on-Chip on Heterogeneous Hardware
|
✎
Emerging mobile apps (e.g.
UHD/AR/VR/metaverse) require accessing diverse high-throughput
hardware like video codecs, cameras, and image processors, but
today's mobile emulators exhibit poor performance when emulating
them. We pinpoint the major reason to be the discrepancy between
guest's and host's memory architectures -- mobile guest's
centralized memory on system-on-chip (SoC) vs. PC/server's
separated memory on individual hardware. It makes the shared
virtual memory (SVM) architecture of mobile emulators highly
inefficient. To address this, we design
vSoC that enables
virtual devices to efficiently share data through a unified SVM
framework, and implement it for Android and OpenHarmony.
It has
been adopted by the Huawei DevEco
Studio IDE with millions of developers.
♨
Paper in
SOSP'24 |
|
¡î
High-Performance
Mobile Emulation through Graphics Projection
|
✎
Mobile emulation is pivotal to the mobile ecosystem by
facilitating app debugging, PC-based mobile gaming, market-scale
malware detection, and so on. By adding a small-size graphics
projection space inside the guest memory, where the critical
control contexts and resource handles required by the host GPU
are strategically maintained and delivered, we build a
high-performance
Android emulator called
Trinity which exhibits excellent
efficiency, compatibility, and security at the same time for the
first time. It has
been adopted by the Huawei DevEco
Studio IDE with millions of developers.
♨
Papers in
OSDI'22
and
TOCS'24 |
|
¡î
A
Better Process Management Model for Android
|
✎ Android users constantly suffer from slow UI responsiveness
(SUR). We conduct the first large-scale measurement study on SUR
with 47M devices, identifying the root causes of SUR from the
perspectives of device, system, application, and app market. The
largest root cause lies in the wide existence of ¡°hogging¡± apps,
which persistently occupy an unreasonable amount of system
resources by leveraging the optimistic design of Android process
management. Hence, we remodel Android process states by fully
considering their time-sensitive transitions and the actual
behaviors of processes, reducing the occurrences of SUR by
60%, saving the battery consumption by 10.7%, and benefiting
hundreds of millions of MIUI
OS
users.
♨
Paper in
MobiCom'24 |
|
¡î
Nationwide
Study on
Cellular (Data Connection)
Reliability
|
✎ We make the first large-scale, in-depth study on
cellular reliability with over 70M Android phones across 34
different hardware models. We identify the critical factors that
affect cellular reliability and clear up misleading intuitions
indicated by common wisdom. More importantly, we have developed
enhancements that effectively address cellular reliability
issues with remarkable real-world impact -- our optimizations on
Android's cellular implementations have reduced 40% cellular
connection failures for 5G phones and 36% failure duration
across all phones, benefiting ~28M MIUI
OS
users.
♥
Best Student Paper Award
in
SIGCOMM'21 |
|
¡î
Why Does Android
Stop Responding?
|
✎ Android users often experience poor responsiveness, in
particular Application Not Responding (ANR) and System Not
Responding (SNR) that directly disrupt user experience. We make
an in-depth study of ANR/SNR at scale in the wild, uncover
multifold root causes of ANR/SNR, and pinpoint the largest
pathology that roots in Android's flawed implementation of Write
Amplification Mitigation (WAM). We design a practical approach
to eliminating this largest root cause, which has been adopted
by the MIUI
OS,
benefiting over 20 million users.
♨
Papers in
MobiCom'20
and
TMC'24 |
|
¡î
High-Fidelity
Mobile Emulation with Virtual Device Farms
|
✎ To understand the fidelity of virtual devices for mobile
app testing in industrial settings, we build a massive
commercial testing infrastructure that consists of a physical
device farm and its virtualized counterpart. We show that
high-fidelity app testing can be achieved by sensible design and
implementation of virtual device farms, and present effective
solutions to significantly improve testing fidelity. We also
share our experiences of using virtual device farms to
substantially improve the efficiency and accessibility of mobile
app testing, without compromising safety. The resulting
solutions have been adopted by
ByteDance and
Google Android Emulator.
♨
Paper in
MobiCom'23 |
|
¡î
Nationwide Study
on Mobile Access
Bandwidth
|
✎
By
working with a major mobile bandwidth testing app (UUSpeedTest)
to deeply analyze mobile access bandwidths of 3.54M end
users in China, we reveal the root causes of the critical
performance gaps between the advertised 4G/5G/WiFi access
bandwidth and what is actually delivered in the wild. Based on
the analysis insights, we build an ultra-fast, ultra-light
bandwidth testing service (Swiftest)
to dramatically reduce the test time (from 10 seconds to ~1
second) and backend cost.
♨
Paper
in
SIGCOMM'22 |
|
¡î
Fast and Light Bandwidth Testing for Internet Users
|
✎ Internet bandwidth testing services (BTSes) measure the access
bandwidth of end hosts, which are however oftentimes slow and
costly. To make BTS fast and cheap while maintaining high
accuracy, we develop
FastBTS through a novel statistical sampling framework
termed fuzzy rejection sampling. With only 30 test
servers, FastBTS achieves the same level of accuracy compared to
the state-of-the-art BTS (SpeedTest.net) that deploys 12,000+
servers. Most importantly, FastBTS makes bandwidth tests 5.6¡Á
faster and 10.7¡Á more data-efficient.
♨
Papers in
NSDI'21
and
TON'23 |
|
¡î
Xiaomi MVNO
|
✎ Mobile virtual network operators (MVNOs) operate on top of
existing cellular infrastructures of base carriers, while
offering cheaper or more flexible data plans. We present a
two-year measurement study towards understanding various key
aspects of today's MVNO ecosystem, based on
Xiaomi MVNO with 1M
customers. Our measurements clarify several key concerns raised
by MVNO customers. We also leverage big data analytics and
machine learning to optimize an MVNO's key businesses such as
data plan reselling and customer churn mitigation. ♨
Papers in
MobiSys'19
and
TON'20 |
|
¡î
Mobile Gaming on PCs with Direct
Android Emulation
|
✎ Efficiently emulating heavy 3D Android games on Windows PCs
has long been desired while highly challenging. This paper
presents DAOW, a commercial
game-oriented Android emulator that implements the idea of
direct Android emulation, which directly executes Android
app binaries on top of x86-based Windows PCs without employing
virtualization techniques. It has been widely adopted by 50M+ PC
users to run thousands of heavy 3D Android games, achieving
almost the same level of smoothness as being played natively on
Android smartphones.
♨
Paper and
Best Demo Award
in
MobiCom'19 |
|
¡î
APIChecker
|
✎ By examining apps' API invocations, we build APIChecker, an
machine learning-based mobile malware detection system. The key
to successfully developing such systems is manifold: 1) feature
selection, 2) feature engineering, 3) app analysis speed, 4)
developer engagement, and 5) model evolution. Failure in any of
the above aspects would lead to "wooden barrel effect" of the
entire system. APIChecker is adopted by
Tencent Android App Store
to efficiently check 10K newly submitted apps per day. ♨
Papers in
EuroSys'20
and TPDS'21 |
|
¡î
OverlayChecker
|
✎ As a key UI feature of Android, overlay enables one app to
draw over other apps by creating an extra View layer on top of
the host View. While greatly facilitating user interactions with
multiple apps at the same time, it is often exploited by
malicious apps (malware) to attack users. To combat this threat,
we build OverlayChecker to automatically detect overlay-based
malware at market scales. It is adopted by
Tencent Android App Store
to efficiently check 10K newly submitted apps per day. ♨
Papers in
MobiSys'19
and
TMC'22 |
|
¡î
HoneyCloud
|
✎ Linux-based IoT devices have emerged as one primary target of
today's cyber attacks. To understand realistic attacks on such
devices at scale in the wild, we deploy 4 hardware IoT honeypots
and 108 specially designed software IoT honeypots across eight
mainstream public clouds. The measurement study leads to
multi-fold insights towards actionable defense strategies that
can be adopted by IoT vendors and end users. ♨
Paper in
MobiSys'19 |
|
¡î
UFC2: User-Friendly
Collaborative Cloud
|
✎ Mainstream cloud storage services all support collaborative
file editing, which however often leads to poor experiences. We
unravel a number of their design and implementation issues as
the root causes. Thereby, we reconsider their collaboration
support from a novel perspective of operations without
using any locks. To enable this, we design intelligent
approaches to the inference and transformation of users' editing
operations, as well as optimizations to the maintenance of
files' historic versions. We build an open-source system
UFC2 to embody our design,
which can avoid most conflicts with little overhead. ♨
Papers in
FAST'20
and
TPDS'22 |
|
¡î Web-based Delta Sync for
Cloud Storage Services
|
✎ Striving towards a practical web-based delta synchronization
solution for cloud storage services (e.g., Dropbox). Leveraging
a series of state-of-the-art web techniques (JavaScript,
WebSocket, HTML5 File APIs, and asm.js) and cloud techniques
(virtual machine, object storage, and node.js), we have
developed an open-source prototype solution
WebR2sync+, and a
usable online system
THU-CloudDisk that has integrated our solution. ♨
Papers in
FAST'18 and
TOS'22 |
|
¡î
Nationwide
Census on WiFi Security Threats
|
✎ By deploying a crowdsourced WiFi security checking system on
14 million mobile devices, we characterize a wide variety of
real-world WiFi security threats at an unprecedented scale,
involving 19 million WiFi access points (APs) mostly located in
China. Surprisingly, we detect attacks at around 4% of all WiFi
APs, uncover that most WiFi attacks are driven by an underground
economy, and provide strong evidence of web analytics platforms
being the bottleneck of its monetization chain. Our system has
been adopted by WiFi.com.
♨
Paper in
MobiCom'21 |
|
¡î
FBS-Radar: Uncovering Fake Base
Stations at Scale in the Wild
|
✎ Yet another key component of
Baidu PhoneGuard,
FBS-Radar precisely identifies malicious fake base
stations (FBSes) and accurately geolocates FBSes based on
crowdsourced data of nearly 100 million users. It protects users
from millions of spam and fraud SMS messages per day, and has
helped the Ministry of Public Security of China arrest hundreds
of FBS operators. Also, we have made a
public website that shows the current locations of detected
FBSes in real time. ♨
Paper in
NDSS'17 |
|
¡î
Cross-App Cellular Traffic Optimization with Baidu TrafficGuard
|
✎ As a key component of
Baidu PhoneGuard,
Baidu TrafficGuard is a third-party mobile traffic proxy
widely deployed for Android 4.0+ devices in China. It
effectively reduces cellular traffic using a network-layer VPN
that connects a client-side proxy to a centralized traffic
processing cloud. Most importantly, it works transparently
across heterogeneous applications, so is not constrained
to any specific app. ♨
Paper in
NSDI'16 |
|
¡ï
Cloud Download (or says Offline Download)
|
✎
Achieving high-quality content distribution (especially for
unpopular videos) by using cloud utilities as a cache to guarantee the data
health and enhance the data transfer rate.
Please try the large-scale
Xuanfeng Cloud Download
system! ♨
Papers in
ACM-MM'11
and
JCST'15 |
|
¡ï
Cloud Tracker
(or says Open-P2SP
content distribution)
|
✎ Dynamically tracking and integrating various third-party servers,
contents and data transfer protocols all over the Internet into
a large, open and federated P2SP (peer-to-server/peer) platform,
so as to accelerate the content distribution from servers to
clients and among peer swarms. It also facilitates
load-balancing among the involved servers.
Please try the large-scale
Xunlei or
Xuanfeng
system! ♨
Papers in
ACM-MM'11
and
TPDS'13 |
|