Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory ...
Abstract: This work addresses an energy-minimized deadline-constrained task scheduling problem in human-cyber-physical systems. It consists of three subproblems: processor allocation, task sequencing, ...
Abstract: This paper presents a novel neural network-based optimization framework, NNDE, to solve the traveling salesman problem (TSP). The core idea is to use a radial basis function network (RBFN) ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results