Author name cluster

Adrian Weller

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

91 papers

2 author rows

JBHI Journal 2025 Journal Article

Ankle Kinematics Estimation Using Artificial Neural Network and Multimodal IMU Data

Lefan Wang
Pingfan Song
Thomas Stone
Adrian Weller
Sebastian W. Pattinson

Inertial measurement units (IMUs) have become attractive for monitoring joint kinematics due to their portability and versatility. However, their limited accuracy, inability to analyze data in real-time, and complex data fusion algorithms requiring precise sensor-to-segment calibrations hinder their clinical and daily use. This paper introduces KEEN (KinEmatics Estimation Network), an innovative framework that exploits lightweight artificial neural networks (ANNs) to provide real-time predictions of multi-plane ankle kinematics using a minimal number of IMUs, without calibration requirements. Five ANN algorithms were developed and evaluated using 42 inputs derived from four IMUs in both intra-subject and inter-subject tasks. Extensive experimental results yielded exciting findings: even a single IMU located at the heel can provide clinically acceptable estimations of ankle kinematics, implying significant potential for cost and energy savings. Statistical analysis demonstrated the superiority of the developed Long Short-Term Memory (LSTM) network over the other models in intra-subject tasks, achieving impressive accuracy (RMSE: 1. 88 $\mathrm{^{\circ }}$ $\pm$ 0. 02 $\mathrm{^{\circ }}$, MAE: 1. 41 $\mathrm{^{\circ }}$ $\pm$ 0. 01 $\mathrm{^{\circ }}$, and r2 score: 0. 93 $\pm$ 0. 01), indicating strong generalization within the same subject. In inter-subject tasks, the convolutional neural network (CNN) and the CNN-LSTM models showed comparable performance but statistically outperformed the other models in terms of estimation accuracy across various inputs. When using a single IMU, the CNN model achieved the lowest error (RMSE: 4. 13 $\mathrm{^{\circ }}$ $\pm$ 0. 55 $\mathrm{^{\circ }}$, MAE: 3. 33 $\mathrm{^{\circ }}$ $\pm$ 0. 48 $\mathrm{^{\circ }}$, and r2 score: 0. 50 $\pm$ 0. 21), showcasing its effective generalization to new subjects. Furthermore, deploying the CNN into a microcontroller, with a sinlge IMU at the heel, resulted in promising real-time ankle kinematics estimations (RMSE: 3. 34 $\mathrm{^{\circ }}$ $\pm$ 0. 48 $\mathrm{^{\circ }}$, MAE: 2. 68 $\mathrm{^{\circ }}$ $\pm$ 0. 46 $\mathrm{^{\circ }}$ and r2 score: 0. 63 $\pm$ 0. 07). Overall, this research highlights the potential of combining IMUs with ANNs as reliable and practical tools for early prevention and rehabilitation of ankle injuries.

Details DOI

ICLR Conference 2025 Conference Paper

Can Large Language Models Understand Symbolic Graphics Programs?

Zeju Qiu
Weiyang Liu
Haiwen Feng
Zhen Liu 0019
Tim Z. Xiao
Katherine M. Collins
Joshua B. Tenenbaum
Adrian Weller

Against the backdrop of enthusiasm for large language models (LLMs), there is a growing need to scientifically assess their capabilities and shortcomings. This is nontrivial in part because it is difficult to find tasks which the models have not encountered during training. Utilizing symbolic graphics programs, we propose a domain well-suited to test multiple spatial-semantic reasoning skills of LLMs. Popular in computer graphics, these programs procedurally generate visual data. While LLMs exhibit impressive skills in general program synthesis and analysis, symbolic graphics programs offer a new layer of evaluation: they allow us to test an LLM's ability to answer semantic questions about the images or 3D geometries without a vision encoder. To semantically understand the symbolic programs, LLMs would need to possess the ability to "imagine" and reason how the corresponding graphics content would look with only the symbolic description of the local curvatures and strokes. We use this task to evaluate LLMs by creating a large benchmark for the semantic visual understanding of symbolic graphics programs, built procedurally with minimal human effort. Particular emphasis is placed on transformations of images that leave the image level semantics invariant while introducing significant changes to the underlying program. We evaluate commercial and open-source LLMs on our benchmark to assess their ability to reason about visual output of programs, finding that LLMs considered stronger at reasoning generally perform better. Lastly, we introduce a novel method to improve this ability -- Symbolic Instruction Tuning (SIT), in which the LLM is finetuned with pre-collected instruction data on symbolic graphics programs. Interestingly, we find that SIT not only improves LLM's understanding on symbolic programs, but it also improves general reasoning ability on various other benchmarks.