Arrow Research search

Author name cluster

Anup Basu

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

25 papers
2 author rows

Possible papers

25

AAAI Conference 2026 Conference Paper

StyleFM: Frequency Manipulation Empowered by Recursive Attention on Diffusion Models for Arbitrary Style Transfer

  • Yingnan Ma
  • Zhenye Liu
  • Siying Liu
  • Anup Basu

Given the remarkable performance of diffusion models in image generation, recent research has been exploring their adaptation to style transfer. However, current diffusion-based approaches encounter persistent challenges, such as style distortions and the reliance on textual prompts for content preservation. To address these limitations, we introduce StyleFM, a novel training-free diffusion-based style transfer approach that incorporates optimization strategies into both the frequency and temporal domains. The proposed method provides two core innovations: (1) Tripartite Frequency Manipulation: To more precisely tailor frequency manipulation, StyleFM introduces a tripartite frequency design with a buffer band accounting for the overlap of content and style representations. In addition, StyleFM designs a frequency superposition editing method to achieve frequency enhancement. (2) Recursive Attention: StyleFM proposes the recursive attention strategy within the diffusion process, which facilitates the progressive and consistent injection of style information throughout the temporal process without reliance on text guidance. Experiments demonstrate that StyleFM outperforms state-of-the-art methods. It effectively preserves content fidelity while achieving sufficient style embedding.

EAAI Journal 2025 Journal Article

Block information strategy for multi-modal remote sensing image registration

  • Yameng Hong
  • Chengcai Leng
  • Beihua Liu
  • Jinye Peng
  • Irene Cheng
  • Anup Basu

Registration of multi-modal remote sensing image pairs (MRSI) is challenging given the distinct imaging mechanisms of multi-modal data sources, which lead to substantial geometric and radiometric distortions and inaccuracies in correspondences. To tackle this issue, we propose a novel approach that integrates local image information into feature representations through the design of local regions and the extraction of local information. The latter comprises of two key components: rank-based feature redistribution and residual information extraction utilizing a pyramid-like structure of local patches. This enhanced feature representation technique, termed Reinforced Local Information of LSS (RLILSS), embeds local information to improve the performance of the Local Self-Similarity (LSS)-based framework for MRSI registration. RLILSS strengthens feature characterization across various regions and addresses the limitations of supplementary information. This enables more reliable correspondences between images. Experimental results show that the proposed method achieves higher accuracy and better registration across diverse multi-modal datasets. Detailed analyses confirm its superiority over state-of-the-art methods in both accuracy and robustness. This approach holds significant potential for applications in automatic geographic registration and disaster area reconstruction.

EAAI Journal 2025 Journal Article

Dual graph-regularized low-rank representation for hyperspectral image denoising

  • Chengcai Leng
  • Mingpei Tang
  • Zhao Pei
  • Jinye Peng
  • Anup Basu

Hyperspectral images have a wide range of applications in many fields. However, when hyperspectral images are captured by spectrometers, there is inevitably considerable noise, which affects subsequent research. In recent years, many hyperspectral image denoising methods based on low-rank representations have been proposed. Artificial intelligence denoising methods are also popular. However, the research on multi noise denoising is rarely mentioned, and most literatures only focus on one noise in hyperspectral images. Thus, we propose a denoising model for hyperspectral image based on dual graph-regularized low-rank representation, which can not only reduce multiple types of noise simultaneously, but also preserves details of the original image. In particular, this is the first time that the dual low-rank representation and dual graph regularizations are used on hyperspectral images. We solve this method using the linearized alternating direction method with adaptive penalty. Finally, we conduct experiments on simulated and real data sets to verify the effectiveness of our method. The experimental results show that our method can not only effectively remove a variety of mixed noises, but also well retain the details of the image.

EAAI Journal 2025 Journal Article

Orthogonal Diversity Nonnegative Matrix Factorization for multi-view clustering

  • Xinling Zhang
  • Chengcai Leng
  • Jinye Peng
  • Irene Cheng
  • Anup Basu

In the context of rapid development of artificial intelligence, how to extract valuable information from complex multidimensional data has become a core research problem. Multi-view clustering methods based on non-negative matrix factorization (NMF) are widely used in multi-view data analysis, but still face many challenges in practical applications. Current multi-view clustering methods usually solve the problem of diversity among viewpoints by orthogonalization of view representations. However, they fail to fully utilize the rich features of each viewpoint because data from different viewpoints may be interrelated. In addition, existing methods fail to fully consider the orthogonality between base matrices while emphasizing the diversity of view representations. For this reason, this paper proposes a new orthogonal diversity non-negative matrix factorization method (ODNMF). First, ODNMF explores the orthogonality of the representations of sample pairs between different viewpoints. This approach preserves the characteristics of each perspective and enhances the diversity of data representations. Second, ODNMF orthogonalizes the basis matrix of each viewpoint to reduce redundant features and enhance data interpretability and representation. Finally, ODNMF introduces graph regularization for each view to reveal the intrinsic geometric and structural information of features. Experimental results show that ODNMF significantly outperforms existing state-of-the-art algorithms on seven datasets.

EAAI Journal 2024 Journal Article

Bayesian non-negative matrix factorization with Student’s t-distribution for outlier removal and data clustering

  • Ruixue Yuan
  • Chengcai Leng
  • Shuang Zhang
  • Jinye Peng
  • Anup Basu

Non-negative Matrix Factorization (NMF) is an effective way to solve the redundancy of non-negative high-dimensional data. Most of the traditional probability-based NMF methods use Gaussian distribution to model the differences between the matrices before and after decomposition. However, the Gaussian distribution is strongly affected by outliers, and it may not fit all datasets accurately when there are no outliers in the data. In this article, we propose a novel Bayesian NMF with the Student’s t-distribution, i. e. , TNMF. specifically, in order to reduce the impact of outliers on the algorithm, we use the Student’s t-distribution to fit the data points instead of the Gaussian distribution. In addition, it is possible to adjust the Degree of Freedom (DF) to make the Student’s t-distribution more flexible than the Gaussian distribution to fit data points when there are no outliers. Next, we combine the Automatic Relevance Determination (ARD) prior in our algorithm to simplify the model and allow for better performance of the algorithm. Finally, the article used 10 datasets to design two kinds of experiments, outlier removal and data clustering. The outlier removal results of this proposed algorithm are significantly better than the other methods, and it performs better in clustering compared to the other methods in the majority of cases.

EAAI Journal 2024 Journal Article

Feature matching based on Gaussian kernel convolution and minimum relative motion

  • Kun Wang
  • Chengcai Leng
  • Huaiping Yan
  • Jinye Peng
  • Zhao Pei
  • Anup Basu

Feature matching is a necessary and important step for remote sensing image registration, intended to establish reliable point correspondences between two sets of features. In this paper, we propose a feature registration model based on local relative motion, which combines Gaussian kernel convolution with relative motion (GRM) vector to obtain better results by removing wrong matches and improving the inlier point accuracy. We first establish putative matching based on the similarity between local descriptors. Then, the preliminary hypothetical matching point set is filtered using consistency with nearest neighbors among the inlier points to obtain a more accurate motion vector, and to fit the real motion vector through the Gaussian convolution kernel. Finally, we find the displacement between the fitted motion vector and the matching generated motion vector. And combine the displacement with the optimization model to find the inlier point set. Experimental results show that our GRM method outperforms related work, achieving better matching results.

EAAI Journal 2024 Journal Article

Incremental semi-supervised graph learning NMF with block-diagonal

  • Xue Lv
  • Chengcai Leng
  • Jinye Peng
  • Zhao Pei
  • Irene Cheng
  • Anup Basu

Non-negative matrix factorization (NMF), as a good data dimensionality reduction method, is widely used in the field of image recognition. Incremental non-negative matrix factorization (INMF) as an improvement solves the problem of inefficiency caused by repeated running of data samples during online learning. However, in the traditional incremental non-negative matrix factorization algorithm, the newly added training samples do not contain label information. Some samples may be marked in both the initial sample and the new training sample in the real application scenario. In order to make full use of the label information carried by the dataset, in this paper, we propose a semi-supervised non-negative matrix factorization model for batch incremental data, incremental semi-supervised graph learning NMF with block diagonal (ISGDNMF). The model is divided into three cases according to the degree of label-carrying for the new batch data: all label-carrying, no label-carrying, and partial label-carrying. The label information is also used to add a diagonal structure to the coefficient matrix, which makes it possible to have stronger discriminatory ability and to distinguish different classes of images more easily. And graph regularization information is added in order to maintain the spatial-geometric structure of the data. Experiments on six image datasets show that this algorithm has superior performance relative to the other seven NMF-based algorithms.

EAAI Journal 2023 Journal Article

β -divergence NMF with biorthogonal regularization for data representation

  • Ruixue Yuan
  • Chengcai Leng
  • Bing Li
  • Anup Basu

Non-Negative Matrix Factorization (NMF) has become a commonly used method for data representation. Orthogonal NMF improves the clustering performance by adding orthogonal constraints to the decomposed matrices. The existing orthogonal NMF methods typically use Euclidean distance to measure the difference between before and after factorization for convenience and simplicity. However, limitations of the Euclidean distance can lead to inflexibilities. In addition, failure to consider orthogonality of the decomposed features and sparsity of the data representation can also lead to degraded performance of the algorithm. In order to overcome the above shortcomings, we propose a novel β -divergence-based NMF with biorthogonal regularization (BO- β NMF). Our BO- β NMF method uses generalized β -divergence instead of Euclidean distance to measure the similarity between matrices, and selects an appropriate β for each type of data to obtain a more flexible way of measuring similarity. In addition, we also incorporate biorthogonal constraints into the minimized objective function, which ensures both orthogonality of the decomposed features and sparsity of the data representation. Furthermore, we use trace rather than Euclidean distance to measure the orthogonality of the decomposed matrices, which reduces execution time. Finally, clustering experiments on image datasets show that the overall clustering effect of BO- β NMF is better than state-of-the-art methods.

YNICL Journal 2015 Journal Article

Stochastic process for white matter injury detection in preterm neonates

  • Irene Cheng
  • Steven P. Miller
  • Emma G. Duerden
  • Kaiyu Sun
  • Vann Chau
  • Elysia Adams
  • Kenneth J. Poskitt
  • Helen M. Branson

Preterm births are rising in Canada and worldwide. As clinicians strive to identify preterm neonates at greatest risk of significant developmental or motor problems, accurate predictive tools are required. Infants at highest risk will be able to receive early developmental interventions, and will also enable clinicians to implement and evaluate new methods to improve outcomes. While severe white matter injury (WMI) is associated with adverse developmental outcome, more subtle injuries are difficult to identify and the association with later impairments remains unknown. Thus, our goal was to develop an automated method for detection and visualization of brain abnormalities in MR images acquired in very preterm born neonates. We have developed a technique to detect WMI in T1-weighted images acquired in 177 very preterm born infants (24-32 weeks gestation). Our approach uses a stochastic process that estimates the likelihood of intensity variations in nearby pixels; with small variations being more likely than large variations. We first detect the boundaries between normal and injured regions of the white matter. Following this we use a measure of pixel similarity to identify WMI regions. Our algorithm is able to detect WMI in all of the images in the ground truth dataset with some false positives in situations where the white matter region is not segmented accurately.

IROS Conference 2005 Conference Paper

Visual gesture recognition for ground air traffic control using the Radon transform

  • Meghna Singh
  • Mrinal Mandal 0001
  • Anup Basu

Human gesture recognition is an active topic of vision research which has applications in diverse fields such as collaborative virtual environments and robot teleoperation. We propose a novel method for the recognition of hand gestures, used by air marshals for steering aircraft on the runway, using the Radon transform. Various aspects of the algorithm, including acquisition, segmentation, labeling and recognition using the parametric Radon transform are addressed in this paper. A binary skeleton representation of the human subject is computed. The Radon transform is used to generate maxima corresponding to specific orientations of the skeletal representation. Feature vectors are extracted from the transform space by computing the normalized cumulative projections of the Radon transform on the angle axis. K-means clustering is then applied to recognize static gestures from the extracted features. This technique has the potential to provide information about the exact orientation of gesture segments and can find use in ground control of unmanned air vehicles. Experiments with image data corresponding to the various ground air traffic control gestures used in directing aircrafts, highlight the potential application of this approach.

ICRA Conference 1999 Conference Paper

Panoramic Video with Predictive Windows for Telepresence Applications

  • Jonathan Baldwin
  • Anup Basu
  • Hong Zhang 0013

We describe the application of a predictive Kalman filter to the display of panoramic images. We discuss integrating a panoramic imaging system with prediction of viewing direction to create an effective telepresence system over low bandwidth links. Panoramic imaging using a reflective mirror surface offers an alternative to pan-tilt systems for obtaining a 360 degree field of view. Selecting a small window within a panoramic image allows a meaningful part of an image from a remote site to be seen at a higher refresh rate. Because of the delay in transmitting an image from a remote site, it is necessary to have additional image information available locally. This information can be used to simulate continuously flowing pictures with reduced apparent delay. Continuity in image viewing is achieved by predicting the next viewpoint of an operator and preemptively transmitting parts of an image. Experimental results are given to evaluate the proposed telepresence system.

ICRA Conference 1998 Conference Paper

Predictive Windows for Delay Compensation in Telepresence Applications

  • Jonathan Baldwin
  • Anup Basu
  • Hong Zhang 0013

Predictive Kalman filters can be used to predict positions of a mouse when it is operated under some basic assumptions. This prediction can be used to estimate what portions of a larger image an operator wants to view. This paper discusses theory and experimentation being done at the University of Alberta using predictive Kalman filters to provide predictive windows for low bandwidth telepresence applications. We compare several state models used in prediction with each other and also with having no prediction, both with numerical measures, and on human subjects. We show that the constant velocity model provides the best prediction results.

ICRA Conference 1996 Conference Paper

A conical mirror pipeline inspection system

  • David Southwell
  • Basil Vandegriend
  • Anup Basu

Presents a novel system that provides high quality imaging of the interior surface of pipelines. The imaging device uses a refined conical mirror micro-surface machined from aluminum to image a 360/spl deg/ strip of the pipe in a single frame. A continuous stream of such images is transmitted to the surface where it is stored on a standard video recorder. The image sequences are processed off-line, ultimately producing high definition imagery of areas of interest. We address the design issues of the conic profile of the mirror, and some aspects of the image reconstruction/registration process.

ICRA Conference 1995 Conference Paper

Active Camera Calibration Using Pan, Tilt and Roll

  • Anup Basu
  • Kavita Ravi

Three dimensional vision applications, such as robot vision, require modelling of the relationship between the 2D images and the 3D world. Camera calibration is a process which accurately models this relationship. The calibration procedure determines the geometric parameters of the camera, such as focal length and center of the image. Most of the existing calibration techniques use predefined patterns and a static camera. Recently, A. Basu (1993) developed a novel calibration technique for computing the focal length and image center which uses an active camera. This technique does not require any predefined patterns or point to point correspondence between images-only a set of scenes with some stable edges. It was observed that the algorithms developed for image center are sensitive to noise and hence unreliable in real situations. The article extends the techniques provided by Basu to develop a simpler, yet more robust method for computing the image center.

IROS Conference 1995 Conference Paper

An active technique for piecewise calibration of robot manipulators

  • Kavita Ravi
  • Anup Basu

Robot calibration is essential to improve the positioning accuracy of robot manipulators. A mathematical (kinematic) model is used to describe the geometric structure of a robot manipulator. Robot calibration procedure involves calculating and improving the values of this model's parameters. The robot calibration technique presented in this paper uses a vision system, to calibrate the kinematic model. For an n-linked robot manipulator, the procedure calibrates one link at a time, starting with the n/sup th/ link by making small movements. The resulting equations are linear, consequently, the algorithms are simple.

ICRA Conference 1995 Conference Paper

Robust Detection of Moving Objects by a Moving Observer on Planar Surfaces

  • Ashraf Elnagar
  • Anup Basu

We introduce a technique for detecting moving objects from an image sequence obtained with a moving camera using the planarity constraint. To increase the robustness of this technique, false motion caused by inaccuracies in sensor readings is eliminated by use of a morphological filter. This involves two successive operations-erosion and dilation-performed on a motion compensated image. Experimental results with real images are presented. Applications to the compression of moving images are now being investigated.

ICRA Conference 1995 Conference Paper

Surface Integration for Inspection Tasks

  • Anup Basu
  • Ashraf Elnagar
  • Mark Fiala

In underwater environments it is often difficult to obtain a big/clear picture of a scene. For that reason, a system that can integrate small pieces of images (taken from close range) into a composite 3D surface, is developed here. The device, along with 3D position/orientation estimation equipment, can be used for inspection of hulls of ships anchored in a bay, or for examination of underwater pipes and tanks. Experimental results are presented which validate the algorithms developed.

ICRA Conference 1994 Conference Paper

Crack Detection Using Contact Sensing

  • Raju Patil
  • Anup Basu
  • Hong Zhang 0013

In this paper, we describe a robotic system employing contact sensing to detect the presence of cracks in surfaces. Automated detection of cracks in surfaces has many practical applications. Tasks such as inspection of underground pipes carrying fluids and the inspection of ship hulls are examples of jobs that are difficult for human operators. Robots equipped with vision are ill-suited for such tasks because of low visibility and fluid disturbance. Contact sensing is shown to provide a viable alternative for surface inspection tasks in situations where images cannot be obtained. The mathematical modeling of the system, some simulations and experimental results are provided. >

IROS Conference 1993 Conference Paper

Active tracking

  • Don Murray
  • Anup Basu

This work describes a method for real-time motion detection using an active camera mounted on a pan/tilt platform. Image mapping is used to align images of different viewpoints so that static camera motion detection can be applied. In the presence of camera position noise, the image mapping is inexact and compensation techniques fail. The use of morphological filtering of motion images is explored to desensitize the detection of algorithm in inaccuracies in background compensation. Two motion detection techniques are examined, and experiments to verify the methods are presented. The system successfully extracts moving edges from dynamic images, even when the pan/tilt angles between successive frames are as large as 3/spl deg/.

IROS Conference 1993 Conference Paper

Modeling fish-eye lenses

  • Anup Basu
  • Sergio Licardie

The human visual system can be characterized as a variable-resolution system: foveal information is processed at very high spatial resolution whereas peripheral information is processed at low spatial resolution. Various transforms have been proposed to model spatially varying resolution. Unfortunately, special sensors need to be designed to acquire images according to existing transforms. In this work, two models of fish-eye transform are presented. The validity of the transformations is demonstrated by fitting the alternative models to a real fish-eye lens.

IROS Conference 1993 Conference Paper

Smooth and acceleration minimizing trajectories for mobile robots

  • Ashraf Elnagar
  • Anup Basu

An approach to generating smooth piecewise local trajectories for mobile robots is proposed. Given the configurations (position and direction) of two points, one searches for the trajectory that minimizes the integral of acceleration (tangential and normal). The resulting trajectory should not only be smooth but also safe in order to be applicable in real-life situations, so the authors investigate two different obstacle-avoidance constraints that satisfy the minimization problem. Unfortunately, in this case the problem becomes more complex and unsuitable for real-time implementations. Therefore, the authors introduce two simple solutions, based on the idea of polynomial fitting to generate safe trajectories, once a collision is detected with the original smooth trajectory. Simulation results for the different algorithms are presented.

ICRA Conference 1992 Conference Paper

Heuristics for local path planning

  • Ashraf Elnagar
  • Anup Basu

The authors describe a heuristic technique for solving the problem of path planning based on local information for a mobile robot with acceleration constraints moving amidst a set of stationary obstacles. The concept of safety is introduced to design a planning strategy. A path which maximizes the product of safety based on local information and attraction towards the goal is chosen. The safety function depends on the acceleration bounds. The attraction toward the goal depends on the distance from the goal. Two additional heuristics are proposed to improve the efficiency of the search process and to enhance the ability of the robot to avoid obstacles. Some simulation examples of the algorithm corresponding to different navigational environments are discussed. >

ICRA Conference 1990 Conference Paper

Approximate constrained motion planning

  • Anup Basu
  • Yiannis Aloimonos

The problem of finding a collision-free path connecting two points (start and goal) in the presence of obstacles, with constraints on the curvature of the path, is examined. This problem of curvature-constrained motion planning arises when, for example, a vehicle with constraints on its steering mechanism needs to be maneuvered through obstacles. Though no lower bound on the difficulty of the problem in 2-D is known, exact algorithms given to date for the reachability questions are exponential. It is shown that a variation of the problem is NP-hard. Notably, however, the same variation to polynomially solvable motion planning problems does not make them intractable. In addition, it is proven that epsilon -approximations to this problem cannot exist unless the underlying decision problem is polynomially solvable. An algorithm which is expected to find a desired path, when one exists, with a required probability is presented. Results indicate that a variable-size discretization is necessary for the task, linking the required probability to the size of the discretization locally. >