233 results on '"Alternating least squares"'
Search Results
2. Hybrid Movie Recommendation System With User Partitioning and Log Likelihood Content Comparison
- Author
-
Yongmao Yang, Kampol Woradit, and Kenneth Cosh
- Subjects
Recommendation system ,log-likelihood ,content-based ,collaborative filtering ,alternating least squares ,particle swarm optimization ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
fIn the domain of recommendation systems, matrix decomposition is an effective strategy for mitigating issues related to sparsity and low space utilization. The Alternating Least Squares (ALS) method, in particular, stands out for its ability to process data in parallel, thereby enhancing computational efficiency. However, when dealing with an original rating matrix, the ALS method may inadvertently sacrifice some information, leading to increased error rates. To address these challenges, this paper proposes a novel hybrid model that integrates matrix factorization with additional features. Furthermore, it leverages weighted similarity measures and employs advanced log-likelihood text mining techniques. These innovations are designed to tackle cold-start problems and sparsity issues while compensating for information loss to mitigate errors. Under the premise that our model employs consistent evaluation metrics and datasets, comparative analysis against existing models from related literature demonstrates superior performance. Specifically, our model achieves a lower Root Mean Square Error (RMSE) of 0.82 and 0.88, alongside a higher F1 score of 0.94 and 0.92 in two datasets. Our proposed hybrid approach effectively addresses sparsity and mitigates information loss in matrix factorization, as demonstrated by these results.
- Published
- 2025
- Full Text
- View/download PDF
3. SVD-based algorithms for tensor wheel decomposition.
- Author
-
Wang, Mengyu, Cui, Honghua, and Li, Hanyu
- Abstract
Tensor wheel (TW) decomposition combines the popular tensor ring and fully connected tensor network decompositions and has achieved excellent performance in tensor completion problem. A standard method to compute this decomposition is the alternating least squares (ALS). However, it usually suffers from slow convergence and numerical instability. In this work, the fast and robust SVD-based algorithms are investigated. Based on a result on TW-ranks, we first propose a deterministic algorithm that can estimate the TW decomposition of the target tensor under a controllable accuracy. Then, the randomized versions of this algorithm are presented, which can be divided into two categories and allow various types of sketching. Numerical results on synthetic and real data show that our algorithms have much better performance than the ALS-based method and are also quite robust. In addition, with one SVD-based algorithm, we also numerically explore the variability of TW decomposition with respect to TW-ranks and the comparisons between TW decomposition and other famous formats in terms of the performance on approximation and compression. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Information from Noise: Measuring Dyslexia Risk Using Rasch-like Matrix Factorization with a Procedure for Equating Instruments.
- Author
-
Moulton, Mark H. and Eide, Brock L.
- Subjects
- *
MATRIX decomposition , *DYSLEXIA , *UNCERTAINTY (Information theory) , *RECEIVER operating characteristic curves , *RASCH models - Abstract
This study examines the psychometric properties of a screening protocol for dyslexia and demonstrates a special form of matrix factorization called Nous based on the Alternating Least Squares algorithm. Dyslexia presents an intrinsically multidimensional complex of cognitive loads. By building and enforcing a common 6-dimensional space, Nous extracts a multidimensional signal for each person and item from test data that increases the Shannon entropy of the dataset while at the same time being constrained to meet the special objectivity requirements of the Rasch model. The resulting Dyslexia Risk Scale (DRS) yields linear equal-interval measures that are comparable regardless of the subset of items taken by the examinee. Each measure and cell estimate is accompanied by an efficiently calculated standard error. By incorporating examinee age into the calibration process, the DRS can be generalized to all age groups to allow the tracking of individual dyslexia risk over time. The methodology was implemented using a 2019 calibration sample of 828 persons aged 7 to 82 with varying degrees of dyslexia risk. The analysis yielded high reliability (0.95) and excellent receiver operating characteristics (AUC = 0.96). The analysis is accompanied by a discussion of the information-theoretic properties of matrix factorization. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
5. Blind signal separation for coprime planar arrays: An improved coupled trilinear decomposition method
- Author
-
Zhongyuan Que, Xiaofei Zhang, and Benzhou Jin
- Subjects
alternating least squares ,coprime planar array ,coupled canonical polyadic decomposition ,signal separation ,tensor decompositions ,Telecommunication ,TK5101-6720 ,Electronics ,TK7800-8360 - Abstract
In this study, the problem of blind signal separation for coprime planar arrays is investigated. For coprime planar arrays comprising two uniform rectangular subarrays, we link the signal separation to the tensor-based model called coupled canonical polyadic decomposition (CPD) and propose an improved coupled trilinear decomposition approach. The output data of coprime planar arrays are modeled as a coupled tensor set that can be further interpreted as a coupled CPD model, allowing a signal separation to be achieved using coupled trilinear alternating least squares (TALS). Furthermore, in the procedure of the coupled TALS, a Vandermonde structure enforcing approach is explicitly applied, which is shown to ensure fast convergence. The results of Monto Carlo simulations show that our proposed algorithm has the same separation accuracy as the basic coupled TALS but with a faster convergence speed.
- Published
- 2023
- Full Text
- View/download PDF
6. Blind signal separation for coprime planar arrays: An improved coupled trilinear decomposition method.
- Author
-
Que, Zhongyuan, Zhang, Xiaofei, and Jin, Benzhou
- Subjects
SIGNAL separation ,LEAST squares ,ARRAY processing ,BLIND source separation - Abstract
In this study, the problem of blind signal separation for coprime planar arrays is investigated. For coprime planar arrays comprising two uniform rectangular subarrays, we link the signal separation to the tensor‐based model called coupled canonical polyadic decomposition (CPD) and propose an improved coupled trilinear decomposition approach. The output data of coprime planar arrays are modeled as a coupled tensor set that can be further interpreted as a coupled CPD model, allowing a signal separation to be achieved using coupled trilinear alternating least squares (TALS). Furthermore, in the procedure of the coupled TALS, a Vandermonde structure enforcing approach is explicitly applied, which is shown to ensure fast convergence. The results of Monto Carlo simulations show that our proposed algorithm has the same separation accuracy as the basic coupled TALS but with a faster convergence speed. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. Multivariate Curve Resolution Alternating Least Squares Analysis of In Vivo Skin Raman Spectra.
- Author
-
Matveeva, Irina, Bratchenko, Ivan, Khristoforova, Yulia, Bratchenko, Lyudmila, Moryatov, Alexander, Kozlov, Sergey, Kaganov, Oleg, and Zakharov, Valery
- Subjects
- *
RAMAN spectroscopy , *LEAST squares , *BASAL cell carcinoma , *TISSUES , *MELANOMA , *VIBRATIONAL spectra - Abstract
In recent years, Raman spectroscopy has been used to study biological tissues. However, the analysis of experimental Raman spectra is still challenging, since the Raman spectra of most biological tissue components overlap significantly and it is difficult to separate individual components. New methods of analysis are needed that would allow for the decomposition of Raman spectra into components and the evaluation of their contribution. The aim of our work is to study the possibilities of the multivariate curve resolution alternating least squares (MCR-ALS) method for the analysis of skin tissues in vivo. We investigated the Raman spectra of human skin recorded using a portable conventional Raman spectroscopy setup. The MCR-ALS analysis was performed for the Raman spectra of normal skin, keratosis, basal cell carcinoma, malignant melanoma, and pigmented nevus. We obtained spectral profiles corresponding to the contribution of the optical system and skin components: melanin, proteins, lipids, water, etc. The obtained results show that the multivariate curve resolution alternating least squares analysis can provide new information on the biochemical profiles of skin tissues. Such information may be used in medical diagnostics to analyze Raman spectra with a low signal-to-noise ratio, as well as in various fields of science and industry for preprocessing Raman spectra to remove parasitic components. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Optical Coupler Network Modeling and Parameter Estimation Based on a Generalized Tucker Train Decomposition
- Author
-
Danilo S. Rocha, Francisco T. C. B. Magalhaes, Gerard Favier, Antonio S. B. Sombra, and Glendo F. Guimaraes
- Subjects
Alternating least squares ,multidimensional signal processing ,multilinear algebra ,optical arrays ,optical directional coupler ,optical fiber devices ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Tensor models have been used extensively in signal processing applications to design different types of communication systems. This paper proposes, for the first time, the use of tensor models for optical communications. The signals of an optical dual-core coupler network are modeled as a multiway array (tensor), which satisfies a generalized Tucker train decomposition. This tensor model is then used to develop an estimation algorithm that enables the network parameters to be estimated from the input and output signals. The performance of this algorithm was evaluated by means of computer simulations, in terms of NMSE of the estimated parameters and convergence speed. For the tested configurations, good levels of NMSE with fast convergence were obtained, demonstrating the effectiveness of the proposed method as a promising tool for studying and designing optical devices, with a wide range of applications in the context of lightwave systems.
- Published
- 2022
- Full Text
- View/download PDF
9. A stochastic algorithm for the ParaTuck decomposition.
- Author
-
Zniyed, Yassine and de Almeida, André L.F.
- Subjects
- *
LEAST squares , *SWAMPS , *ALGORITHMS - Abstract
This paper introduces a novel stochastic algorithm for the ParaTuck Decomposition (PTD), addressing the challenge of local minima encountered in the traditional alternating least squares (ALS) approach. The proposed method integrates stochastic steps into the ALS framework to avoid the common swamp problems, where numerical difficulties prevent accurate decompositions. Our simulations indicate good convergence properties for PTD, suggesting a potential increase in the efficiency and reliability of this tensor decomposition across various applications. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
10. Reverse-mode differentiation in arbitrary tensor network format: with application to supervised learning.
- Author
-
Gorodetsky, Alex A., Safta, Cosmin, and Jakeman, John D.
- Subjects
- *
SUPERVISED learning , *AUTOMATIC differentiation , *BENCHMARK problems (Computer science) , *CAPABILITIES approach (Social sciences) , *COMPUTATIONAL complexity - Abstract
This paper describes an effcient reverse-mode differentiation algorithm for contraction operations for arbitrary and unconventional tensor network topologies. The approach leverages the tensor contraction tree of Evenbly and Pfeifer (2014), which provides an instruction set for the contraction sequence of a network. We show that this tree can be effciently leveraged for differentiation of a full tensor network contraction using a recursive scheme that exploits (1) the bilinear property of contraction and (2) the property that trees have a single path from root to leaves. While differentiation of tensor-tensor contraction is already possible in most automatic differentiation packages, we show that exploiting these two additional properties in the specific context of contraction sequences can improve effciency. Following a description of the algorithm and computational complexity analysis, we investigate its utility for gradient-based supervised learning for low-rank function recovery and for fitting real-world unstructured datasets. We demonstrate improved performance over alternating least-squares optimization approaches and the capability to handle heterogeneous and arbitrary tensor network formats. When compared to alternating minimization algorithms, we find that the gradient-based approach requires a smaller oversampling ratio (number of samples compared to number model parameters) for recovery. This increased effciency extends to fitting unstructured data of varying dimensionality and when employing a variety of tensor network formats. Here, we show improved learning using the hierarchical Tucker method over the tensor-train in high-dimensional settings on a number of benchmark problems. [ABSTRACT FROM AUTHOR]
- Published
- 2022
11. ALSBMF: Predicting lncRNA-Disease Associations by Alternating Least Squares Based on Matrix Factorization
- Author
-
Wen Zhu, Kaimei Huang, Xiaofang Xiao, Bo Liao, Yuhua Yao, and Fang-Xiang Wu
- Subjects
Alternating least squares ,disease similarity ,lncRNA similarity ,leave-one-out cross validation ,matrix factorization ,ROC curve ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In recent years, it has been increasingly clear that long non-coding RNAs (lncRNAs) are able to regulate their target genes at multi-levels, including transcriptional level, translational level, etc and play key regulatory roles in many important biological processes, such as cell differentiation, chromatin remodeling and more. Inferring potential lncRNA-disease associations is essential to reveal the secrets behind diseases, develop novel drugs, and optimize personalized treatments. However, biological experiments to validate lncRNA-disease associations are very time-consuming and costly. Thus, it is critical to develop effective computational models. In this study, we have proposed a method by alternating least squares based on matrix factorization to predict lncRNA-disease associations, referred to as ALSBMF. ALSBMF first decomposes the known lncRNA-disease correlation matrix into two characteristic matrices, then defines the optimization function using disease semantic similarity, lncRNA functional similarity and known lncRNA-disease associations and solves two optimal feature matrices by least squares method. The two optimal feature matrices are finally multiplied to reconstruct the scoring matrix, filling the missing values of the original matrix to predict lncRNA-disease associations. Compared to existing methods, ALSBMF has the same advantages as BPLLDA. It does not require negative samples and can predict associations related to novel lncRNAs or novel diseases. In addition, this study performs leave-one-out cross-validation (LOOCV) and five-fold cross-validation to evaluate the prediction performance of ALSBMF. The AUCs are 0.9501 and 0.9215, respectively, which are better than the existing methods. Furthermore colon cancer, kidney cancer, and liver cancer are selected as case studies. The predicted top three colon cancer, kidney cancer, and liver cancer-related lncRNAs were validated in the latest LncRNADisease database and related literature. In order to test the ability of ALSBMF to predict novel disease-associated lncRNAs and new lncRNA-associated diseases, all known associations of diseases and lncRNAs were eliminated, the predicted top five breast cancer, nasopharyngeal carcinoma cancer-related lncRNAs and top five H19, MALAT1 lncRNA-related cancers were validated in PubMed and dbSNP.
- Published
- 2020
- Full Text
- View/download PDF
12. Matrix Polynomial Predictive Model: A New Approach to Accelerating the PARAFAC Decomposition
- Author
-
Ming Shi, Dan Li, and Jian Qiu Zhang
- Subjects
Alternating least squares ,matrix polynomial predictive model ,PARAFAC decomposition ,tensor ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Alternating least squares (ALS) and its variations are the most commonly used algorithms for the PARAFAC decomposition of a tensor. However, it is still troubled for one how to accelerate the ALS algorithm with the reduced computational complexity. In this paper, a new acceleration method for the ALS with a matrix polynomial predictive model (MPPM) is proposed. In the MPPM, a matrix-valued function is first approximated by a matrix polynomial. It is shown that the future value of the function can be predicted by an FIR filter with the coefficients determined offline. By viewing each factor matrix of a tensor as a matrix-valued function, a new ALS algorithm, the ALS-MPPM algorithm, is then given. Analyses show that our ALS-MPPM algorithm is of low computational complexity and a close relation with the existing ALS algorithms. Moreover, to further accelerate the convergence of the proposed algorithm, a new technique called the multi-model (MM) prediction is also introduced. While the analytical results are verified by the numerical simulations, it is also shown that our ALS-MPPM outperforms the existing ALS-based algorithms in terms of the rate of convergence.
- Published
- 2019
- Full Text
- View/download PDF
13. Chemometric differentiation of natural gas types in the northwestern Junggar Basin, NW China.
- Author
-
Wang, Yao-Ping, Zhan, Xin, Gao, Yuan, Wang, Sibo, Xia, Jia, and Zou, Yan-Rong
- Abstract
In recent years, the natural gas has displayed a growing significance in oil and gas exploration in the northwestern Junggar Basin (NWJB), although oil has been the main focus of exploration in the basin. Here, we systematically discuss the classification and origin of the natural gases from the NWJB based on the natural gas geochemistry and chemometric methods. The natural gases collected from the NWJB were chemometrically classified into three groups. Group A gases, defined as coal-derived gases, were likely generated from the mixing of the Jiamuhe Formation and Carboniferous strata. Group B gases, defined as the mixing of coal-derived and oil-associated gases, were restricted to the source rocks of group A and C gases. Group C gases, defined as oil-associated gases, were likely derived from both the Fengcheng and Wuerhe Formations, with a higher contribution from the latter strata. The result of this study suggests that the potential of oil generation in the Wuerhe Formation has been underestimated in the past. This is in accordance with geochemical and geological evidence. This study provides an effective chemometric method of natural gas classification and evaluation of hydrocarbon generation potential. This contributes to a better understanding of the origin of gases and distribution of oil and gas, assisting in exploration deployment in the basin. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
14. TRANSIENT DYNAMICS OF BLOCK COORDINATE DESCENT IN A VALLEY.
- Author
-
MOHLENKAMP, MARTIN, YOUNG, TODD R., and BÁRÁNY, BALÁZS
- Subjects
- *
TRANSIENTS (Dynamics) , *VALLEYS - Abstract
We investigate the transient dynamics of Block Coordinate Descent algorithms in valleys of the optimization landscape. Iterates converge linearly to a vicinity of the valley oor and then progress in a zig-zag fashion along the direction of the valley oor. When the valley sides are symmetric, the contraction factor to a vicinity of the valley oor appears to be no worse than 1/8, but without symmetry the contraction factor can approach 1. Progress along the direction of the valley oor is proportional to the gradient on the valley oor and inversely proportional to the narrowness" of the valley. We quantify narrowness using the eigenvalues of the Hessian on the valley oor and give explicit formulas for certain cases. Progress also depends on the direction of the valley with respect to the blocks of coordinates. When the valley sides are symmetric, we give an explicit formula for this dependence and use it to show that in higher dimensions nearly all directions give progress similar to the worst case direction. Finally, we observe that when starting the algorithm, the ordering of blocks in the first few steps can be important, but show that a greedy strategy with respect to objective function improvement can be a bad choice. [ABSTRACT FROM AUTHOR]
- Published
- 2020
15. Multi-linear sparse reconstruction for SAR imaging based on higher-order SVD
- Author
-
Yu-Fei Gao, Guan Gui, Xun-Chao Cong, Yue Yang, Yan-Bin Zou, and Qun Wan
- Subjects
Spotlight SAR imaging ,Kronecker constraint ,Compressed sensing ,HOSVD ,Alternating least squares ,Telecommunication ,TK5101-6720 ,Electronics ,TK7800-8360 - Abstract
Abstract This paper focuses on the spotlight synthetic aperture radar (SAR) imaging for point scattering targets based on tensor modeling. In a real-world scenario, scatterers usually distribute in the block sparse pattern. Such a distribution feature has been scarcely utilized by the previous studies of SAR imaging. Our work takes advantage of this structure property of the target scene, constructing a multi-linear sparse reconstruction algorithm for SAR imaging. The multi-linear block sparsity is introduced into higher-order singular value decomposition (SVD) with a dictionary constructing procedure by this research. The simulation experiments for ideal point targets show the robustness of the proposed algorithm to the noise and sidelobe disturbance which always influence the imaging quality of the conventional methods. The computational resources requirement is further investigated in this paper. As a consequence of the algorithm complexity analysis, the present method possesses the superiority on resource consumption compared with the classic matching pursuit method. The imaging implementations for practical measured data also demonstrate the effectiveness of the algorithm developed in this paper.
- Published
- 2017
- Full Text
- View/download PDF
16. Bilinear model factor decomposition: A general mixture analysis tool
- Author
-
Omidikia, N., Ghaffari, M., Jansen, J., Buydens, L., Tauler, Romà, Omidikia, N., Ghaffari, M., Jansen, J., Buydens, L., and Tauler, Romà
- Abstract
The analysis of mixtures is a routine task in the analytical chemistry area as well as in other research fields. The objective is to identify, quantify, and interpret the chemical components of the mixtures. Various bilinear factor decomposition methods, including MCR-ALS, NMFand BNFA, have been proposed to solve this problem. However, there is little knowledge about their comparative performance in terms of different factors, such as solution reliability, calculation speed, convergence, flexibility in constraint implementation, and ease of results interpretation. To address these issues, this work aims to compare these methods using data examples from data simulations, environmental source apportionment studies, and chromatographic analysis of chemical mixtures. Through this comparison, we hope to gain insights into the strengths and weaknesses of each method and provide recommendations for researchers working in this field. This comprehensive comparison will help researchers choose the appropriate method for their specific analysis needs, ultimately leading to more accurate and efficient analysis.
- Published
- 2023
17. The geometry of rank decompositions of matrix multiplication II: 3 × 3 matrices.
- Author
-
Ballard, Grey, Ikenmeyer, Christian, Landsberg, J.M., and Ryder, Nick
- Subjects
- *
DECOMPOSITION method , *MATRIX multiplications , *PERMUTATION groups , *LEAST squares , *MATHEMATICAL symmetry - Abstract
Abstract This is the second in a series of papers on rank decompositions of the matrix multiplication tensor. We present new rank 23 decompositions for the 3 × 3 matrix multiplication tensor M 〈 3 〉. All our decompositions have symmetry groups that include the standard cyclic permutation of factors but otherwise exhibit a range of behavior. One of them has 11 cubes as summands and admits an unexpected symmetry group of order 12. We establish basic information regarding symmetry groups of decompositions and outline two approaches for finding new rank decompositions of M 〈 n 〉 for larger n. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
18. Performance considerations for scalable parallel tensor decomposition.
- Author
-
Rolinger, Thomas B., Simon, Tyler A., and Krieger, Christopher D.
- Subjects
- *
SINGULAR value decomposition , *HILBERT-Huang transform , *BIG data , *DATA distribution , *COMMUNICATION patterns , *LEAST squares - Abstract
Tensor decomposition, the higher-order analogue to singular value decomposition, has emerged as a useful tool for finding relationships in large, sparse, multidimensional data. As this technique matures and is applied to increasingly larger data sets, the need for high performance implementations becomes critical. A better understanding of the performance characteristics of tensor decomposition on large and sparse tensors can help drive the development of such implementations. In this work, we perform an objective empirical evaluation of three state of the art parallel tools that implement the Canonical Decomposition/Parallel Factorization tensor decomposition algorithm using alternating least squares fitting (CP-ALS): SPLATT, DFacTo, and ENSIGN. We conduct performance studies across a variety of data sets and evaluate the tools with respect to total memory required, processor stall cycles, execution time, data distribution, and communication patterns. Furthermore, we investigate the performance of the implementations on tensors with up to 6 dimensions and when executing high rank decompositions. We find that tensor data structure layout and distribution choices can result in differences as large as 14.6x with respect to memory usage and 39.17x with respect to execution time. We provide an outline of a distributed heterogeneous CP-ALS implementation that addresses the performance issues we observe. • Performance study of parallel CP-ALS implementations for sparse tensors. • Measure memory usage, processor stall cycles, execution time and scalability. • Discuss MPI data distribution schemes and communication patterns. • Evaluate high rank decompositions and data sets with up to 6 dimensions. • Performance gaps are as large as 14.6x for memory usage and 39.17x for run time. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
19. The Gifi System for Nonlinear Multivariate Analysis
- Author
-
Michailides, George and de Leeuw, Jan
- Subjects
Optimal Scaling ,Alternating Least Squares ,Multivariate Techniques ,Loss Functions ,Stability - Abstract
The Gifi system of analyzing categorical data through nonlinear varieties of classical multivariate analysis techniques is reviewed. The system is characterized by the optimal scaling of categorical variables which is implemented through alternating least squares algorithms. The main technique of homogeneity analysis is presented, along with its extensions and generalizations leading to nonmetric principal components analysis and canonical correlation analysis. A brief account of stability issues and areas of applications of the techniques is also given.
- Published
- 1998
20. Application of Improved Recommendation System Based on Spark Platform in Big Data Analysis
- Author
-
Xie Li, Zhou Wenbo, and Li Yaosen
- Subjects
spark ,recommendation system ,collaborative filtering ,alternating least squares ,Cybernetics ,Q300-390 - Abstract
In the era of big data, people have to face information filtration problem. For those cases when users do not or cannot express their demands clearly, recommender system can analyse user’s information more proactive and intelligent to filter out something users want. This property makes recommender system play a very important role in the field of e-commerce, social network and so on. The collaborative filtering recommendation algorithm based on Alternating Least Squares (ALS) is one of common algorithms using matrix factorization technique of recommendation system. In this paper, we design the parallel implementation process of the recommendation algorithm based on Spark platform and the related technology research of recommendation systems. Because of the shortcomings of the recommendation algorithm based on ALS model, a new loss function is designed. Before the model is trained, the similarity information of users and items is fused. The experimental results show that the performance of the proposed algorithm is better than that of algorithm based on ALS.
- Published
- 2016
- Full Text
- View/download PDF
21. %ERA: A SAS Macro for Extended Redundancy Analysis
- Author
-
Pietro Giorgio Lovaglio and Gianmarco Vacca
- Subjects
extended redundancy analysis ,SAS macro ,alternating least squares ,latent components ,Statistics ,HA1-4737 - Abstract
A new approach to structural equation modeling based on so-called extended redundancy analysis has been recently proposed in the literature, enhanced with the added characteristic of generalizing redundancy analysis and reduced-rank regression models for more than two blocks. In this approach, the relationships between the observed exogenous variables and the observed endogenous variables are moderated by the presence of unobservable composites that were estimated as linear combinations of exogenous variables, permitting a great flexibility to specify and fit a variety of structural relationships. In this paper, we propose the SAS macro %ERA to specify and fit structural relationships in the extended redundancy analysis (ERA) framework. Two examples (simulation and real data) are provided in order to reproduce results appearing in the original article where ERA was proposed.
- Published
- 2016
- Full Text
- View/download PDF
22. Towards a Vector Field Based Approach to the Proper Generalized Decomposition (PGD)
- Author
-
Antonio Falcó, Lucía Hilario, Nicolás Montés, Marta C. Mora, and Enrique Nadal
- Subjects
proper generalised decomposition ,alternating least squares ,greedy rank one update algorithm ,tensor numerical methods ,Mathematics ,QA1-939 - Abstract
A novel algorithm called the Proper Generalized Decomposition (PGD) is widely used by the engineering community to compute the solution of high dimensional problems. However, it is well-known that the bottleneck of its practical implementation focuses on the computation of the so-called best rank-one approximation. Motivated by this fact, we are going to discuss some of the geometrical aspects of the best rank-one approximation procedure. More precisely, our main result is to construct explicitly a vector field over a low-dimensional vector space and to prove that we can identify its stationary points with the critical points of the best rank-one optimization problem. To obtain this result, we endow the set of tensors with fixed rank-one with an explicit geometric structure.
- Published
- 2020
- Full Text
- View/download PDF
23. A seminorm regularized alternating least squares algorithm for canonical tensor decomposition.
- Author
-
Chen, Yannan, Sun, Wenyu, Xi, Min, and Yuan, Jinyun
- Subjects
- *
LEAST squares , *MATHEMATICAL regularization , *DECOMPOSITION method , *ACCELERATION of convergence in numerical analysis , *NUMERICAL analysis , *EXTRAPOLATION - Abstract
Abstract The regularization method could deal with the swamp effect of alternating least squares (ALS) algorithms for tensor decomposition. Usually, the regularization term is a norm of the difference between the solution and the current iterate. In this paper, we show that the norm could be weakened to a seminorm, so the selection of the regularization term could be more flexible. To overcome the swamp effect and avoid the drawback that the Hessian of the subproblem may get close to singular in the iterative process, we propose a seminorm regularized ALS algorithm for solving the canonical tensor decomposition. Moreover, in the new algorithm, we introduce a novel extrapolation in the update of each mode factor which makes an immediate impression on the update of subsequent ones. By assuming the boundness of the infinite sequence of iterates generated by the new algorithm, we establish the global convergence and the (weakly) linear convergence rate of the sequence of iterates Numerical experiments on synthetic and real-world problems illustrate that the new method is efficient and promising. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
24. Multivariate Curve Resolution Alternating Least Squares Analysis of In Vivo Skin Raman Spectra
- Author
-
Irina Matveeva, Ivan Bratchenko, Yulia Khristoforova, Lyudmila Bratchenko, Alexander Moryatov, Sergey Kozlov, Oleg Kaganov, and Valery Zakharov
- Subjects
alternating least squares ,benign neoplasm ,malignant neoplasm ,multivariate curve resolution ,Raman probe ,Raman spectroscopy ,Electrical and Electronic Engineering ,Biochemistry ,Instrumentation ,Atomic and Molecular Physics, and Optics ,Analytical Chemistry - Abstract
In recent years, Raman spectroscopy has been used to study biological tissues. However, the analysis of experimental Raman spectra is still challenging, since the Raman spectra of most biological tissue components overlap significantly and it is difficult to separate individual components. New methods of analysis are needed that would allow for the decomposition of Raman spectra into components and the evaluation of their contribution. The aim of our work is to study the possibilities of the multivariate curve resolution alternating least squares (MCR-ALS) method for the analysis of skin tissues in vivo. We investigated the Raman spectra of human skin recorded using a portable conventional Raman spectroscopy setup. The MCR-ALS analysis was performed for the Raman spectra of normal skin, keratosis, basal cell carcinoma, malignant melanoma, and pigmented nevus. We obtained spectral profiles corresponding to the contribution of the optical system and skin components: melanin, proteins, lipids, water, etc. The obtained results show that the multivariate curve resolution alternating least squares analysis can provide new information on the biochemical profiles of skin tissues. Such information may be used in medical diagnostics to analyze Raman spectra with a low signal-to-noise ratio, as well as in various fields of science and industry for preprocessing Raman spectra to remove parasitic components.
- Published
- 2022
25. Collaborative Filtering on Movie Recommendation Using Big Data
- Author
-
Manivannan, M., Jahnavi, D. Sai, Gowthami, A., Jaswitha, V., Yadav, P. Vinay, Prasad, C. Jaya, Manivannan, M., Jahnavi, D. Sai, Gowthami, A., Jaswitha, V., Yadav, P. Vinay, and Prasad, C. Jaya
- Abstract
These days, digitalization is increasing with rapid growth of personal and by home digital devices and the daily usage of internet, we generate a very large amount of amount of data, termed as "Big data”. Movie recommendation systems can be enhanced to the needs of the users as individually. Collaborative filtering is a popular approach in big data domain to create recommendation systems. We describe a technique collaborative recommendation technique based on an algorithm specifically designed to mine association rules for this purpose. We use Alternating Least Squares. We use the Association rule mining approach to generate the rules to recommend movies to a user. We employ associations between users and association between the items. We build collaborative filtering in Apache spark. Apache Spark is the leading open- source unified analytics engine for big data processing. We use Euclidean distance similarity.
- Published
- 2022
26. Accelerating Jackknife Resampling for the Canonical Polyadic Decomposition
- Author
-
Psarras, Christos, Karlsson, Lars, Bro, Rasmus, Bientinesi, Paolo, Psarras, Christos, Karlsson, Lars, Bro, Rasmus, and Bientinesi, Paolo
- Abstract
The Canonical Polyadic (CP) tensor decomposition is frequently used as a model in applications in a variety of different fields. Using jackknife resampling to estimate parameter uncertainties is often desirable but results in an increase of the already high computational cost. Upon observation that the resampled tensors, though different, are nearly identical, we show that it is possible to extend the recently proposed Concurrent ALS (CALS) technique to a jackknife resampling scenario. This extension gives access to the computational efficiency advantage of CALS for the price of a modest increase (typically a few percent) in the number of floating point operations. Numerical experiments on both synthetic and real-world datasets demonstrate that the new workflow based on a CALS extension can be several times faster than a straightforward workflow where the jackknife submodels are processed individually.
- Published
- 2022
- Full Text
- View/download PDF
27. GROCERY PRODUCT RECOMMENDATIONS : USING RANDOM INDEXING AND COLLABORATIVE FILTERING
- Author
-
Orrenius, Axel, Wiebe Werner, Axel, Orrenius, Axel, and Wiebe Werner, Axel
- Abstract
The field of personalized product recommendation systems has seen tremendous growth in recent years. The usefulness of the algorithms’ abilities to filter out data from vast sets has been shown to be crucial in today’s information-heavy online experience. Our goal is therefore to compare two recommender models, one based on Random Indexing, the other on Collaborative Filtering, in order to find out if one is better suited to the task than the other. We bring up relevant previous research to set the context for our study, its limitations and possibilities. We then explain the theories, models and algorithms underlying our two recommender systems and finally we evaluate them, partly through empirical data collection from our employer Kavall’s platform, and partly through analysing data from interviews. We judge that our study is scientifically relevant as it compares an algorithm that is rarely used in this context, Random Indexing, to a more established recommendation algorithm, Collaborative Filtering, and as such the result of this comparison might give useful insights into the further development of new or existing algorithms. While more testing is required, the study did show signs that Random Indexing does have the potential of outperforming Collaborative Filtering in some areas, and further development of the model might be a worthwhile endeavor., Området för personliga produktrekommendationer har sett en enorm tillväxt under de senaste Åren. Användbarheten av algoritmernas förmåga att filtrera ut data ur stora uppsättningar har visat sig vara avgörande i dagens informationstunga onlineupplevelse. Vårt mål Är därför att jämföra två rekommendatormodeller, en baserad på Random Indexing, den andra på Collaborative Filtering, för att ta reda på om den ena Är bättre lämpad för uppgiften Än den andra. Vi tar upp relevant tidigare forskning för att sätta sammanhanget för vår studie, dess begränsningar och möjligheter. Vi förklarar sedan de teorier, modeller och algoritmer som ligger till grund för våra två rekommendationssystem och slutligen utvärderar vi dem, dels genom empirisk datainsamling från vår arbetsgivare Kavalls plattform, dels genom att analysera data från intervjuer. Vi bedömer att vår studie Är vetenskapligt relevant då den jämför en algoritm som sällan används i detta sammanhang, Random Indexing, med en mer etablerad rekommendationsalgoritm, Collaborative Filtering, och som sådan kan resultatet av denna jämförelse ge användbara insikter i den fortsatta utvecklingen av nya eller befintliga algoritmer. även om fler tester krävs, visade studien tecken på att Random Indexing har potentialen att överträffa Collaborative Filtering på vissa områden, och vidareutveckling av modellen kan vara ett givande åtagande.
- Published
- 2022
28. Multi-linear sparse reconstruction for SAR imaging based on higher-order SVD.
- Author
-
Gao, Yu-Fei, Gui, Guan, Cong, Xun-Chao, Yang, Yue, Zou, Yan-Bin, and Wan, Qun
- Subjects
SYNTHETIC aperture radar ,SINGULAR value decomposition ,SPARSE matrices - Abstract
This paper focuses on the spotlight synthetic aperture radar (SAR) imaging for point scattering targets based on tensor modeling. In a real-world scenario, scatterers usually distribute in the block sparse pattern. Such a distribution feature has been scarcely utilized by the previous studies of SAR imaging. Our work takes advantage of this structure property of the target scene, constructing a multi-linear sparse reconstruction algorithm for SAR imaging. The multi-linear block sparsity is introduced into higher-order singular value decomposition (SVD) with a dictionary constructing procedure by this research. The simulation experiments for ideal point targets show the robustness of the proposed algorithm to the noise and sidelobe disturbance which always influence the imaging quality of the conventional methods. The computational resources requirement is further investigated in this paper. As a consequence of the algorithm complexity analysis, the present method possesses the superiority on resource consumption compared with the classic matching pursuit method. The imaging implementations for practical measured data also demonstrate the effectiveness of the algorithm developed in this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
29. Hybrid semantic recommender system for chemical compounds in large-scale datasets
- Author
-
Francisco M. Couto, M. Barros, and André Moitinho
- Subjects
0301 basic medicine ,Computer science ,Bayesian probability ,02 engineering and technology ,Library and Information Sciences ,Ontology (information science) ,Recommender system ,Machine learning ,computer.software_genre ,lcsh:Chemistry ,03 medical and health sciences ,Semantic similarity ,0202 electrical engineering, electronic engineering, information engineering ,Relevance (information retrieval) ,Physical and Theoretical Chemistry ,lcsh:T58.5-58.64 ,business.industry ,Ontology ,lcsh:Information technology ,Scale (chemistry) ,Computer Graphics and Computer-Aided Design ,Computer Science Applications ,030104 developmental biology ,Ranking ,lcsh:QD1-999 ,Alternating least squares ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Research Article ,Chemical compound - Abstract
The increasing number of Chemical Compounds is a challenge for the researchers to explore such datasets. In this work, we propose the use of Recommender Systems in the exploration of new Chemical Compounds of interest to scientific researchers. Our approach consists in a Hybrid recommender model suitable for implicit feedback datasets and focused in retrieving a ranked list according to the relevance of the items. The model integrates collaborative-filtering algorithms for implicit feedback (Alternating Least Squares (ALS) and Bayesian Personalized Ranking(BPR)) and a new content-based algorithm, based on the semantic similarity of the Chemical Compounds in the ChEBI ontology. The algorithms were assessed on an implicit dataset of Chemical Compounds, CheRM-20, with more than 16.000 items (Chemical Compounds). The Hybrid model was able to improve the results of the collaborative-filtering algorithms, with increases of more than 10 percentage points in most of the assessed evaluation metrics.
- Published
- 2021
30. Constrained ALS-based tensor blind receivers for multi-user MIMO systems.
- Author
-
Buiquang, Chung and Ye, Zhongfu
- Subjects
- *
MIMO systems , *RECEIVING antennas , *FACTORIZATION , *STOCHASTIC convergence , *CALCULUS of tensors - Abstract
Abstract Tensor factorizations has shown to be an efficient approach for symbols and/or channel estimation in multi-input multi-output (MIMO) systems, where the factor matrices of tensor that correspond to symbols, channel, code/diversity of signals, are often estimated by using alternating least squares (ALS) algorithm. Although the performance of tensor approaches strongly depend on the initializations of the factor matrices. However, due to the absence of a priori on channels, these initializations are done randomly in traditional ALS algorithm. This generally implies a slow convergence. Further, ALS does not take into account the potential orthogonal structure in the factor matrices, which can be exploited to improve the accuracy of factor matrices recovery. To address these insures, this paper proposes constrained ALS tensor blind receivers for multi-user MIMO systems. We show that the multi-user MIMO signals can be expressed as a third-order tensor model, where the matrices of users symbols, direction-of-arrival (DOA) and delay can be viewed as three factor matrices of the tensor model. Two constrained ALS blind algorithms that take into account the potential orthogonal and Vandermonde structures in the factor matrices, are proposed to learn the tensor model, where the users symbols, DOA and delay are joint estimated as three factor matrices. Besides provide the estimations for the factor matrices, the orthogonal and Vandermonde structures also give a better uniqueness results for the use of tensor model. Interestingly, these structures are the nature properties of the factor matrices in our system. This results in an efficient blind approach that has better performance and lower complexity compare with the traditional ALS. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
31. Inline Raman Spectroscopy and Indirect Hard Modeling for Concentration Monitoring of Dissociated Acid Species
- Author
-
Alexander Walter Wilhelm Echtermeyer, Alexander Mitsos, Caroline Marks, and Jörn Viell
- Subjects
chemistry.chemical_classification ,Materials science ,Aqueous solution ,Carboxylic acid ,010401 analytical chemistry ,Analytical chemistry ,Infrared spectroscopy ,02 engineering and technology ,021001 nanoscience & nanotechnology ,01 natural sciences ,Dissociation (chemistry) ,0104 chemical sciences ,chemistry.chemical_compound ,symbols.namesake ,Applied spectroscopy ,chemistry ,Alternating least squares ,symbols ,ddc:610 ,Itaconic acid ,0210 nano-technology ,Raman spectroscopy ,Instrumentation ,Spectroscopy - Abstract
We propose an approach for monitoring the concentration of dissociated carboxylic acid species in dilute aqueous solution. The dissociated acid species are quantified employing inline Raman spectroscopy in combination with indirect hard modeling (IHM) and multivariate curve resolution (MCR). We introduce two different titration-based hard model (HM) calibration procedures for a single mono- or polyprotic acid in water with well-known (method A) or unknown (method B) acid dissociation constants p Ka. In both methods, spectra of only one acid species in water are prepared for each acid species. These spectra are used for the construction of HMs. For method A, the HMs are calibrated with calculated ideal dissociation equilibria. For method B, we estimate p Ka values by fitting ideal acid dissociation equilibria to acid peak areas that are obtained from a spectral HM. The HM in turn is constructed on the basis of MCR data. Thus, method B on the basis of IHM is independent of a priori known p K a values, but instead provides them as part of the calibration procedure. As a detailed example, we analyze itaconic acid in aqueous solution. For all acid species and water, we obtain low HM errors of −4mol mol−1 in the cases of both methods A and B. With only four calibration samples, IHM yields more accurate results than partial least squares regression. Furthermore, we apply our approach to formic, acetic, and citric acid in water, thereby verifying its generalizability as a process analytical technology for quantitative monitoring of processes containing carboxylic acids.
- Published
- 2020
- Full Text
- View/download PDF
32. Detection of Penicillin G Produced by Penicillium chrysogenum with Raman Microspectroscopy and Multivariate Curve Resolution-Alternating Least-Squares Methods
- Author
-
Akira Také, Yoko Takahashi, Shumpei Horii, Haruko Takeyama, Masahiro Ando, Ashok Zachariah Samuel, Atsuko Matsumoto, and Takuji Nakashima
- Subjects
Pharmaceutical Science ,01 natural sciences ,Analytical Chemistry ,symbols.namesake ,Drug Discovery ,medicine ,Pharmacology ,Multivariate curve resolution ,Chromatography ,biology ,010405 organic chemistry ,Chemistry ,Organic Chemistry ,Subcellular localization ,Penicillium chrysogenum ,biology.organism_classification ,0104 chemical sciences ,Raman microspectroscopy ,Penicillin ,010404 medicinal & biomolecular chemistry ,Complementary and alternative medicine ,Alternating least squares ,symbols ,Molecular Medicine ,Raman spectroscopy ,medicine.drug - Abstract
Raman microspectroscopy is a minimally invasive technique that can identify molecules without labeling. In this study, we demonstrate the detection of penicillin G inside Penicillium chrysogenum KF425 fungal cells. Raman spectra acquired from the fungal cells had highly overlapped spectroscopic signatures and hence were analyzed with multivariate curve resolution by alternating least-squares (MCR-ALS) to extract the spectra of individual molecular constituents. In addition to detecting spatial distribution of multiple constituents such as proteins and lipids inside the fungal body, we could also observe the subcellular localization of penicillin G. This methodology has the potential to be employed in screening the production of bioactive compounds by microorganisms.
- Published
- 2020
- Full Text
- View/download PDF
33. Tensor approximation of the self-diffusion matrix of tagged particle processes
- Author
-
Jad Dabaghi, Virginie Ehrlacher, Christoph Strössner, Ecole Supérieure d'Ingénieurs Léonard de Vinci (ESILV), Centre d'Enseignement et de Recherche en Mathématiques et Calcul Scientifique (CERMICS), École des Ponts ParisTech (ENPC), MATHematics for MatERIALS (MATHERIALS), École des Ponts ParisTech (ENPC)-École des Ponts ParisTech (ENPC)-Inria de Paris, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Ecole Polytechnique Fédérale de Lausanne (EPFL), This work was supported by the European Research Council under the European Union’s Seventh Framework Programme (FP/2007-2013) / ERC Grant Agreement number 614492 and under the European Union’s Horizon 2020 Research and Innovation Programme, ERC Grant Agreement number 810367, project EMC2. The work was initiated during the CEMRACS 2021 summer school at CIRM, Luminy, Marseille. The authors also acknowledge funding by the ANR project COMODO (ANR-19-CE46-0002), the Center on Energy and Climate Change (E4C) and the I-Site FUTURE., ANR-19-CE46-0002,COMODO,Systèmes de diffusion croisée sur des domaines en mouvement(2019), and European Project: 810367,EMC2(2019)
- Subjects
FOS: Computer and information sciences ,History ,Physics and Astronomy (miscellaneous) ,Polymers and Plastics ,alternating least squares ,limit-theorem ,FOS: Physical sciences ,Statistics - Computation ,Industrial and Manufacturing Engineering ,coefficient ,low-rank approximations ,tagged particle process ,FOS: Mathematics ,alternating least-squares ,Mathematics - Numerical Analysis ,decompositions ,Business and International Management ,Computation (stat.CO) ,exclusion ,Numerical Analysis ,Applied Mathematics ,Numerical Analysis (math.NA) ,Computational Physics (physics.comp-ph) ,self-diffusion ,Computer Science Applications ,Computational Mathematics ,finite-dimensional approximation ,Modeling and Simulation ,monte carlo methods ,high-dimensional optimization ,optimization ,Physics - Computational Physics ,[MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA] - Abstract
The objective of this paper is to investigate a new numerical method for the approximation of the self-diffusion matrix of a tagged particle process defined on a grid. While standard numerical methods make use of long-time averages of empirical means of deviations of some stochastic processes, and are thus subject to statistical noise, we propose here a tensor method in order to compute an approximation of the solution of a high-dimensional quadratic optimization problem, which enables to obtain a numerical approximation of the self-diffusion matrix. The tensor method we use here relies on an iterative scheme which builds low-rank approximations of the quantity of interest and on a carefully tuned variance reduction method so as to evaluate the various terms arising in the functional to minimize. In particular, we numerically observe here that it is much less subject to statistical noise than classical approaches.(c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).
- Published
- 2022
34. Corrigendum to 'Data mining Raman microspectroscopic responses of cells to drugs in vitro using multivariate curve resolution-alternating least squares' [Talanta 208 (2020) 120386]
- Author
-
Guillermo Quintás, Zeineb Farhane, Romà Tauler, Hugh J. Byrne, David Perez-Guaita, European Commission, Tauler, Romà, and Tauler, Romà [0000-0001-8559-9670]
- Subjects
Multivariate curve resolution ,Chemistry ,010401 analytical chemistry ,Analytical chemistry ,02 engineering and technology ,021001 nanoscience & nanotechnology ,01 natural sciences ,0104 chemical sciences ,3. Good health ,Analytical Chemistry ,symbols.namesake ,Alternating least squares ,symbols ,0210 nano-technology ,Raman spectroscopy ,Corrigendum - Abstract
The authors regret a mistake in the acknowledgement section., The acknowledgement of the original article should be corrected to: “This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 796287. GQ acknowledges support from the Agencia Estatal de Investigación (AEI) and the Fondo Europeo de Desarrollo Regional (FEDER) (CTQ2016- 79561-P)”
- Published
- 2022
35. Нейромережева система генерації персоналізованих рекомендацій на основі динамічних вкладень графів
- Author
-
Недашківська, Надія Іванівна
- Subjects
recommender system ,вкладення графів ,graph embeddings ,deep and cross networks ,alternating least squares ,змінні найменші квадрати ,deep learning ,bayesian personalied ranking ,factorization machines ,004.852 ,attention networks ,глибоке навчання ,машини факторизації ,довга короткострокова пам’ять ,динамічні графові мережі ,dynamic graph networks ,механізм уваги ,байєсівське персоналізоване ранжування ,рекомендаційна система ,long short-term memory ,глибока та широка мережа - Abstract
Магістерська дисертація: 114 с., 25 табл., 10 рис., 19 джерел, 1 додаток. Об’єктом дослідження є задача генерації контекстуальних рекомендацій. Предмет дослідження – алгоритми та моделі видачі контекстуальних рекомендацій. Мета дослідження полягає у аналізі алгоритмів видачі рекомендацій, що базуються на моделях матричної факторизації, а також алгоритмів, що використовують аппарат мереж глибокого навчання. Запропонована та розроблена модель рекомендаційних систем, що використовують аппарат глибоких перехресних нейронних мереж та мереж на основі механізму уваги. Дані моделі здатні швидко оброблювати послідовності дій користувача з об’єктами, враховуючи порядок виконаної дії, ознаку дії, характеристику суб’єкта та об’єкта, а також пов’язані з об’єктом суміжні позиції. Проведено порівняння запропонованої рекомендаційної системи з існуючими методами проектування рекомендаційних систем: машинами факторизації на основі байєсівського персоналізованого ранжування, машинами факторизації на основі методів змінних найменших квадратів та нейронними мережами на основі моделей довгої короткострокової пам’яті. Master’s thesis: 114 p., 25 tab., 10 fig., 19 references, 1 appendix. The object of following study is a contextual recommendation retrieval problem. The subjects of current thesis are models and algorithms of contextual recommendation retrieval. The purpose of study is to conduct research on effectiveness of different approaches to solve contextual recommendation tasks, such as matrix factorization models and deep learning models. In following research new recommender system modelling approach is proposed and implemented. That model type leverages architecture of Deep and Cross Networks with addition of Attention Networks. Proposed model type can quickly process user interaction data and infer knowledge of latter interactions taking into consideration interaction type, interaction order, features of items and users, and leverages previously gained knowledge of item-to-item relations. During the study, model comparisons were made. In particular, proposed model variations were compared with recommender system approaches based on matrix factorization machines, such as Bayesian Personalized Ranking and Alternating Least Squares, and deep learning based models, in particular Long Short-Term Memory network.
- Published
- 2022
36. Produktrekommendationer för matvaror med Random Indexing och Collaborative Filtering
- Author
-
Orrenius, Axel and Wiebe Werner, Axel
- Subjects
Machine Learning ,Rapid-Delivery ,Computer and Information Sciences ,Collaborative Filtering ,Product Recommendations ,Data- och informationsvetenskap ,Random Indexing ,Alternating Least Squares - Abstract
The field of personalized product recommendation systems has seen tremendous growth in recent years. The usefulness of the algorithms’ abilities to filter out data from vast sets has been shown to be crucial in today’s information-heavy online experience. Our goal is therefore to compare two recommender models, one based on Random Indexing, the other on Collaborative Filtering, in order to find out if one is better suited to the task than the other. We bring up relevant previous research to set the context for our study, its limitations and possibilities. We then explain the theories, models and algorithms underlying our two recommender systems and finally we evaluate them, partly through empirical data collection from our employer Kavall’s platform, and partly through analysing data from interviews. We judge that our study is scientifically relevant as it compares an algorithm that is rarely used in this context, Random Indexing, to a more established recommendation algorithm, Collaborative Filtering, and as such the result of this comparison might give useful insights into the further development of new or existing algorithms. While more testing is required, the study did show signs that Random Indexing does have the potential of outperforming Collaborative Filtering in some areas, and further development of the model might be a worthwhile endeavor. Området för personliga produktrekommendationer har sett en enorm tillväxt under de senaste Åren. Användbarheten av algoritmernas förmåga att filtrera ut data ur stora uppsättningar har visat sig vara avgörande i dagens informationstunga onlineupplevelse. Vårt mål Är därför att jämföra två rekommendatormodeller, en baserad på Random Indexing, den andra på Collaborative Filtering, för att ta reda på om den ena Är bättre lämpad för uppgiften Än den andra. Vi tar upp relevant tidigare forskning för att sätta sammanhanget för vår studie, dess begränsningar och möjligheter. Vi förklarar sedan de teorier, modeller och algoritmer som ligger till grund för våra två rekommendationssystem och slutligen utvärderar vi dem, dels genom empirisk datainsamling från vår arbetsgivare Kavalls plattform, dels genom att analysera data från intervjuer. Vi bedömer att vår studie Är vetenskapligt relevant då den jämför en algoritm som sällan används i detta sammanhang, Random Indexing, med en mer etablerad rekommendationsalgoritm, Collaborative Filtering, och som sådan kan resultatet av denna jämförelse ge användbara insikter i den fortsatta utvecklingen av nya eller befintliga algoritmer. även om fler tester krävs, visade studien tecken på att Random Indexing har potentialen att överträffa Collaborative Filtering på vissa områden, och vidareutveckling av modellen kan vara ett givande åtagande.
- Published
- 2022
37. Sistemes recomanadors aplicats a productes de roba
- Author
-
Pérez Muro, Sergi, Universitat Autònoma de Barcelona. Escola d'Enginyeria, and Baldrich i Caselles, Ramon
- Subjects
Factorización de matrices ,Filtratge col·laboratiu ,User-based cf ,Factorització de matrius ,Similitud de jaccard ,Collaborative filtering ,Matrix factorization ,Jaccard similarity ,Sistemes recomanadors ,Item-based cf ,Sistemas recomendadores ,Similitud de cosinus ,Recommender systems ,Content-based ,Alternating Least Squares ,Similitud de coseno ,Cosine similarity ,Filtrado colaborativo ,Stochastic Gradient Descent - Abstract
Els sistemes recomanadors avui dia són molt presents a internet, de manera que han canviat la forma en què les persones descobreixen i consumeixen nous continguts o productes. Un sistema recomanador és una eina que estableix un conjunt de criteris i valoracions sobre dades d'usuaris per realitzar prediccions sobre les preferències d'aquests i recomanar elements que els puguin ser d'utilitat. En aquest projecte s'exploren les diferents tècniques que ofereix el filtratge col·laboratiu, el que consisteix a utilitzar les dades de compres existents dels usuaris: aquelles basades en la similitud entre items (item-based), les basades en la similitud entre usuaris (user-based), la factorització de matrius i finalment les basades en aprenentatge profund. Aquestes s'apliquen a un problema real de recomanació de productes de roba a partir de les compres dels usuaris en què caldrà determinar quins són els 12 productes que els usuaris més probablement compraran. Recommender systems today are really present on the internet, so they have changed the way people discover and cosume new content or products. A recommender system is a tool that establishes a set of criteria and assessments of user data to make predictions about their preferences and recommend items that may be useful to them. This project explores the different techniques offered by collaborative filtering, which consists in using data about purchases users have made. These tecniques are based in similarities of items (item-based), similarities of users (user-based), matrix factorization and finally based of deep learning architectures. These are applied to a real problem of recommending clothing products based on users purchases in which it will need to determine which 12 products are most likely to buy. Los sistemas recomendadores hoy en día están muy presentes en internet, por lo que han cambiado la forma en que las personas descubren y consumen nuevos contenidos o productos. Un sistema recomendador es una herramienta que establece un conjunto de criterios y valoraciones sobre datos de usuarios para realizar predicciones sobre sus preferencias y recomendar elementos que les puedan ser de utilidad. En este proyecto se exploran las diferentes técnicas que ofrece el filtrado colaborativo, lo que consiste en utilizar los datos de compras existentes de los usuarios: aquellos basados en la similitud entre items (item-based), los basados en la similitud entre usuarios (user-based), la factorización de matrices y finalmente las basadas en aprendizaje profundo. Éstas se aplican a un problema real de recomendación de productos de ropa a partir de las compras de los usuarios en las que habrá que determinar cuáles son los 12 productos que los usuarios más probablemente comprarán.
- Published
- 2022
38. Digital images and independent components analysis in the determination of bioactive compounds from grape juice
- Author
-
Patrícia Valderrama, Karla K. Beltrame, Paulo Henrique Março, Douglas N. Rutledge, Thays R. Gonçalves, Sandra Terezinha Marques Gomes, Makoto Matsushita, and Universidade Estadual de Maringá (UEM), Maringá, Paraná
- Subjects
Multivariate curve resolution ,Computer science ,Calibration (statistics) ,business.industry ,Calibration curve ,010401 analytical chemistry ,Pattern recognition ,010501 environmental sciences ,01 natural sciences ,Independent component analysis ,0104 chemical sciences ,Set (abstract data type) ,Digital image ,[CHIM.ANAL]Chemical Sciences/Analytical chemistry ,Alternating least squares ,Principal component analysis ,Artificial intelligence ,business ,ComputingMilieux_MISCELLANEOUS ,0105 earth and related environmental sciences ,Food Science - Abstract
Analytical methods based on digital images coupled to chemometric tools are promising alternatives for developing analytical methodologies in determining quality parameters in foods. The total phenolics, monomeric anthocyanins, and tannin content were quantified based on digital images and pseudo-univariate calibration performed from Independent Components Analysis (ICA). Differences between ICA, Principal Component Analysis (PCA), and Multivariate Curve Resolution with Alternating Least Squares (MCR-ALS), highlighting the chosen ICA for this proposal, were explained. In all pseudo-univariate calibration curves, correlation coefficients higher than 0.9 were achieved. The results suggest that digital images coupled with ICA can give applicable pseudo-univariate calibration models to determine bioactive compounds. Even more, the device to obtain an image is cheaper than most analytical instruments. Digital images may have a promising future in developing analytical methodologies for complex samples such as foods from a pseudo-univariate calibration perspective (where a small data set is needed in the calibration step).
- Published
- 2021
- Full Text
- View/download PDF
39. Sparse principal component analysis subject to prespecified cardinality of loadings.
- Author
-
Adachi, Kohei and Trendafilov, Nickolay
- Subjects
- *
PRINCIPAL components analysis , *STATISTICAL correlation , *FACTOR analysis , *EIGENVECTORS , *VARIANCES - Abstract
Most of the existing procedures for sparse principal component analysis (PCA) use a penalty function to obtain a sparse matrix of weights by which a data matrix is post-multiplied to produce PC scores. In this paper, we propose a new sparse PCA procedure which differs from the existing ones in two ways. First, the new procedure does not sparsify the weight matrix. Instead, the so-called loadings matrix is sparsified by which the score matrix is post-multiplied to approximate the data matrix. Second, the cardinality of the loading matrix i.e., the total number of nonzero loadings, is pre-specified to be an integer without using penalty functions. The procedure is called unpenalized sparse loading PCA (USLPCA). A desirable property of USLPCA is that the indices for the percentages of explained variances can be defined in the same form as in the standard PCA. We develop an alternate least squares algorithm for USLPCA which uses the fact that the PCA loss function can be decomposed as a sum of a term irrelevant to the loadings, and another one being easily minimized under cardinality constraints. A procedure is also presented for selecting the best cardinality using information criteria. The procedures are assessed in a simulation study and illustrated with real data examples. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
40. Optimal scaling for survival analysis with ordinal data.
- Author
-
Willems, S.J.W., Fiocco, M., and Meulman, J.J.
- Subjects
- *
PARAMETER estimation , *PROPORTIONAL hazards models , *SURVIVAL analysis (Biometry) , *LEAST squares , *ACQUISITION of data - Abstract
Medical and psychological studies often involve the collection and analysis of categorical data with nominal or ordinal category levels. Nominal categories have no ordering property, e.g. gender, with the two unordered covariates male and female. Ordinal category levels, however, have an ordering, e.g. when subjects are classified according to their education level, often categorized as low, medium or high education. When analyzing survival data, currently two methods can be chosen to include ordinal covariates in the Cox proportional hazard model. Dummy covariates can be used to indicate category memberships, as is usually done for nominal covariates. Estimated parameters for each category indicate the increase or decrease in risk of experiencing the event of interest compared to the reference category. Since these parameters are estimated independently from each other, the ordering property of the categories is lost in the process. To keep the ordinal property, integer values can be given to the covariate’s categories (e.g. low = 0, medium = 1, high = 2), and the variable is included in the model as a numeric covariate. However, since the ordinal data are now interpreted as numeric data, the property of equal distances between consecutive categories is introduced. This assumption is too strict for this data type; distances between consecutive categories do not necessarily have to be equal. A method is described to include ordinal data in the Cox model. The method implements optimal scaling to find optimal quantifications for the ordinal category levels. These quantifications are chosen such that they preserve the categories’ ordering, and do not force equal distances between consecutive category levels. A simulation study is carried out to compare the performance of optimal scaling with the performance of the two currently used methods described above. Results show that the optimal scaling method increases the model fit if ordinal covariates are included in the model. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
41. Tensor-based methods for blind spatial signature estimation under arbitrary and unknown source covariance structure.
- Author
-
Gomes, Paulo R.B., de Almeida, André L.F., da Costa, João Paulo C.L., and Del Galdo, Giovanni
- Subjects
- *
TENSOR fields , *MOBILE communication systems , *FACTORIZATION , *ANALYSIS of covariance , *ESTIMATION theory - Abstract
Spatial signature estimation is a problem encountered in several applications in signal processing such as mobile communications, sonar, radar, astronomy and seismology. In this paper, we propose higher-order tensor methods to solve the blind spatial signature estimation problem using planar arrays. By assuming that sources' powers vary between successive time blocks, we recast the spatial and spatiotemporal covariance models for the received data as third-order PARATUCK2 and fourth-order Tucker4 tensor decompositions, respectively. Firstly, by exploiting the multilinear algebraic structure of the proposed tensor models, new iterative algorithms are formulated to blindly estimate the spatial signatures. Secondly, in order to achieve a better spatial resolution, we propose an expanded form of spatial smoothing that returns extra spatial dimensions in comparison with the traditional approaches. Additionally, by exploiting the higher-order structure of the resulting expanded tensor model, a multilinear noise reduction preprocessing step is proposed via higher-order singular value decomposition. We show that the increase on the tensor order provides a more efficient denoising, and consequently a better performance compared to existing spatial smoothing techniques. Finally, a solution based on a multi-stage Khatri–Rao factorization procedure is incorporated as the final stage of our proposed estimators. Our results demonstrate that the proposed tensor methods yield more accurate spatial signature estimates than competing approaches while operating in a challenging scenario where the source covariance structure is unknown and arbitrary (non-diagonal), which is actually the case when sample covariances are computed from a limited number of snapshots. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
42. A Weighted Additive Model for the Whole Demand Analysis of a Seasonally Dependent Product Using Meteorological and Regional Data, Considering Social Customs Factors and Policy Factors: Focus on Japanese Beer Demand Structure
- Author
-
Hiroshi Yamashita, Tsuyoshi Kurihara, and Takaaki Kawanaka
- Subjects
Structure (mathematical logic) ,Focus (computing) ,Demand analysis ,Alternating least squares ,Econometrics ,Economics ,General Social Sciences ,Product (category theory) ,Additive model ,General Economics, Econometrics and Finance - Published
- 2019
- Full Text
- View/download PDF
43. Towards a Vector Field Based Approach to the Proper Generalized Decomposition (PGD)
- Author
-
Universitat Politècnica de València. Departamento de Ingeniería Mecánica y de Materiales - Departament d'Enginyeria Mecànica i de Materials, Universitat Politècnica de València. Departamento de Ingeniería de Sistemas y Automática - Departament d'Enginyeria de Sistemes i Automàtica, Universitat Politècnica de València. Instituto de Diseño para la Fabricación y Producción Automatizada - Institut de Disseny per a la Fabricació i Producció Automatitzada, Generalitat Valenciana, Agencia Estatal de Investigación, Falco, Antonio, Hilario Pérez, Lucia, Montés Sánchez, Nicolás, Mora Aguilar, Marta Covadonga, Nadal, Enrique, Universitat Politècnica de València. Departamento de Ingeniería Mecánica y de Materiales - Departament d'Enginyeria Mecànica i de Materials, Universitat Politècnica de València. Departamento de Ingeniería de Sistemas y Automática - Departament d'Enginyeria de Sistemes i Automàtica, Universitat Politècnica de València. Instituto de Diseño para la Fabricación y Producción Automatizada - Institut de Disseny per a la Fabricació i Producció Automatitzada, Generalitat Valenciana, Agencia Estatal de Investigación, Falco, Antonio, Hilario Pérez, Lucia, Montés Sánchez, Nicolás, Mora Aguilar, Marta Covadonga, and Nadal, Enrique
- Abstract
[EN] A novel algorithm called the Proper Generalized Decomposition (PGD) is widely used by the engineering community to compute the solution of high dimensional problems. However, it is well-known that the bottleneck of its practical implementation focuses on the computation of the so-called best rank-one approximation. Motivated by this fact, we are going to discuss some of the geometrical aspects of the best rank-one approximation procedure. More precisely, our main result is to construct explicitly a vector field over a low-dimensional vector space and to prove that we can identify its stationary points with the critical points of the best rank-one optimization problem. To obtain this result, we endow the set of tensors with fixed rank-one with an explicit geometric structure
- Published
- 2021
44. Multivariate Curve Resolution and Carbon Balance Constraint to Unravel FTIR Spectra from Fed-Batch Fermentation Samples
- Author
-
Dennis Vier, Stefan Wambach, Volker Schünemann, and Klaus-Uwe Gollmer
- Subjects
multivariate curve resolution ,E. coli ,fed-batch ,fermentation ,carbon mass balance constraint ,soft constraints ,alternating least squares ,hybrid modelling ,Technology ,Biology (General) ,QH301-705.5 - Abstract
The current work investigates the capability of a tailored multivariate curve resolution–alternating least squares (MCR-ALS) algorithm to analyse glucose, phosphate, ammonium and acetate dynamics simultaneously in an E. coli BL21 fed-batch fermentation. The high-cell-density (HCDC) process is monitored by ex situ online attenuated total reflection (ATR) Fourier transform infrared (FTIR) spectroscopy and several in situ online process sensors. This approach efficiently utilises automatically generated process data to reduce the time and cost consuming reference measurement effort for multivariate calibration. To determine metabolite concentrations with accuracies between ±0.19 and ±0.96·gL−l, the presented utilisation needs primarily—besides online sensor measurements—single FTIR measurements for each of the components of interest. The ambiguities in alternating least squares solutions for concentration estimation are reduced by the insertion of analytical process knowledge primarily in the form of elementary carbon mass balances. Thus, in this way, the established idea of mass balance constraints in MCR combines with the consistency check of measured data by carbon balances, as commonly applied in bioprocess engineering. The constraints are calculated based on online process data and theoretical assumptions. This increased calculation effort is able to replace, to a large extent, the need for manually conducted quantitative chemical analysis, leads to good estimations of concentration profiles and a better process understanding.
- Published
- 2017
- Full Text
- View/download PDF
45. Study of dimension reduction techniques based on Principal Components: Non-linear Principal Components Analysis
- Author
-
Giraldo Otálvaro, Juan David, Esteban Duarte, Nubia, and Martínez Aragón, Aymara
- Subjects
Principal Components Analyisis ,Optimal Scaling ,Análisis de Homogeneidad ,Análisis multivariable ,Multivariate analysis ,Homogeinity Analysis ,Escalamiento Óptimo ,Componentes Principales no Lineales ,Nonlinear Principal Components ,Componentes Principales ,Mínimos Cuadrados Alternantes ,519 - Probabilidades y matemáticas aplicadas [510 - Matemáticas] ,Alternating Least Squares - Abstract
figuras, tablas En la estadística multivariada un gran desafío en el manejo correcto de grandes cantidades de datos es el análisis de variables de carácter cuantitativo y cualitativo al mismo tiempo, es decir, análisis de datos mixtos. En lo relacionado al tratamiento de datos solamente cuantitativos existen varias técnicas que ayudan en la reducción de la dimensión, en donde el Análisis de Componentes Principales (PCA) es la metodología de mayor relevancia. Para el análisis de datos mixtos, la técnica de Análisis de Componentes Principales proporciona una base fundamental para otras técnicas multivariadas como lo es el Análisis de Componentes Principales No Lineales (NLPCA), la cual no está muy bien documentada y tal vez aplicada sin la rigurosidad que la teoría requiere. Por otro lado, su uso no ha sido extendido a la metodología de las cartas de control como herramienta que apoya la gestión de calidad desde un punto de vista analítico. Por lo anterior, en este trabajo se describe de forma teórica la metodología de Análisis de Componentes Principales y se formaliza una técnica que permita el procesamiento de datos mixtos con el fin de facilitar la reducción de dimensión bajo el marco del PCA seleccionando la técnica de Análisis de Componentes Principales No Lineales (NLPCA), la cual incluye en su procesamiento la cuantificación óptima de datos cualitativos de manera no lineal con el fin de encontrar las mejores relaciones entre las variables. Se propone adaptar las cartas de control desarrolladas para variables múltiples y componentes obtenidas a partir del PCA, a las técnicas NLPCA obteniendo herramientas novedosas de gran interés para la interpretación de datos. Las metodologías descritas son aplicadas a un conjunto de datos reales pertenecientes al Proyecto “Corazones de Baependi” (Processo Fapesp 2007/58150-7) del Laboratorio de Genética y Cardiología Molecular (Incor/USP). (Texto tomado de la fuente) In multivariate statistics, a great challenge in the correct handling of large amounts of data is the analysis of variables of a quantitative and qualitative nature at the same time, that is, analysis of mixed data. Regarding the treatment of only quantitative data, there are several techniques that help in dimensional reduction, where the Principal Component Analysis (PCA) is the most relevant methodology. For the analysis of mixed data, the Principal Component Analysis technique provides a fundamental basis for other multivariate techniques such as Nonlinear Principal Component Analysis (NLPCA), which is not very well documented and perhaps applied without rigor. that the theory requires. On the other hand, its use has not been extended to the control chart methodology as a tool that supports quality management from an analytical point of view. Due to the above, in this work the Principal Component Analysis methodology is described theoretically and a technique is formalized that allows the processing of mixed data in order to facilitate the reduction of dimensions under the framework of the PCA by selecting the technique Non-linear Principal Components Analysis (NLPCA), which includes in its processing the optimal quantification of qualitative data in a non-linear way in order to find the best relationships between the variables. It is proposed to adapt the control charts developed for multiple variables and components obtained from the PCA, to the NLPCA techniques, obtaining novel tools of great interest for data interpretation. The methodologies described are applied to a set of real data belonging to the Project "Hearts of Baependi ”(Processo Fapesp 2007 / 58150-7) of the Molecular Genetics and Cardiology Laboratory (Incor / USP). Maestría Magíster en Ciencias - Matemática Aplicada
- Published
- 2021
46. Setting B2B digital marketing in artificial intelligence-based CRMs: A review and directions for future research
- Author
-
Jose Ramon Saura, Daniel Palacios-Marqués, and Domingo Ribeiro-Soriano
- Subjects
Marketing ,R language ,Digital marketing ,business.industry ,Computer science ,Customer relationship management ,R (Programming language) ,CRMs ,Multiple correspondence analysis ,Artificial intelligence-based CRMs ,Artificial intelligence-based ,Work (electrical) ,Alternating least squares ,ORGANIZACION DE EMPRESAS ,Artificial intelligence ,business ,Communication channel ,B2B digital marketing - Abstract
[EN] The new business challenges in the B2B sector are determined by connected ecosystems, where data-driven decision making is crucial for successful strategies. At the same time, the use of digital marketing as a communication and sales channel has led to the need and use of Customer Relationship Management (CRM) systems to correctly manage company information. The understanding of B2B traditional Marketing strategies that use CRMs that work with Artificial Intelligence (AI) has been studied, however, research focused on the understanding and application of these technologies in B2B digital marketing is scarce. To cover this gap in the literature, this study develops a literature review on the main academic contributions in this area. To visualize the outcomes of the literature review, the results are then analyzed using a statistical approach known as Multiple Correspondence Analysis (MCA) under the homogeneity analysis of variance by means of alternating least squares (HOMALS) framework programmed in the R language. The research results classify the types of CRMs and their typologies and explore the main techniques and uses of AI-based CRMs in B2B digital marketing. In addition, a discussion, directions and propositions for future research are presented., In gratitude to the Ministry of Science, Innovation and Universities and the European Regional Development Fund: RTI2018-096295-BC22.
- Published
- 2021
47. MTC: Multiresolution Tensor Completion from Partial and Coarse Observations
- Author
-
Jimeng Sun, Navjot Singh, Cheng Qian, Chaoqi Yang, Cao Xiao, and Edgar Solomonik
- Subjects
FOS: Computer and information sciences ,2019-20 coronavirus outbreak ,Computer Science - Machine Learning ,Coronavirus disease 2019 (COVID-19) ,Hierarchy (mathematics) ,Tensor completion ,Numerical Analysis (math.NA) ,Space (mathematics) ,Machine Learning (cs.LG) ,Factorization ,Alternating least squares ,FOS: Mathematics ,Tensor ,Mathematics - Numerical Analysis ,Algorithm ,Mathematics - Abstract
Existing tensor completion formulation mostly relies on partial observations from a single tensor. However, tensors extracted from real-world data are often more complex due to: (i) Partial observation: Only a small subset (e.g., 5%) of tensor elements are available. (ii) Coarse observation: Some tensor modes only present coarse and aggregated patterns (e.g., monthly summary instead of daily reports). In this paper, we are given a subset of the tensor and some aggregated/coarse observations (along one or more modes) and seek to recover the original fine-granular tensor with low-rank factorization. We formulate a coupled tensor completion problem and propose an efficient Multi-resolution Tensor Completion model (MTC) to solve the problem. Our MTC model explores tensor mode properties and leverages the hierarchy of resolutions to recursively initialize an optimization setup, and optimizes on the coupled system using alternating least squares. MTC ensures low computational and space complexity. We evaluate our model on two COVID-19 related spatio-temporal tensors. The experiments show that MTC could provide 65.20% and 75.79% percentage of fitness (PoF) in tensor completion with only 5% fine granular observations, which is 27.96% relative improvement over the best baseline. To evaluate the learned low-rank factors, we also design a tensor prediction task for daily and cumulative disease case predictions, where MTC achieves 50% in PoF and 30% relative improvements over the best baseline., Comment: Accepted in SIGKDD 2021. Code in https://github.com/ycq091044/MTC
- Published
- 2021
- Full Text
- View/download PDF
48. Towards a Vector Field Based Approach to the Proper Generalized Decomposition (PGD)
- Author
-
Antonio Falcó, Enrique Nadal, Lucia Hilario, Marta C. Mora, Nicolás Montés, Producción Científica UCH 2021, and UCH. Departamento de Matemáticas, Física y Ciencias Tecnológicas
- Subjects
Greedy rank one update algorithm ,Optimization problem ,Computer science ,General Mathematics ,Computation ,INGENIERIA MECANICA ,alternating least squares ,Álgebra de tensores ,Structure (category theory) ,010103 numerical & computational mathematics ,tensor numerical methods ,01 natural sciences ,Tensor algebra ,Bottleneck ,Decomposition (Mathematics) ,Alternating least squares ,Computer Science (miscellaneous) ,Applied mathematics ,0101 mathematics ,Engineering (miscellaneous) ,Descomposición (Matemáticas) ,lcsh:Mathematics ,Algoritmos ,Ecuaciones en derivadas parciales ,Proper generalised decomposition ,lcsh:QA1-939 ,Differential equations, Partial ,Stationary point ,INGENIERIA DE SISTEMAS Y AUTOMATICA ,010101 applied mathematics ,proper generalised decomposition ,Algorithms ,Tensor numerical methods ,greedy rank one update algorithm ,Vector field ,Vector space - Abstract
[EN] A novel algorithm called the Proper Generalized Decomposition (PGD) is widely used by the engineering community to compute the solution of high dimensional problems. However, it is well-known that the bottleneck of its practical implementation focuses on the computation of the so-called best rank-one approximation. Motivated by this fact, we are going to discuss some of the geometrical aspects of the best rank-one approximation procedure. More precisely, our main result is to construct explicitly a vector field over a low-dimensional vector space and to prove that we can identify its stationary points with the critical points of the best rank-one optimization problem. To obtain this result, we endow the set of tensors with fixed rank-one with an explicit geometric structure, This research was funded by the GVA/2019/124 grant from Generalitat Valenciana and by the RTI2018-093521-B-C32 grant from the Ministerio de Ciencia, Innovacion y Universidades. Document
- Published
- 2021
49. Accelerating jackknife resampling for the Canonical Polyadic Decomposition
- Author
-
Christos Psarras, Lars Karlsson, Rasmus Bro, and Paolo Bientinesi
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,decomposition ,math.NA ,Computer Sciences ,Applied Mathematics ,Canonical Polyadic Decomposition ,Tensors ,Numerical Analysis (math.NA) ,cs.MS ,jackknife ,Datavetenskap (datalogi) ,CP ,FOS: Mathematics ,Computer Science - Mathematical Software ,Sannolikhetsteori och statistik ,Mathematics - Numerical Analysis ,ALS ,Alternating Least Squares ,Probability Theory and Statistics ,Mathematical Software (cs.MS) ,cs.NA - Abstract
The Canonical Polyadic (CP) tensor decomposition is frequently used as a model in applications in a variety of different fields. Using jackknife resampling to estimate parameter uncertainties is often desirable but results in an increase of the already high computational cost. Upon observation that the resampled tensors, though different, are nearly identical, we show that it is possible to extend the recently proposed Concurrent ALS (CALS) technique to a jackknife resampling scenario. This extension gives access to the computational efficiency advantage of CALS for the price of a modest increase (typically a few percent) in the number of floating point operations. Numerical experiments on both synthetic and real-world datasets demonstrate that the new workflow based on a CALS extension can be several times faster than a straightforward workflow where the jackknife submodels are processed individually.
- Published
- 2021
- Full Text
- View/download PDF
50. Raman mapping coupled to self‐modelling MCR‐ALS analysis to estimate active cosmetic ingredient penetration profile in skin
- Author
-
Ali Tfayli, Emilie Munnier, Aline Stella, Florent Yvergnaux, Hugh J. Byrne, Clovis Tauber, Franck Bonnier, Igor Chourpa, Imagerie et cerveau (iBrain - Inserm U1253 - UNIV Tours ), Université de Tours (UT)-Institut National de la Santé et de la Recherche Médicale (INSERM), Nanomédicaments et Nanosondes, EA 6295 (NMNS), Université de Tours (UT), Université Paris-Saclay, U-Psud, Université Paris-Saclay, Bioeurope Groupe Solabia, Focas Research Institute [Dublin], Technological University [Dublin] (TU), Mistic 2017, Université de Tours-Institut National de la Santé et de la Recherche Médicale (INSERM), and Université de Tours
- Subjects
Delipidol ,Materials science ,General Physics and Astronomy ,Raman mapping ,Spectrum Analysis, Raman ,01 natural sciences ,General Biochemistry, Genetics and Molecular Biology ,Diffusion profile ,010309 optics ,symbols.namesake ,[CHIM.ANAL]Chemical Sciences/Analytical chemistry ,0103 physical sciences ,Stratum corneum ,medicine ,Humans ,General Materials Science ,penetration profiles ,Least-Squares Analysis ,Skin ,Multivariate curve resolution ,010401 analytical chemistry ,confocal raman mapping ,General Engineering ,General Chemistry ,Penetration (firestop) ,0104 chemical sciences ,Cosmetic ingredient ,Chemistry ,medicine.anatomical_structure ,Alternating least squares ,Multivariate Analysis ,symbols ,Epidermis ,human skin ,multivariate curve resolution alternating least squares ,Raman spectroscopy ,Biological system - Abstract
International audience; Confocal Raman mapping (CRM) is a powerful, label free, non‐destructive tool, enabling molecular characterization of human skin with applications in the dermo‐cosmetic field. Coupling CRM to multivariate analysis can be used to monitor the penetration and permeation of active cosmetic ingredients (ACI) after topical application. It is presently illustrated how multivariate curve resolution alternating least squares (MCR‐ALS) can be applied to detect and semi‐quantitatively describe the diffusion profile of Delipidol, a commercially available slimming ACI, from Raman spectral maps. Although the analysis outcome can be critically dependent on the a priori selection of the number of regression components, it is demonstrated that profiling of the kinetics of diffusion into the skin can be established with or without additionnal spectral equality constraints in the multivariate analysis, with similar results. Ultimately, MCR‐ALS, applied without spectral equality contraints, specifically identifies the ACI as one of main spectral components enabling to investigate its distribution and penetration into the stratum corneum and underlying epidermis layers.
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.