49 results on '"Nassif, Houssam"'
Search Results
2. Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice
- Author
-
Fiez, Tanner, Nassif, Houssam, Chen, Yu-Cheng, Gamez, Sergio, and Jain, Lalit
- Subjects
Computer Science - Machine Learning ,Statistics - Methodology - Abstract
Adaptive experimental design (AED) methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. However, the behavior and guarantees of such methods are not well-understood beyond idealized stationary settings. This paper shares lessons learned regarding the challenges of naively using AED systems in industrial settings where non-stationarity is prevalent, while also providing perspectives on the proper objectives and system specifications in such settings. We developed an AED framework for counterfactual inference based on these experiences, and tested it in a commercial environment.
- Published
- 2024
3. Estimation of subsidiary performance metrics under optimal policies
- Author
-
Li, Zhaoqi, Nassif, Houssam, and Luedtke, Alex
- Subjects
Mathematics - Statistics Theory - Abstract
In policy learning, the goal is typically to optimize a primary performance metric, but other subsidiary metrics often also warrant attention. This paper presents two strategies for evaluating these subsidiary metrics under a policy that is optimal for the primary one. The first relies on a novel margin condition that facilitates Wald-type inference. Under this and other regularity conditions, we show that the one-step corrected estimator is efficient. Despite the utility of this margin condition, it places strong restrictions on how the subsidiary metric behaves for nearly optimal policies, which may not hold in practice. We therefore introduce alternative, two-stage strategies that do not require a margin condition. The first stage constructs a set of candidate policies and the second builds a uniform confidence interval over this set. We provide numerical simulations to evaluate the performance of these methods in different scenarios.
- Published
- 2024
4. Experimental Designs for Heteroskedastic Variance
- Author
-
Weltz, Justin, Fiez, Tanner, Volfovsky, Alexander, Laber, Eric, Mason, Blake, Nassif, Houssam, and Jain, Lalit
- Subjects
Mathematics - Statistics Theory - Abstract
Most linear experimental design problems assume homogeneous variance although heteroskedastic noise is present in many realistic settings. Let a learner have access to a finite set of measurement vectors $\mathcal{X}\subset \mathbb{R}^d$ that can be probed to receive noisy linear responses of the form $y=x^{\top}\theta^{\ast}+\eta$. Here $\theta^{\ast}\in \mathbb{R}^d$ is an unknown parameter vector, and $\eta$ is independent mean-zero $\sigma_x^2$-sub-Gaussian noise defined by a flexible heteroskedastic variance model, $\sigma_x^2 = x^{\top}\Sigma^{\ast}x$. Assuming that $\Sigma^{\ast}\in \mathbb{R}^{d\times d}$ is an unknown matrix, we propose, analyze and empirically evaluate a novel design for uniformly bounding estimation error of the variance parameters, $\sigma_x^2$. We demonstrate the benefits of this method with two adaptive experimental design problems under heteroskedastic noise, fixed confidence transductive best-arm identification and level-set identification and prove the first instance-dependent lower bounds in these settings. Lastly, we construct near-optimal algorithms and demonstrate the large improvements in sample complexity gained from accounting for heteroskedastic variance in these designs empirically.
- Published
- 2023
5. On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models
- Author
-
Li, Boyao, Thomson, Alexandar J., Nassif, Houssam, Engelhard, Matthew M., and Page, David
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
Deep neural networks (DNNs) lack the precise semantics and definitive probabilistic interpretation of probabilistic graphical models (PGMs). In this paper, we propose an innovative solution by constructing infinite tree-structured PGMs that correspond exactly to neural networks. Our research reveals that DNNs, during forward propagation, indeed perform approximations of PGM inference that are precise in this alternative PGM structure. Not only does our research complement existing studies that describe neural networks as kernel machines or infinite-sized Gaussian processes, it also elucidates a more direct approximation that DNNs make to exact inference in PGMs. Potential benefits include improved pedagogy and interpretation of DNNs, and algorithms that can merge the strengths of PGMs and DNNs., Comment: Accepted to NeurIPS 2024
- Published
- 2023
6. A Data-Driven State Aggregation Approach for Dynamic Discrete Choice Models
- Author
-
Geng, Sinong, Nassif, Houssam, and Manzanares, Carlos A.
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
We study dynamic discrete choice models, where a commonly studied problem involves estimating parameters of agent reward functions (also known as "structural" parameters), using agent behavioral data. Maximum likelihood estimation for such models requires dynamic programming, which is limited by the curse of dimensionality. In this work, we present a novel algorithm that provides a data-driven method for selecting and aggregating states, which lowers the computational and sample complexity of estimation. Our method works in two stages. In the first stage, we use a flexible inverse reinforcement learning approach to estimate agent Q-functions. We use these estimated Q-functions, along with a clustering algorithm, to select a subset of states that are the most pivotal for driving changes in Q-functions. In the second stage, with these selected "aggregated" states, we conduct maximum likelihood estimation using a commonly used nested fixed-point algorithm. The proposed two-stage approach mitigates the curse of dimensionality by reducing the problem dimension. Theoretically, we derive finite-sample bounds on the associated estimation error, which also characterize the trade-off of computational complexity, estimation error, and sample complexity. We demonstrate the empirical performance of the algorithm in two classic dynamic discrete choice estimation applications.
- Published
- 2023
7. Neural Insights for Digital Marketing Content Design
- Author
-
Kong, Fanjie, Li, Yuan, Nassif, Houssam, Fiez, Tanner, Henao, Ricardo, and Chakrabarti, Shreya
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In digital marketing, experimenting with new website content is one of the key levers to improve customer engagement. However, creating successful marketing content is a manual and time-consuming process that lacks clear guiding principles. This paper seeks to close the loop between content creation and online experimentation by offering marketers AI-driven actionable insights based on historical data to improve their creative process. We present a neural-network-based system that scores and extracts insights from a marketing content design, namely, a multimodal neural network predicts the attractiveness of marketing contents, and a post-hoc attribution method generates actionable insights for marketers to improve their content in specific marketing locations. Our insights not only point out the advantages and drawbacks of a given current content, but also provide design recommendations based on historical data. We show that our scoring model and insights work well both quantitatively and qualitatively.
- Published
- 2023
- Full Text
- View/download PDF
8. Adaptive Experimental Design and Counterfactual Inference
- Author
-
Fiez, Tanner, Gamez, Sergio, Chen, Arick, Nassif, Houssam, and Jain, Lalit
- Subjects
Computer Science - Machine Learning ,Statistics - Methodology - Abstract
Adaptive experimental design methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. This paper shares lessons learned regarding the challenges and pitfalls of naively using adaptive experimentation systems in industrial settings where non-stationarity is prevalent, while also providing perspectives on the proper objectives and system specifications in these settings. We developed an adaptive experimental design framework for counterfactual inference based on these experiences, and tested it in a commercial environment., Comment: In Workshops of the Conference on Recommender Systems (RecSys), 2022
- Published
- 2022
9. Instance-optimal PAC Algorithms for Contextual Bandits
- Author
-
Li, Zhaoqi, Ratliff, Lillian, Nassif, Houssam, Jamieson, Kevin, and Jain, Lalit
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
In the stochastic contextual bandit setting, regret-minimizing algorithms have been extensively researched, but their instance-minimizing best-arm identification counterparts remain seldom studied. In this work, we focus on the stochastic bandit problem in the $(\epsilon,\delta)$-$\textit{PAC}$ setting: given a policy class $\Pi$ the goal of the learner is to return a policy $\pi\in \Pi$ whose expected reward is within $\epsilon$ of the optimal policy with probability greater than $1-\delta$. We characterize the first $\textit{instance-dependent}$ PAC sample complexity of contextual bandits through a quantity $\rho_{\Pi}$, and provide matching upper and lower bounds in terms of $\rho_{\Pi}$ for the agnostic and linear contextual best-arm identification settings. We show that no algorithm can be simultaneously minimax-optimal for regret minimization and instance-dependent PAC for best-arm identification. Our main result is a new instance-optimal and computationally efficient algorithm that relies on a polynomial number of calls to an argmax oracle.
- Published
- 2022
10. Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits
- Author
-
Jun, Kwang-Sung, Jain, Lalit, Mason, Blake, and Nassif, Houssam
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
We propose improved fixed-design confidence bounds for the linear logistic model. Our bounds significantly improve upon the state-of-the-art bound by Li et al. (2017) via recent developments of the self-concordant analysis of the logistic loss (Faury et al., 2020). Specifically, our confidence bound avoids a direct dependence on $1/\kappa$, where $\kappa$ is the minimal variance over all arms' reward distributions. In general, $1/\kappa$ scales exponentially with the norm of the unknown linear parameter $\theta^*$. Instead of relying on this worst-case quantity, our confidence bound for the reward of any given arm depends directly on the variance of that arm's reward distribution. We present two applications of our novel bounds to pure exploration and regret minimization logistic bandits improving upon state-of-the-art performance guarantees. For pure exploration, we also provide a lower bound highlighting a dependence on $1/\kappa$ for a family of instances.
- Published
- 2020
11. Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions
- Author
-
Geng, Sinong, Nassif, Houssam, Manzanares, Carlos A., Reppen, A. Max, and Sircar, Ronnie
- Subjects
Computer Science - Machine Learning ,Mathematics - Optimization and Control ,Statistics - Machine Learning - Abstract
We propose a reward function estimation framework for inverse reinforcement learning with deep energy-based policies. We name our method PQR, as it sequentially estimates the Policy, the $Q$-function, and the Reward function by deep learning. PQR does not assume that the reward solely depends on the state, instead it allows for a dependency on the choice of action. Moreover, PQR allows for stochastic state transitions. To accomplish this, we assume the existence of one anchor action whose reward is known, typically the action of doing nothing, yielding no reward. We present both estimators and algorithms for the PQR method. When the environment transition is known, we prove that the PQR reward estimator uniquely recovers the true reward. With unknown transitions, we bound the estimation error of PQR. Finally, the performance of PQR is demonstrated by synthetic and real-world datasets.
- Published
- 2020
12. Bayesian Meta-Prior Learning Using Empirical Bayes
- Author
-
Nabi, Sareh, Nassif, Houssam, Hong, Joseph, Mamani, Hamed, and Imbens, Guido
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Adding domain knowledge to a learning system is known to improve results. In multi-parameter Bayesian frameworks, such knowledge is incorporated as a prior. On the other hand, various model parameters can have different learning rates in real-world problems, especially with skewed data. Two often-faced challenges in Operation Management and Management Science applications are the absence of informative priors, and the inability to control parameter learning rates. In this study, we propose a hierarchical Empirical Bayes approach that addresses both challenges, and that can generalize to any Bayesian framework. Our method learns empirical meta-priors from the data itself and uses them to decouple the learning rates of first-order and second-order features (or any other given feature grouping) in a Generalized Linear Model. As the first-order features are likely to have a more pronounced effect on the outcome, focusing on learning first-order weights first is likely to improve performance and convergence time. Our Empirical Bayes method clamps features in each group together and uses the deployed model's observed data to empirically compute a hierarchical prior in hindsight. We report theoretical results for the unbiasedness, strong consistency, and optimal frequentist cumulative regret properties of our meta-prior variance estimator. We apply our method to a standard supervised learning optimization problem, as well as an online combinatorial optimization problem in a contextual bandit setting implemented in an Amazon production system. Both during simulations and live experiments, our method shows marked improvements, especially in cases of small traffic. Our findings are promising, as optimizing over sparse data is often a challenge., Comment: Expanded discussions on applications and extended literature review section. Forthcoming in the Management Science Journal
- Published
- 2020
13. Seeker: Real-Time Interactive Search
- Author
-
Biswas, Ari, Pham, Thai T, Vogelsong, Michael, Snyder, Benjamin, and Nassif, Houssam
- Subjects
Computer Science - Information Retrieval ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
This paper introduces Seeker, a system that allows users to interactively refine search rankings in real time, through feedback in the form of likes and dislikes. When searching online, users may not know how to accurately describe their product of choice in words. An alternative approach is to search an embedding space, allowing the user to query using a representation of the item (like a tune for a song, or a picture for an object). However, this approach requires the user to possess an example representation of their desired item. Additionally, most current search systems do not allow the user to dynamically adapt the results with further feedback. On the other hand, users often have a mental picture of the desired item and are able to answer ordinal questions of the form: "Is this item similar to what you have in mind?" With this assumption, our algorithm allows for users to provide sequential feedback on search results to adapt the search feed. We show that our proposed approach works well both qualitatively and quantitatively. Unlike most previous representation-based search systems, we can quantify the quality of our algorithm by evaluating humans-in-the-loop experiments., Comment: This paper will appear in KDD 2019
- Published
- 2019
14. An Efficient Bandit Algorithm for Realtime Multivariate Optimization
- Author
-
Hill, Daniel N, Nassif, Houssam, Liu, Yi, Iyer, Anand, and Vishwanathan, S V N
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Optimization is commonly employed to determine the content of web pages, such as to maximize conversions on landing pages or click-through rates on search engine result pages. Often the layout of these pages can be decoupled into several separate decisions. For example, the composition of a landing page may involve deciding which image to show, which wording to use, what color background to display, etc. Such optimization is a combinatorial problem over an exponentially large decision space. Randomized experiments do not scale well to this setting, and therefore, in practice, one is typically limited to optimizing a single aspect of a web page at a time. This represents a missed opportunity in both the speed of experimentation and the exploitation of possible interactions between layout decisions. Here we focus on multivariate optimization of interactive web pages. We formulate an approach where the possible interactions between different components of the page are modeled explicitly. We apply bandit methodology to explore the layout space efficiently and use hill-climbing to select optimal content in realtime. Our algorithm also extends to contextualization and personalization of layout selection. Simulation results show the suitability of our approach to large decision spaces with strong interactions between content. We further apply our algorithm to optimize a message that promotes adoption of an Amazon service. After only a single week of online optimization, we saw a 21% conversion increase compared to the median layout. Our technique is currently being deployed to optimize content across several locations at Amazon.com., Comment: KDD'17 Audience Appreciation Award
- Published
- 2018
- Full Text
- View/download PDF
15. An Inductive Logic Programming Approach to Validate Hexose Binding Biochemical Knowledge
- Author
-
Nassif, Houssam, Al-Ali, Hassan, Khuri, Sawsan, Keirouz, Walid, and Page, David
- Subjects
Quantitative Biology - Other Quantitative Biology ,Computer Science - Logic in Computer Science - Abstract
Hexoses are simple sugars that play a key role in many cellular pathways, and in the regulation of development and disease mechanisms. Current protein-sugar computational models are based, at least partially, on prior biochemical findings and knowledge. They incorporate different parts of these findings in predictive black-box models. We investigate the empirical support for biochemical findings by comparing Inductive Logic Programming (ILP) induced rules to actual biochemical results. We mine the Protein Data Bank for a representative data set of hexose binding sites, non-hexose binding sites and surface grooves. We build an ILP model of hexose-binding sites and evaluate our results against several baseline machine learning classifiers. Our method achieves an accuracy similar to that of other black-box classifiers while providing insight into the discriminating process. In addition, it confirms wet-lab findings and reveals a previously unreported Trp-Glu amino acids dependency.
- Published
- 2018
- Full Text
- View/download PDF
16. Contextual Multi-Armed Bandits for Causal Marketing
- Author
-
Sawant, Neela, Namballa, Chitti Babu, Sadagopan, Narayanan, and Nassif, Houssam
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
This work explores the idea of a causal contextual multi-armed bandit approach to automated marketing, where we estimate and optimize the causal (incremental) effects. Focusing on causal effect leads to better return on investment (ROI) by targeting only the persuadable customers who wouldn't have taken the action organically. Our approach draws on strengths of causal inference, uplift modeling, and multi-armed bandits. It optimizes on causal treatment effects rather than pure outcome, and incorporates counterfactual generation within data collection. Following uplift modeling results, we optimize over the incremental business metric. Multi-armed bandit methods allow us to scale to multiple treatments and to perform off-policy policy evaluation on logged data. The Thompson sampling strategy in particular enables exploration of treatments on similar customer contexts and materialization of counterfactual outcomes. Preliminary offline experiments on a retail Fashion marketing dataset show merits of our proposal.
- Published
- 2018
17. Adaptive, Personalized Diversity for Visual Discovery
- Author
-
Teo, Choon Hui, Nassif, Houssam, Hill, Daniel, Srinavasan, Sriram, Goodman, Mitchell, Mohan, Vijai, and Vishwanathan, SVN
- Subjects
Computer Science - Information Retrieval ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shopping experience. Here we explore extensions in the direction of adaptive personalization and item diversification within Stream, a new form of visual browsing and discovery by Amazon. Our system presents the user with a diverse set of interesting items while adapting to user interactions. Our solution consists of three components (1) a Bayesian regression model for scoring the relevance of items while leveraging uncertainty, (2) a submodular diversification framework that re-ranks the top scoring items based on category, and (3) personalized category preferences learned from the user's behavior. When tested on live traffic, our algorithms show a strong lift in click-through-rate and session duration., Comment: Best Paper Award
- Published
- 2018
- Full Text
- View/download PDF
18. Diversifying Music Recommendations
- Author
-
Nassif, Houssam, Cansizlar, Kemal Oral, Goodman, Mitchell, and Vishwanathan, SVN
- Subjects
Computer Science - Multimedia ,Computer Science - Information Retrieval - Abstract
We compare submodular and Jaccard methods to diversify Amazon Music recommendations. Submodularity significantly improves recommendation quality and user engagement. Unlike the Jaccard method, our submodular approach incorporates item relevance score within its optimization function, and produces a relevant and uniformly diverse set., Comment: Machine Learning for Music Discovery Workshop at the 33rd International Conference on Machine Learning (ICML'16), New York, 2016
- Published
- 2018
19. Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice
- Author
-
Fiez, Tanner, primary, Nassif, Houssam, additional, Chen, Yu-Cheng, additional, Gamez, Sergio, additional, and Jain, Lalit, additional
- Published
- 2024
- Full Text
- View/download PDF
20. Predicting invasive breast cancer versus DCIS in different age groups.
- Author
-
Ayvaci, Mehmet US, Alagoz, Oguzhan, Chhatwal, Jagpreet, Munoz del Rio, Alejandro, Sickles, Edward A, Nassif, Houssam, Kerlikowske, Karla, and Burnside, Elizabeth S
- Subjects
Humans ,Carcinoma ,Ductal ,Breast ,Carcinoma ,Intraductal ,Noninfiltrating ,Breast Neoplasms ,Mammography ,Logistic Models ,Risk Factors ,Age Factors ,Adult ,Aged ,Middle Aged ,Female ,Logistic models ,Breast neoplasms ,Overdiagnosis ,Biopsy ,Aging ,Oncology & Carcinogenesis ,Oncology and Carcinogenesis ,Public Health and Health Services - Abstract
BackgroundIncreasing focus on potentially unnecessary diagnosis and treatment of certain breast cancers prompted our investigation of whether clinical and mammographic features predictive of invasive breast cancer versus ductal carcinoma in situ (DCIS) differ by age.MethodsWe analyzed 1,475 malignant breast biopsies, 1,063 invasive and 412 DCIS, from 35,871 prospectively collected consecutive diagnostic mammograms interpreted at University of California, San Francisco between 1/6/1997 and 6/29/2007. We constructed three logistic regression models to predict the probability of invasive cancer versus DCIS for the following groups: women ≥ 65 (older group), women 50-64 (middle age group), and women < 50 (younger group). We identified significant predictors and measured the performance in all models using area under the receiver operating characteristic curve (AUC).ResultsThe models for older and the middle age groups performed significantly better than the model for younger group (AUC = 0.848 vs, 0.778; p = 0.049 and AUC = 0.851 vs, 0.778; p = 0.022, respectively). Palpability and principal mammographic finding were significant predictors in distinguishing invasive from DCIS in all age groups. Family history of breast cancer, mass shape and mass margins were significant positive predictors of invasive cancer in the older group whereas calcification distribution was a negative predictor of invasive cancer (i.e. predicted DCIS). In the middle age group--mass margins, and in the younger group--mass size were positive predictors of invasive cancer.ConclusionsClinical and mammographic finding features predict invasive breast cancer versus DCIS better in older women than younger women. Specific predictive variables differ based on age.
- Published
- 2014
21. Neural Insights for Digital Marketing Content Design
- Author
-
Kong, Fanjie, primary, Li, Yuan, additional, Nassif, Houssam, additional, Fiez, Tanner, additional, Henao, Ricardo, additional, and Chakrabarti, Shreya, additional
- Published
- 2023
- Full Text
- View/download PDF
22. Support Vector Machines for Differential Prediction
- Author
-
Kuusisto, Finn, Costa, Vitor Santos, Nassif, Houssam, Burnside, Elizabeth, Page, David, Shavlik, Jude, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, Series editor, Tanaka, Yuzuru, Series editor, Wahlster, Wolfgang, Series editor, Siekmann, Jörg, Series editor, Calders, Toon, editor, Esposito, Floriana, editor, Hüllermeier, Eyke, editor, and Meo, Rosa, editor
- Published
- 2014
- Full Text
- View/download PDF
23. Score As You Lift (SAYL): A Statistical Relational Learning Approach to Uplift Modeling
- Author
-
Nassif, Houssam, Kuusisto, Finn, Burnside, Elizabeth S., Page, David, Shavlik, Jude, Santos Costa, Vítor, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Blockeel, Hendrik, editor, Kersting, Kristian, editor, Nijssen, Siegfried, editor, and Železný, Filip, editor
- Published
- 2013
- Full Text
- View/download PDF
24. Relational Differential Prediction
- Author
-
Nassif, Houssam, Santos Costa, Vítor, Burnside, Elizabeth S., Page, David, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Flach, Peter A., editor, De Bie, Tijl, editor, and Cristianini, Nello, editor
- Published
- 2012
- Full Text
- View/download PDF
25. An Inductive Logic Programming Approach to Validate Hexose Binding Biochemical Knowledge
- Author
-
Nassif, Houssam, Al-Ali, Hassan, Khuri, Sawsan, Keirouz, Walid, Page, David, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, and De Raedt, Luc, editor
- Published
- 2010
- Full Text
- View/download PDF
26. Bayesian Meta-Prior Learning Using Empirical Bayes
- Author
-
Nabi, Sareh, primary, Nassif, Houssam, additional, Hong, Joseph, additional, Mamani, Hamed, additional, and Imbens, Guido, additional
- Published
- 2021
- Full Text
- View/download PDF
27. Support Vector Machines for Differential Prediction
- Author
-
Kuusisto, Finn, primary, Costa, Vitor Santos, additional, Nassif, Houssam, additional, Burnside, Elizabeth, additional, Page, David, additional, and Shavlik, Jude, additional
- Published
- 2014
- Full Text
- View/download PDF
28. Score As You Lift (SAYL): A Statistical Relational Learning Approach to Uplift Modeling
- Author
-
Nassif, Houssam, primary, Kuusisto, Finn, additional, Burnside, Elizabeth S., additional, Page, David, additional, Shavlik, Jude, additional, and Santos Costa, Vítor, additional
- Published
- 2013
- Full Text
- View/download PDF
29. Relational Differential Prediction
- Author
-
Nassif, Houssam, primary, Santos Costa, Vítor, additional, Burnside, Elizabeth S., additional, and Page, David, additional
- Published
- 2012
- Full Text
- View/download PDF
30. An Inductive Logic Programming Approach to Validate Hexose Binding Biochemical Knowledge
- Author
-
Nassif, Houssam, primary, Al-Ali, Hassan, additional, Khuri, Sawsan, additional, Keirouz, Walid, additional, and Page, David, additional
- Published
- 2010
- Full Text
- View/download PDF
31. Automatic classification of mammography reports by BI-RADS breast tissue composition class
- Author
-
Percha, Bethany, Nassif, Houssam, Lipson, Jafi, Burnside, Elizabeth, and Rubin, Daniel
- Published
- 2012
- Full Text
- View/download PDF
32. Bayesian Meta-Prior Learning Using Empirical Bayes.
- Author
-
Nabi, Sareh, Nassif, Houssam, Hong, Joseph, Mamani, Hamed, and Imbens, Guido
- Subjects
EMPIRICAL Bayes methods ,SUPERVISED learning ,COMBINATORIAL optimization - Abstract
Adding domain knowledge to a learning system is known to improve results. In multiparameter Bayesian frameworks, such knowledge is incorporated as a prior. On the other hand, the various model parameters can have different learning rates in real-world problems, especially with skewed data. Two often-faced challenges in operation management and management science applications are the absence of informative priors and the inability to control parameter learning rates. In this study, we propose a hierarchical empirical Bayes approach that addresses both challenges and that can generalize to any Bayesian framework. Our method learns empirical meta-priors from the data itself and uses them to decouple the learning rates of first-order and second-order features (or any other given feature grouping) in a generalized linear model. Because the first-order features are likely to have a more pronounced effect on the outcome, focusing on learning first-order weights first is likely to improve performance and convergence time. Our empirical Bayes method clamps features in each group together and uses the deployed model's observed data to empirically compute a hierarchical prior in hindsight. We report theoretical results for the unbiasedness, strong consistency, and optimal frequentist cumulative regret properties of our meta-prior variance estimator. We apply our method to a standard supervised learning optimization problem as well as an online combinatorial optimization problem in a contextual bandit setting implemented in an Amazon production system. During both simulations and live experiments, our method shows marked improvements, especially in cases of small traffic. Our findings are promising because optimizing over sparse data is often a challenge. This paper was accepted by Hamid Nazerzadeh, Management Science Special Section on Data-Driven Prescriptive Analytics. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
33. Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
- Author
-
A Santos Jose C, Nassif Houssam, Page David, Muggleton Stephen H, and E Sternberg Michael J
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. Results The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. Conclusions In addition to confirming literature results, ProGolem’s model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners.
- Published
- 2012
- Full Text
- View/download PDF
34. Seeker
- Author
-
Biswas, Ari, primary, Pham, Thai T., additional, Vogelsong, Michael, additional, Snyder, Benjamin, additional, and Nassif, Houssam, additional
- Published
- 2019
- Full Text
- View/download PDF
35. An Efficient Bandit Algorithm for Realtime Multivariate Optimization
- Author
-
Hill, Daniel N., primary, Nassif, Houssam, additional, Liu, Yi, additional, Iyer, Anand, additional, and Vishwanathan, S.V.N., additional
- Published
- 2017
- Full Text
- View/download PDF
36. Adaptive, Personalized Diversity for Visual Discovery
- Author
-
Teo, Choon Hui, primary, Nassif, Houssam, additional, Hill, Daniel, additional, Srinivasan, Sriram, additional, Goodman, Mitchell, additional, Mohan, Vijai, additional, and Vishwanathan, S.V.N., additional
- Published
- 2016
- Full Text
- View/download PDF
37. Rational Polypharmacology: Systematically Identifying and Engaging Multiple Drug Targets To Promote Axon Growth
- Author
-
Al-Ali, Hassan, primary, Lee, Do-Hun, additional, Danzi, Matt C., additional, Nassif, Houssam, additional, Gautam, Prson, additional, Wennerberg, Krister, additional, Zuercher, Bill, additional, Drewry, David H., additional, Lee, Jae K., additional, Lemmon, Vance P., additional, and Bixby, John L., additional
- Published
- 2015
- Full Text
- View/download PDF
38. Using machine learning to identify benign cases with non-definitive biopsy
- Author
-
Kuusisto, Finn, primary, Dutra, Ines, additional, Nassif, Houssam, additional, Wu, Yirong, additional, Klein, Molly E., additional, Neuman, Heather B., additional, Shavlik, Jude, additional, and Burnside, Elizabeth S., additional
- Published
- 2013
- Full Text
- View/download PDF
39. Extracting BI-RADS features from Portuguese clinical texts
- Author
-
Nassif, Houssam, primary, Cunha, Filipe, additional, Moreira, Ines C., additional, Cruz-Correia, Ricardo, additional, Sousa, Eliana, additional, Page, David, additional, Burnside, Elizabeth, additional, and Dutra, Ines, additional
- Published
- 2012
- Full Text
- View/download PDF
40. Uncovering age-specific invasive and DCIS breast cancer rules using inductive logic programming
- Author
-
Nassif, Houssam, primary, Page, David, additional, Ayvaci, Mehmet, additional, Shavlik, Jude, additional, and Burnside, Elizabeth S., additional
- Published
- 2010
- Full Text
- View/download PDF
41. Information Extraction for Clinical Data Mining: A Mammography Case Study
- Author
-
Nassif, Houssam, primary, Woods, Ryan, additional, Burnside, Elizabeth, additional, Ayvaci, Mehmet, additional, Shavlik, Jude, additional, and Page, David, additional
- Published
- 2009
- Full Text
- View/download PDF
42. Prediction of protein-glucose binding sites using support vector machines
- Author
-
Nassif, Houssam, primary, Al-Ali, Hassan, additional, Khuri, Sawsan, additional, and Keirouz, Walid, additional
- Published
- 2009
- Full Text
- View/download PDF
43. Predicting invasive breast cancer versus DCIS in different age groups.
- Author
-
Ayvac, Mehmet U. S., Alagoz, Oguzhan, Chhatwal, Jagpreet, del Rio, Alejandro Munoz, Sickles, Edward A., Nassif, Houssam, Kerlikowske, Karla, and Burnside, Elizabeth S.
- Subjects
BREAST cancer diagnosis ,MAMMOGRAMS ,BREAST biopsy ,AGING ,RECEIVER operating characteristic curves - Abstract
Background: Increasing focus on potentially unnecessary diagnosis and treatment of certain breast cancers prompted our investigation of whether clinical and mammographic features predictive of invasive breast cancer versus ductal carcinoma in situ (DCIS) differ by age. Methods: We analyzed 1,475 malignant breast biopsies, 1,063 invasive and 412 DCIS, from 35,871 prospectively collected consecutive diagnostic mammograms interpreted at University of California, San Francisco between 1/6/1997 and 6/29/2007. We constructed three logistic regression models to predict the probability of invasive cancer versus DCIS for the following groups: women ≥ 65 (older group), women 50-64 (middle age group), and women < 50 (younger group). We identified significant predictors and measured the performance in all models using area under the receiver operating characteristic curve (AUC). Results: The models for older and the middle age groups performed significantly better than the model for younger group (AUC = 0.848 vs, 0.778; p = 0.049 and AUC = 0.851 vs, 0.778; p = 0.022, respectively). Palpability and principal mammographic finding were significant predictors in distinguishing invasive from DCIS in all age groups. Family history of breast cancer, mass shape and mass margins were significant positive predictors of invasive cancer in the older group whereas calcification distribution was a negative predictor of invasive cancer (i.e. predicted DCIS). In the middle age group--mass margins, and in the younger group--mass size were positive predictors of invasive cancer. Conclusions: Clinical and mammographic finding features predict invasive breast cancer versus DCIS better in older women than younger women. Specific predictive variables differ based on age. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
44. Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study.
- Author
-
Antos, Jose C A, Nassif, Houssam, Page, David, Muggleton, Stephen H, and Sternberg, Michael J E
- Subjects
- *
HEXOSES , *PROTEIN-ligand interactions , *INDUCTIVE logic programming , *BIOMOLECULES - Abstract
Background: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. Results: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues CYS and LEU. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. Conclusions: In addition to confirming literature results, ProGolem's model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
45. Genetic variants improve breast cancer risk prediction on mammograms.
- Author
-
Liu J, Page D, Nassif H, Shavlik J, Peissig P, McCarty C, Onitilo AA, and Burnside E
- Subjects
- Breast Neoplasms diagnostic imaging, Case-Control Studies, Female, Genetic Predisposition to Disease, Genotype, Humans, Neural Networks, Computer, Polymorphism, Single Nucleotide, ROC Curve, Bayes Theorem, Breast Neoplasms genetics, Mammography, Risk Assessment methods
- Abstract
Several recent genome-wide association studies have identified genetic variants associated with breast cancer. However, how much these genetic variants may help advance breast cancer risk prediction based on other clinical features, like mammographic findings, is unknown. We conducted a retrospective case-control study, collecting mammographic findings and high-frequency/low-penetrance genetic variants from an existing personalized medicine data repository. A Bayesian network was developed using Tree Augmented Naive Bayes (TAN) by training on the mammographic findings, with and without the 22 genetic variants collected. We analyzed the predictive performance using the area under the ROC curve, and found that the genetic variants significantly improved breast cancer risk prediction on mammograms. We also identified the interaction effect between the genetic variants and collected mammographic findings in an attempt to link genotype to mammographic phenotype to better understand disease patterns, mechanisms, and/or natural history.
- Published
- 2013
46. Using Machine Learning to Identify Benign Cases with Non-Definitive Biopsy.
- Author
-
Kuusisto F, Dutra I, Nassif H, Wu Y, Klein ME, Neuman HB, Shavlik J, and Burnside ES
- Abstract
When mammography reveals a suspicious finding, a core needle biopsy is usually recommended. In 5% to 15% of these cases, the biopsy diagnosis is non-definitive and a more invasive surgical excisional biopsy is recommended to confirm a diagnosis. The majority of these cases will ultimately be proven benign. The use of excisional biopsy for diagnosis negatively impacts patient quality of life and increases costs to the healthcare system. In this work, we employ a multi-relational machine learning approach to predict when a patient with a non-definitive core needle biopsy diagnosis need not undergo an excisional biopsy procedure because the risk of malignancy is low.
- Published
- 2013
- Full Text
- View/download PDF
47. Logical Differential Prediction Bayes Net, improving breast cancer diagnosis for older women.
- Author
-
Nassif H, Wu Y, Page D, and Burnside E
- Subjects
- Aged, Bayes Theorem, Breast Neoplasms diagnostic imaging, Carcinoma, Intraductal, Noninfiltrating diagnostic imaging, Diagnostic Errors prevention & control, Female, Humans, Logic, Mammography, Algorithms, Artificial Intelligence, Breast Neoplasms diagnosis, Carcinoma, Intraductal, Noninfiltrating diagnosis
- Abstract
Overdiagnosis is a phenomenon in which screening identities cancer which may not go on to cause symptoms or death. Women over 65 who develop breast cancer bear the heaviest burden of overdiagnosis. This work introduces novel machine learning algorithms to improve diagnostic accuracy of breast cancer in aging populations. At the same time, we aim at minimizing unnecessary invasive procedures (thus decreasing false positives) and concomitantly addressing overdiagnosis. We develop a novel algorithm. Logical Differential Prediction Bayes Net (LDP-BN), that calculates the risk of breast disease based on mammography findings. LDP-BN uses Inductive Logic Programming (ILP) to learn relational rules, selects older-specific differentially predictive rules, and incorporates them into a Bayes Net, significantly improving its performance. In addition, LDP-BN offers valuable insight into the classification process, revealing novel older-specific rules that link mass presence to invasive, and calcification presence and lack of detectable mass to DCIS.
- Published
- 2012
48. Extracting BI-RADS Features from Portuguese Clinical Texts.
- Author
-
Nassif H, Cunha F, Moreira IC, Cruz-Correia R, Sousa E, Page D, Burnside E, and Dutra I
- Abstract
In this work we build the first BI-RADS parser for Portuguese free texts, modeled after existing approaches to extract BI-RADS features from English medical records. Our concept finder uses a semantic grammar based on the BIRADS lexicon and on iterative transferred expert knowledge. We compare the performance of our algorithm to manual annotation by a specialist in mammography. Our results show that our parser's performance is comparable to the manual method.
- Published
- 2012
- Full Text
- View/download PDF
49. Information Extraction for Clinical Data Mining: A Mammography Case Study.
- Author
-
Nassif H, Woods R, Burnside E, Ayvaci M, Shavlik J, and Page D
- Abstract
Breast cancer is the leading cause of cancer mortality in women between the ages of 15 and 54. During mammography screening, radiologists use a strict lexicon (BI-RADS) to describe and report their findings. Mammography records are then stored in a well-defined database format (NMD). Lately, researchers have applied data mining and machine learning techniques to these databases. They successfully built breast cancer classifiers that can help in early detection of malignancy. However, the validity of these models depends on the quality of the underlying databases. Unfortunately, most databases suffer from inconsistencies, missing data, inter-observer variability and inappropriate term usage. In addition, many databases are not compliant with the NMD format and/or solely consist of text reports. BI-RADS feature extraction from free text and consistency checks between recorded predictive variables and text reports are crucial to addressing this problem. We describe a general scheme for concept information retrieval from free text given a lexicon, and present a BI-RADS features extraction algorithm for clinical data mining. It consists of a syntax analyzer, a concept finder and a negation detector. The syntax analyzer preprocesses the input into individual sentences. The concept finder uses a semantic grammar based on the BI-RADS lexicon and the experts' input. It parses sentences detecting BI-RADS concepts. Once a concept is located, a lexical scanner checks for negation. Our method can handle multiple latent concepts within the text, filtering out ultrasound concepts. On our dataset, our algorithm achieves 97.7% precision, 95.5% recall and an F
1 -score of 0.97. It outperforms manual feature extraction at the 5% statistical significance level.- Published
- 2009
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.