Authors:
            
                    Dmitri Dolgov
                    
                        
                    
                     and
                
                    Ken Laberteaux
                    
                        
                    
                    
                
        
        
            Affiliation:
            
                    
                        
                    
                    Toyota Technical Center, USA, Inc., United States
                
        
        
        
        
        
             Keyword(s):
            Decision support systems, Vehicle control applications, Optimization algorithms.
        
        
            
                Related
                    Ontology
                    Subjects/Areas/Topics:
                
                        Decision Support Systems
                    ; 
                        Informatics in Control, Automation and Robotics
                    ; 
                        Intelligent Control Systems and Optimization
                    ; 
                        Optimization Algorithms
                    ; 
                        Robotics and Automation
                    ; 
                        Vehicle Control Applications
                    
            
        
        
            
                Abstract: 
                The key components of an intelligent vehicular collision-avoidance system are sensing, evaluation, and decision making. We focus on the latter task of finding (approximately) optimal collision-avoidance control policies, a problem naturally modeled as a Markov decision process. However, standard MDP models scale exponentially with the number of state features, rendering them inept for large-scale domains. To address this, factored MDP representations and approximation methods have been proposed. We approximate collisionavoidance factored MDP using a composite approximate linear programming approach that symmetrically approximates objective functions and feasible regions of the LP. We show empirically that, combined with a novel basis-selection method, this produces high-quality approximations at very low computational cost.