In this paper, we propose a new multimedia data mining framework for the extraction of soccer goal events in soccer videos by utilising both multimodal analysis and decision tree logic. The extracted events can be used to index the soccer videos. We first adopt an advanced video shot detection method to produce shot boundaries and some important visual features. Then, the visual/audio features are extracted for each shot at different granularities. This rich multimodal feature set is then filtered by a pre-filtering step in order to clean the noise as well as to reduce the irrelevant data. A decision tree model is built upon the cleaned data set and is used to classify the goal shots. We also present the experimental results for the proposed framework, which indicate the performance of the framework for soccer goal extraction. Copyright © 2006 Inderscience Enterprises Ltd.