Data Mining in Bioinformatics [Wang, Zaki, Toivonen & Shasha 2004-09-17].pdf
Advanced Information and Knowledge Processing
Also in this series
Gregoris Mentzas, Dimitris Apostolou, Andreas Abecker and Ron Young
Knowledge Asset Management
1-85233-583-1
Michalis Vazirgiannis, Maria Halkidi and Dimitrios Gunopulos
Uncertainty Handling and Quality Assessment in Data Mining
1-85233-655-2
Asunción Gómez-Pérez, Mariano Fernández-López and Oscar Corcho
Ontological Engineering
1-85233-551-3
Arno Scharl (Ed.)
Environmental Online Communication
1-85233-783-4
Shichao Zhang, Chengqi Zhang and Xindong Wu
Knowledge Discovery in Multiple Databases
1-85233-703-6
Jason T.L. Wang, Mohammed J. Zaki,
Hannu T.T. Toivonen and Dennis Shasha
(Eds)
Data Mining in
Bioinformatics
With 110 Figures
Jason T.L. Wang, PhD
New Jersey Institute of Technology, USA
Mohammed J. Zaki, PhD
Computer Science Department, Rensselaer Polytechnic Institute, USA
Hannu T.T. Toivonen, PhD
University of Helsinki and Nokia Research Center
Dennis Shasha, PhD
New York University, USA
Series Editors
Xindong Wu
Lakhmi Jain
British Library Cataloguing in Publication Data
Data mining in bioinformatics. — (Advanced information and
knowledge processing)
1. Data mining 2. Bioinformatics — Data processing
I. Wang, Jason T. L.
006.3′12
ISBN 1852336714
Library of Congress Cataloging-in-Publication Data
A catalogue record for this book is available from the American Library of Congress.
Apart from any fair dealing for the purposes of research or private study, or criticism or review, as
permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced,
stored or transmitted, in any form or by any means, with the prior permission in writing of
the publishers, or in the case of reprographic reproduction in accordance with the terms of licences
issued by the Copyright Licensing Agency. Enquiries concerning reproduction outside those terms
should be sent to the publishers.
AI&KP ISSN 1610-3947
ISBN 1-85233-671-4 Springer London Berlin Heidelberg
Springer Science+Business Media
springeronline.com
© Springer-Verlag London Limited 2005
The use of registered names, trademarks, etc. in this publication does not imply, even in the absence
of a specific statement, that such names are exempt from the relevant laws and regulations and
therefore free for general use.
The publisher makes no representation, express or implied, with regard to the accuracy of the information
contained in this book and cannot accept any legal responsibility or liability for any errors
or omissions that may be made.
Typesetting: Electronic text files prepared by authors
Printed and bound in the United States of America
34/3830-543210 Printed on acid-free paper SPIN 10886107
Contents
Contributors
Part I. Overview
1. Introduction to Data Mining in Bioinformatics
1.1 Background
1.2 Organization of the Book
1.3 Support on the Web
2. Survey of Biodata Analysis from a Data Mining Perspective
2.1 Introduction
2.2 Data Cleaning, Data Preprocessing, and Data Integration
2.3 Exploration of Data Mining Tools for Biodata Analysis
2.4 Discovery of Frequent Sequential and Structured Patterns
2.5 Classification Methods
2.6 Cluster Analysis Methods
2.7 Computational Modeling of Biological Networks
2.8 Data Visualization and Visual Data Mining
2.9 Emerging Frontiers
2.10 Conclusions
Part II. Sequence and Structure Alignment
3. AntiClustAl: Multiple Sequence Alignment by Antipole Clustering
3.1 Introduction
3.2 Related Work
3.3 Antipole Tree Data Structure for Clustering
3.4 AntiClustAl: Multiple Sequence Alignment via Antipoles
3.5 Comparing ClustalW and AntiClustAl
3.6 Case Study
3.7 Conclusions
3.8 Future Developments and Research Problems