

# SEMICONDUCTOR INTEGRATED CIRCUIT PROCESSING TECHNOLOGY

W. R. RUNYANRetired, formerly with Texas InstrumentsK. E. BEANTexas Instruments

INDUSTRIAL DESIGN CORPORATION LIBRARY 2020 S.W. 4th PORTLAND, OREGON 97201

Addison-Wesley Publishing Company

Reading, Massachusetts • Menlo Park, California • New York Don Mills, Ontario • Wokingham, England • Amsterdam • Bonn Sydney • Singapore • Tokyo • Madrid • San Juan

## This book is in the Addison-Wesley Series in Electrical and Computer Engineering

Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial caps or all caps.

This publication is designed to provide accurate and reliable information in regard to the subject matter covered. However, responsibility is assumed neither for its use nor for any infringement of patterns or rights of others which may result from its use.

Cover photos (top to bottom): Courtesy of Bell Laboratories; Courtesy of Texas Instruments; Courtesy of Intel Corporation (®Registered trademark of Fairchild Semiconductor Corporation); Courtesy of Gordon Teal; Courtesy of Texas Instruments Incorporated.

### Library of Congress Cataloging-in-Publication Data

Runyan, W. R. Semiconductor integrated circuit processing technology / Walter R. Runyan, Kenneth E. Bean. p. cm. Bibliography: p. Includes index. ISBN 0-201-10831-3 1. Integrated circuits-Design and construction. 2. Semiconductors-Design and construction. 1. Bean, Kenneth E. II. Title. TK7874.R86 1990 89-320 621.381'73-dc19 CIP

Copyright © 1990 by Addison-Wesley Publishing Company, Inc.

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America.

ABCDEFGHIJ-DO-89

## Foreword

Forty years ago the transistor was hailed as a model of the scientific process of invention. Although Shockley's work provided a basic understanding of junction transistor structure, little was really known about the fabrication processes used for semiconductors. Terms such as "deathnium" and "forming" were used to describe effects that were not understood. Most of the techniques used were crude adaptations of technology developed for other purposes, and much of the learning was empirical.

Today, after many thousands of man-years of effort, the situation has changed. New processes have been developed specifically for semiconductor use, and others have been carefully optimized. By the standards of the 1950s, all are now well understood.

The authors of this book, Walt Runyan and Ken Bean, have been key contributors to this new understanding. Both have been actively involved with semiconductors since the beginning; both, in the development of new processes and the application of processes to high-volume production.

Today, semiconductor manufacture is truly a science based industry. This book provides an excellent summary of the state-of-theart in modern semiconductor processing, both in terms of the scientific knowledge of the field and in its practical application to devices.

J. S. Kilby

iii

Elm Exhibit 2159, Page 5

## Preface

The field of semiconductor process engineering is now so broad that it is virtually impossible for a single individual to provide process engineering support or process development for the complete semiconductor manufacturing operation. Thus, subdivision and specialization are required. However, since each subdivision is intricately related to all others, it is absolutely necessary that each technology specialist (engineer) have a good understanding of all phases of the manufacturing process. This book is intended to provide a unified treatment of that portion of the semiconductor manufacturing process commonly referred to as "wafer fabrication."

The book is to be used as a text by university students and as a reference by practicing engineers. It provides background, theory, and practical discussions related to the wafer fabrication process. It is not, however, a compendium of all of the latest research activities, which are constantly changing and continually covered in various technical journals and review book series. The serious student, whether in industry or university, must keep up-to-date by following the current technical journals most closely allied with his or her chosen specialty.

The level of presentation in the book assumes that the student has had introductory physics, introductory inorganic chemistry, mathematics through differential equations, a first course in transistor device physics, and a course covering electronic material properties including crystal structure. All of these courses are generally available by the senior year so the text is suitable for either senioror graduate-level courses.

The book is organized into eleven chapters and three appendices. All chapters have an extensive list of references, and all but the first two chapters have problems and a list of key ideas. Since silicon remains the semiconductor used for 99 percent of the integrated circuits, it is naturally emphasized. However, where GaAs processing differs substantially from silicon, it is discussed separately.

The first chapter is an historical discussion of the development of the integrated circuit. The second chapter gives an overview of the processing steps required to manufacture an integrated circuit. Chapters 3 through 10 discuss the various technologies required for the steps, and each chapter has a section on safety. The chapter subjects are, respectively, silicon thermal oxidation, deposition of thin inorganic films, photolithography, etching, epitaxial

Elm Exhibit 2159, Page 6

۷

VI PREFACE

growth, solid state diffusion, ion implantation, and contacts and interconnects.

Chapter 11 is devoted to a discussion of yields, yield analysis, and yield economics. The latter is included because all too often the student (and in some cases, practicing engineers as well) lose sight of the fact that to be practical for industrial use, semiconductor processes must be economical as well as scientifically feasible. Emphasis is on defect density measurement and interpretation, on the use of I–V plots to determine the cause of faulty integrated circuit elements, and on how yield and an economical manufacturing operation are related.

The first of three appendices is a review of crystallography as it relates to silicon and gallium arsenide. The second is a review of phase diagrams, and the third is a compilation of numerical constants and conversions. Finally, an extensive list of semiconductorrelated acronyms is given inside the front and back covers of the book.

Acknowledgments: Over the course of time spent in preparing this book, many people have been very helpful, and in particular, we would like to acknowledge Hettie Smith, who typed the manuscript and took care of many of the details involved in its preparation, as well as Marion Johnson and John Powell, who provided many of the photographs. We also want to acknowledge the following people who read and commented on various portions of the manuscript: Neal Akridge. ATEQ Corp.: Frederick Strieter. Honeywell Corp.: Bruce Deal, Advantage Production Technology, Inc.; Howard Huff, Sematech: George Brown. Dwayne Carter, Rinn Cleavelin, Monte Douglas, P. B. Ghate, John Hatcher, William Keenan, Charlotte Tipton, Michael Tipton, Rick Wise, and Richard Yeakley, all of Texas Instruments Incorporated; and Isaac Trachtenberg, Chemical Engineering Department at the University of Texas. In addition, we want to thank the following people for their helpful reviews of the manuscript: D. K. Schroder, Arizona State University; John D. Shott, Stanford University: Dean P. Neikirk, University of Texas, Austin; Arthur B. Glaser, AT&T Bell Laboratories; and Richard C. Jaeger, Auburn University.

We would also like to thank our editor. Tom Robbins, and his staff for guidance during the various stages of manuscript preparation. Lastly, we wish to express our appreciation for the patience, encouragement, and indulgence of our wives. Delma Runyan and Helen Bean.

> W. R. Runyan K. E. Bean

# Contents in Brief

i

| CHAPTER 1        | Historical Overview 1                                    |
|------------------|----------------------------------------------------------|
| CHAPTER 2        | Processing Overview 22                                   |
| CHAPTER 3        | Thermal Oxidation of Silicon 53                          |
| <b>CHAPTER 4</b> | Definition of Inorganic Thin Films 121                   |
| CHAPTER 5        | Lithography 161                                          |
| CHAPTER 6        | Etching 242                                              |
| CHAPTER 7        | Epitaxy 294                                              |
| CHAPTER 8        | Impurity Diffusion 371                                   |
| CHAPTER 9        | Ion Implantation 476                                     |
| CHAPTER 10       | Ohmic Contacts, Schottky Barriers, and Interconnects 518 |
| CHAPTER 11       | Yields and Yield Analysis 579                            |
| APPENDIX A       | Crystallography 645                                      |
| APPENDIX B       | Phase Diagrams 661                                       |
| APPENDIX C       | Reference Tables 670                                     |
|                  | Index 673                                                |
|                  |                                                          |

vii

Elm Exhibit 2159, Page 9

# **Detailed Contents**

### **CHAPTER 1**

#### **Historical Overview** 1

- 1.1 Background
- 1.2 The Transistor 2
- 1.3 The Integrated Circuit 8
- 1.4 Semiconductor Processes 17 References 20

1

#### **Processing Overview** CHAPTER 2 22

- 2.1 Introduction
- 2.2 Wafer Fabrication Facility 31
- 2.3 Wafer Fabrication Operations 33
  - 2.3.1 Surface Cleaning 34
  - 2.3.2 Epitaxy 34
  - 2.3.3 Thermal Oxidation of Silicon 35

22

- 2.3.4 Diffusion 35
- 2.3.5 Ion Implantation 36
- 2.3.6 Lithography 36
- 2.3.7 Etching 38
- 2.3.8 Chemical Vapor Deposition (CVD) 38
- 2.3.9 Metal Deposition 38
- 2.3.10 Metal (Contact) Sintering 39
- 2.3.11 Backgrinding 39 39
- 2.3.12 Multiprobing
- 2.4 Integrated Circuit Fabrication 40 2.4.1 Components and Component Construction 40 2.4.2 Steps in IC Fabrication 46

#### CHAPTER 3 **Thermal Oxidation of Silicon** 53

- 3.1 Introduction 53
- 3.2 Structure 55
- Nonelectrical Properties 3.3 57
  - 3.3.1 Density 57
    - 3.3.2 Expansion Coefficient 59
    - 3.3.3 Optical Properties 60
    - 3.3.4 Chemical Properties 61
- 3.4 Electrical Properties 63
  - 3.4.1 Bulk Electrical Conduction 63
  - 3.4.2 Surface Electrical Conductivity 65
  - 3.4.3 Dielectric Breakdown Strength 67

ix

### x DETAILED CONTENTS

- 3.4.4 Mobile Ion Transport 69
- 3.4.5 Interface Charges 72
- 3.5 Kinetics of Oxidation 76
  - 3.5.1 Deal-Grove Model
  - 3.5.2 Initial Stage of Oxidation 79
  - 3.5.3 Reaction Path 81
  - 3.5.4 Determination of Rate Constants 81
  - 3.5.5 Calculation of Oxide Thickness 83
  - 3.5.6 Temperature Behavior of Rate Constants 83
  - 3.5.7 Effect of Crystal Orientation on Rate 84
  - 3.5.8 Polycrystalline Silicon Oxidation 88
  - 3.5.9 Effect of Pressure on Oxidation 90
  - 3.5.10 Effect of Ambient Impurities on Oxidation Rate 90

77

- 3.5.11 Effect of Silicon Doping on Oxidation Rate 91
- 3.5.12 Impurity Profile at the Oxide Interface 96
- 3.6 Oxide Thickness Charts 98
- 3.7 Preparation of Thermal Oxide 99
  - 3.7.1 Preoxidation Surface Cleanup 99
    - 3.7.2 Atmospheric Pressure Equipment 104
    - 3.7.3 High-Pressure Oxidation Processing 107
    - 3.7.4 High-Pressure Equipment 107
    - 3.7.5 Selective Oxidation 108
    - 3.7.6 Two- and Three-Dimensional Oxidation 110
    - 3.7.7 Oxide Physical Defects
    - 3.7.8 Defects Generated in Silicon During Oxidation 112

111

- Key Ideas 113
- Problems 113
- References 114

### CHAPTER 4 Deposition of Inorganic Thin Films 121

- 4.1 Introduction 121
- 4.2 Sputter Deposition 124
- 4.3 Evaporation 126
- 4.4 Ion Beam Deposition 128
- 4.5 Chemical Vapor Deposition 129
  - 4.5.1 Atmospheric Pressure and Low-Pressure Depositions 130

137

- 4.5.2 Plasma-Enhanced CVD
  - 4.5.3 Photon-Enhanced CVD 139
- 4.6 Deposition of Specific Materials 140
  - 4.6.1 CVD Silicon Dioxide 140
  - 4.6.2 Plasma Silicon Dioxide 142
  - 4.6.3 Doped Silicon Oxide 143
  - 4.6.4 CVD Silicon Nitride 145
  - 4.6.5 Plasma Silicon Nitride 147
  - 4.6.6 APCVD and LPCVD Polysilicon 148
  - 4.6.7 Plasma Polysilicon 151

### DETAILED CONTENTS xi

- 4.7 Other CVD Film Materials 151
- 4.8 Spin-On Glasses 152
- 4.9 Glass Fritting 153 Safety 154 Key Ideas 156 Problems 156 References 157

### CHAPTER 5 Lithography 161

- 5.1Introduction1615.2Photomasks161
- 5.2.1 Masks for Optical Printing1615.2.2 X-Ray Masks164
  - 5.2.3 Mask Protection 165
- 5.3 Mask Defects 166 5.3.1 Mask Inspection 167
  - 5.3.2 Mask–Wafer Overlay Accuracy 169
  - 5.3.3 Overlay Accuracy Measurements 171
- 5.4 Photoresists 172
  - 5.4.1 UV Negative Resist Formulation 175
  - 5.4.2 UV Positive Resist Formulation 175
  - 5.4.3 E-Beam, X-Ray, and Ion Beam Resists 176
  - 5.4.4 Inorganic Resists 177
  - 5.4.5 Plasma-Developable Resists 177
  - 5.4.6 Resist Contrast 178
  - 5.4.7 Sensitivity 182
  - 5.4.8 Reciprocity 183
  - 5.4.9 Etch Resistance 184
- 5.5 Resist Processing Flow 184
  - 5.5.1 Adhesion Promoters and Adhesion 184
  - 5.5.2 Resist Application 186
  - 5.5.3 Soft-, Postexpose-, and Hard-Bake 189
  - 5.5.4 Developing 189
  - 5.5.5 Resist Removal 189
- 5.6 Light Sources 190
  - 5.6.1 Arc Sources 190
    - 5.6.2 Excimer Lasers 192
- 5.7 Contact Printing 192
- 5.8 Optical Proximity Printing 193
- 5.9 Fundamental Optical Limitations of Projection Printing 195
  - 5.9.1 Resolution 195
  - 5.9.2 Depth of Focus 198
  - 5.9.3 Variation in Position of Best Focus 198
- 5.10 Full-Wafer Projection Printing 200 5.10.1 Operation 201

### xii DETAILED CONTENTS

5.10.2 Focusing Accuracy 202 5.10.3 Registration 202 Stepper Printing 204 5.11 5.11.1 Focus 204 5.11.2 Alignment 206 5.11.3 Registration 207 5.11.4 Throughput 208 5.12 Electron Beam Printing 210 5.12.1 Pattern Generation 210 5.12.2 Resolution 210 5.12.3 Alignment 211 5.13 X-Ray Printing 211 5.13.1 Image Distortion 212 5.13.2 Resolution 212 5.13.3 X-Ray Sources 214 5.13.4 X-Ray Steppers 214 5.14 Ion Beam Printing 215 5.14.1 Focused Ion Beams 215 5.14.2 Masked Ion Beam Printing 215 5.15 Printer Selection 215 5.16 Line-Width Control 216 5.16.1 Process-Initiated Control Problems 217 5.16.2 Interference Effects 224 5.16.3 Minimizing Effect of Standing Waves 227 5.16.4 Resist over Steps 228 5.16.5 Multilayer Resist 228 5.17 Wafer Flatness 231 5.18 Lithography Process Modeling 234 Safety 235 236 Key Ideas Problems 237 References 237

### CHAPTER 6 Etching 242

- 6.1 Introduction 242
- 6.2 Wet Etching 245
  - 6.2.1 Etch Formulation 247
    - 6.2.2 Nonselective Silicon Etching 249
    - 6.2.3 Polycrystalline Silicon Etching 251
    - 6.2.4 Anisotropic Silicon Etching 252
    - 6.2.5 Resistivity-Sensitive Silicon Etching 256
  - 6.2.6 Electrolytic Silicon Etching 258
  - 6.2.7 Silicon Staining 259
  - 6.2.8 Gallium Arsenide Etching (General) 259
  - 6.2.9 Gallium Arsenide Etching (Anisotropic) 261
  - 6.2.10 Preferential Etches 263

#### DETAILED CONTENTS xiii

283

6.2.11 Silicon Dioxide Etching 264 6.2.12 Silicon Nitride Etching 267 6.2.13 Metal Etching 268 6.3 Plasma Etching 269 6.3.1 Etching Reactions 273 6.3.2 Plasma Etch Rates 278 6.3.3 Selectivity 280 6.3.4 End-Point Detection 282 6.3.5 Effect of Loading on Etch Rate 284 6.3.6 Effect of Electrode Material 6.3.7 Plasma-Induced Damage 284 6.3.8 Plasma Etching Equipment 285 6.4 Vapor Phase Etching 287 Safety 287 Key Ideas 288 Problems 289 References 289

#### **CHAPTER 7** Epitaxy

- 7.1 Introduction 294
- Vapor Phase Epitaxy 297 7.2
  - 7.2.1 Nucleation 297
    - 7.2.2 Surface Reactions 306

294

- 7.2.3 Material Transport 308
- 7.3 Vapor Phase Silicon Epitaxy 309
  - 7.3.1 Surface Cleaning and Vapor Phase Etching 309
    - 7.3.2 Deposition Reactions 313
    - 7.3.3 Deposition Rates 315
    - 321 7.3.4 Effect of Orientation on Deposition Rate
    - 7.3.5 Layer Doping 323
    - 7.3.6 Impurity Redistribution during Epitaxy 324
    - 7.3.7 Pattern Shift and Distortion 328
    - 7.3.8 Selective Epitaxial Growth (SEG) 332
    - 7.3.9 Equipment for Silicon Vapor Phase Epitaxy 338
    - 7.3.10 Molecular Beam Epitaxy (MBE) 342
    - 7.3.11 Defects Introduced during Growth 344
- 7.4 Vapor Phase Gallium Arsenide Epitaxy 349 7.4.1 Halide Transport 349 7.4.2 Metalorganic Source CVD (MOCVD) 350
  - 7.4.3 GaAs Molecular Beam Epitaxy 351
  - 352
- 7.5 Liquid Phase Epitaxy 7.5.1 Silicon LPE 353 7.5.2 GaAs LPE 355
- 7.6 Epitaxial Crystal Growth from a Solid 356
- Crystal Growth over Insulating Substrates 357 7.7 Safety 360

### xiv DETAILED CONTENTS

| Key Ideas  | 360 |
|------------|-----|
| Problems   | 361 |
| References | 362 |

### **CHAPTER 8**

### Impurity Diffusion 371

8.1 Introduction 371

- 8.2 Atomic Diffusion Mechanisms 374
  - 8.2.1 Diffusion by Vacancies 378
  - 8.2.2 Diffusion by Interstitialcy 379
  - 8.2.3 Interstitial Diffusion 380
  - 8.2.4 Electric-Field-Aided Diffusion
  - 8.2.5 Effect of Thermal Oxidation on the Diffusion Coefficient 383

381

- 8.2.6 Effect of Diffusing Direction (Orientation) 384
- 8.2.7 Effect of Heavy Doping 385
- 8.2.8 Effect of Temperature 386
- 8.2.9 Effect of Stress/Strain 386
- 8.2.10 Emitter Push, Pull, and Dip 388
- 8.3 Solutions to the Diffusion Equations 388
  - 8.3.1 Diffusion from Infinite Source on Surface 390
  - 8.3.2 Diffusion from Limited Source on Surface 392
  - 8.3.3 Diffusion from Interior Limited Source 394
  - 8.3.4 Diffusion from Layer of Finite Thickness 395
  - 8.3.5 Diffusion from Concentration Step 397
  - 8.3.6 Diffusion from Concentration Step into Moving Layer 398
  - 8.3.7 Out-Diffusion with Rate Limiting at Surface 401
  - 8.3.8 Diffusion from a Fixed Concentration into Moving Layer 401
  - 8.3.9 Diffusion through Thin Layer 402
  - 8.3.10 Diffusion during Thermal Oxidation 404
  - 8.3.11 Diffusion from Infinite Source into Finite Thickness 406
  - 8.3.12 Two-Dimensional Diffusion from High-Diffusivity Path 407
  - 8.3.13 Two-Dimensional Solutions 409
  - 8.3.14 Effect of Temperature Varying with Time 411
- 8.4 Diffusion Profile Calculations 412
  - 8.4.1 Phosphorus Profile 414
    - 8.4.2 Silicon Interstitial Diffusants 414
    - 8.4.3 Gallium Arsenide Profile 416
    - 8.4.4 Solid Solubility Data 416
- 8.5 Diffusion Characterization 418
  - 8.5.1 Conversion from Dopant Concentration to Resistivity 418
    - 8.5.2 Sheet Resistance Measurement 419
    - 8.5.3 Determination of  $N_0$  420
    - 8.5.4 Junction Depth Measurement 421
    - 8.5.5 Profile Measurement 422
  - 8.5.6 Determination of Diffusion Coefficient 424
- 8.6 Diffusion Processes 428
  - 8.6.1 Depth 428

#### DETAILED CONTENTS XV

442

8.6.2 Surface Concentration 428 8.6.3 Lifetime Control 429 8.6.4 Gettering of Silicon 431 8.7 Diffusion-Induced Defects 440 8.7.1 Diffusion Flaws and Pipes 441 8.7.2 Lifetime-Reducing Impurities and Precipitates 8.7.3 Strain-Induced Dislocations 443 8.7.4 Permanent Wafer Warpage 446 8.8 Diffusion Equipment 451 8.8.1 Gas Flow Measurement and Control 452 8.8.2 Diffusion Tubes 453 8.8.3 Wafer Diffusion Boats 453 8.8.4 Wafer Insertion 454 8.8.5 Computer Control of Tube Functions 454 8.9 **Diffusion Sources** 456 8.9.1 Liquid Sources 458 8.9.2 Gaseous Sources 459 8.9.3 Planar Sources 460 8.9.4 Doped Oxide Sources 461 461 8.9.5 Closed-Tube Sources The Error Function 462 8.10 8.10.1 Error Function Algebra 462 462 8.10.2 Calculation of Error Function Values Safety 465 Key Ideas 466 Problems 467 References 468

#### **CHAPTER 9** Ion Implantation 476

9.1 Introduction 476

- Implant Depth (Range) 478 9.2
  - 9.2.1 Range Calculations 478
    - 9.2.2 Range Dispersion 481
    - 9.2.3 Lateral Range Dispersion 486
  - 9.2.4 Channeling 487
  - 9.2.5 Implanting through Layers 491
- 9.3 Implant-Induced Defects 494 9.3.1 Lattice Damage 495
  - 9.3.2 Annealing 497
- 9.4 Applications 499
  - 9.4.1 MOS Threshold Adjust 501
  - 9.4.2 Self-Aligned MOS Gate 502
  - 9.4.3 Polysilicon Doping 503
  - 9.4.4 Alternative to Diffusion Predep 504
  - 9.4.5 Ion Beam Mixing and Ion Beam Damage 504
  - 9.4.6 Formation of Buried Layers 505

### xvi DETAILED CONTENTS

9.5 Ion Implant Equipment 505 9.5.1 Ion Sources 507 509 9.5.2 Ion Analyzer 9.5.3 Wafer Handling 510 9.5.4 Wafer Contamination 511 512 9.5.5 Implant Uniformity Measurements 513 Safety 514 Key Ideas 514 Problems References 515

### CHAPTER 10 Ohmic Contacts, Schottky Barriers, and Interconnects 518

| 10.1  | Introduction 518                                   |
|-------|----------------------------------------------------|
| 10.2  | Ohmic Contacts 520                                 |
| 17.10 | 10.2.1 Spreading Resistance 520                    |
|       | 10.2.2 Contact Resistance 522                      |
|       | 10.2.3 Total Contact Resistance 527                |
|       | 10.2.4 The Aluminum–Silicon Ohmic Contact 528      |
|       | 10.2.5 The Silicide–Silicon Ohmic Contact 535      |
|       | 10.2.6 Alloyed GaAs Ohmic Contacts 538             |
|       | 10.2.7 Nonalloyed GaAs Ohmic Contacts 538          |
|       | 10.2.8 Measurement of Contact Resistance 539       |
| 10.3  | Schottky Diodes 541                                |
|       | 10.3.1 Rectifying Schottky Barriers 541            |
|       | 10.3.2 Choice of Metal for Schottky Barriers 544   |
| 10.4  | Leads (Interconnects) 544                          |
|       | 10.4.1 Lead Resistance 545                         |
|       | 10.4.2 Adhesion to SiO <sub>2</sub> 549            |
|       | 10.4.3 Diffusion Barrier 550                       |
|       | 10.4.4 Interconnect Material Contamination 552     |
|       | 10.4.5 Step Coverage 552                           |
|       | 10.4.6 Protective Overcoats 555                    |
| 10.5  | MOS Gate Materials 556                             |
| 10.6  | Multilevel Metallization 558                       |
|       | 10.6.1 Interlevel Shorts 558                       |
|       | 10.6.2 Planarization 559                           |
| 10.7  | Interconnect and Gate Metal Pattern Definition 560 |
| 10.8  | Failure Modes 560                                  |
|       | 10.8.1 Electromigration 561                        |
|       | 10.8.2 Corrosion 564                               |
|       | 10.8.3 Internal Stresses 565                       |
| 10.9  | Deposition Methods 565                             |
|       | 10.9.1 Surface Cleanup 566                         |
|       | 10.9.2 Metal Evaporation 566                       |
|       | 10.9.3 Sputtering 567                              |
|       | 10.9.4 Chemical Vapor Deposition 568               |
|       |                                                    |

### DETAILED CONTENTS xvii

10.10 Process Control 570 570 10.10.1 Resistivity Ratio 10.10.2 Current Capacity 571 10.10.3 Lead Adherence 571 10.10.4 Process Problems Requiring Test Patterns for Detection 572 Safety 572 Key Ideas 572 573 Problems 574 References

### CHAPTER 11 Yields and Yield Analysis 579

- 11.1 Introduction 579
- 11.2 Yield Definitions 579
- 11.3Methods of Yield Measurement58111.3.1The Curve Tracer58211.3.2Automatic Multiprobe Testing583
- 11.4 Yield Tracking 585
   11.4.1 Process and Multiprobe Yield Tracking 585
   11.4.2 Split Lot Testing 585
- 11.5 Defect Density 588
  - 11.5.1 Theory 590
    - 11.5.2 Defect Density Determination 596
- 11.6 Chip Yield Predictions Based on Defect Density 599 11.6.1 New Devices 599
  - 11.6.2 New IC Designs with New Processes 601
  - 11.6.3 Devices in Production 602
  - 11.6.4 Work in Progress
  - 11.6.5 When Yield is Low 603
- 11.7 Determining Causes of Chip Yield Loss 604
- 11.8 Junction Testing 605
  - 11.8.1 Reverse-Biased pn Junctions 606
  - 11.8.2 Reverse Current Curve Tracer Diagnostics 611

603

- 11.8.3 Reverse-Biased Schottky Junctions 616
  - 11.8.4 Forward-Biased pn Junctions 617
- 11.9 MOS Transistor Testing 621
  - 11.9.1 Threshold Voltage 621
  - 11.9.2 Subthreshold Leakage Current 624
  - 11.9.3 Pinchoff Voltage 624
  - 11.9.4 Conduction Factors k and k' 624
  - 11.9.5 Channel Mobility 626
  - 11.9.6 Drain Breakdown Voltage 626
- 11.10 Bipolar Transistor Diagnostics 626
  - 11.10.1 Current Gain 627
    - 11.10.2 Reverse Breakdown Voltages 630
  - 11.10.3 Reverse and Forward Currents 633

### xviii DETAILED CONTENTS

Yield Economics 634 11.11 11.11.1 Silicon Usage 635 11.11.2 Wafer Cost 636 11.11.3 Wafer Diameter 637 11.11.4 Effect of Chip Size on Yield and Cost 638 11.11.5 Revenue from Wafers 639 Factors Affecting Yield and Yield Improvement 640 11.12 Key Ideas 642 Problems 642 References 643

### APPENDIX A Crystallography 645

- A.1 Silicon and Gallium Arsenide Structure 645
- A.2 Crystallographic Defects 654 References 659

### APPENDIX B Phase Diagrams 661

- B.1 Binary Phase Diagrams 661
  B.2 Ternary Phase Diagrams 667
  References 669
- APPENDIX C Reference Tables 670

Index 673



## **Historical Overview**

## 1.1 BACKGROUND

The semiconductor technology that ultimately led to the integrated circuit (IC) was over 75 years in the making.<sup>1</sup> Around 1875, it was observed that selenium exhibited rectification and photoconductivity. A silicon diode of sorts for detecting radio waves was described in 1906 (1). By 1935, selenium rectifiers and photodetectors, silicon carbide varistors, and galena (naturally occurring lead sulfide) and silicon point contact diodes for radio detection were on the market (2). Also in 1935, a British patent was issued, but was never reduced to practice, for a thin-film field effect transistor (3).

The development of radar just prior to and during World War II placed a great deal of emphasis on the study of silicon (Si) and germanium (Ge) since they were singled out as the most appropriate materials for the mixer and detector diodes required in the radar detection circuitry. Because of this activity, commercial sources of high-purity silicon and germanium were developed. Professor Lark-Horovitz of Purdue University prevailed on Eagle-Pitcher Mining and Smelting Company to develop a high-purity germanium supply, which they did, based on extraction from the lead and zinc ores mined in the tri-state area of Kansas, Missouri, and Oklahoma. (This was the only known significant source of germanium ore in the Western Hemisphere.) During the same period, Professor Seitz of the University of Pennsylvania initiated a program at DuPont to produce high-purity silicon (4).

The few events and names mentioned in this section should neither be construed as representing the total semiconductor activity in that period nor as even necessarily being the most significant. They were chosen to give some idea of progress as a function of time, and in fact, a large number of people, both in industry and in the universities, contributed to the development. To single out a few names and places may seem unfair, but to do otherwise would turn a processing technology text into a volume of semiconductor history. (For more definitive information, see reference 2 and the first 85 references contained therein.)

1.2 THE TRANSISTOR



William Shockley (seated at the microscope), John Bardeen (at left), and Walter Brattain, all of Bell Telephone Laboratories, are pictured here about the time of their discovery of the point contact transistor in 1947. This work resulted in their receiving the Nobel Prize for Physics in 1956. (Source: Photograph courtesy of AT&T Archives.)



John Bardeen, co-inventor of the point contact transistor, received a second Nobel Prize in physics in 1972 for his work on the theory of superconductivity. (Source Photograph courtesy of AT&T Archives.)

In December of 1947, the first transistor (a point contact structure) was constructed by John Bardeen and Walter Brattain of Bell Telephone Laboratories. For a discussion of the events leading up to that occasion and of the subsequent work that led to the junction transistor (and to the Nobel Prize in physics for Bardeen, Brattain, and Shockley), see Shockley's paper "The Path to the Conception of the Junction Transistor," *IEEE Trans. on Electron Dev. ED-23*.

The first point contact transistor used polycrystalline germanium for the semiconducting material. However, when the first paper describing the transistor was submitted for publication (June 25, 1948), the transistor effect had been demonstrated in silicon as well (5). By the end of 1949, single-crystalline rather than polycrystalline material was being used (2). This conversion from polycrystalline to single-crystal source material (spearheaded by Gordon Teal of Bell Telephone Laboratories) has been one of the most significant advances in semiconductor technology since the invention of the transistor itself. Without a source of large single crystals with uniform properties, the high-volume production of small devices would have been difficult and development of large-area integrated circuits impossible.

Efforts to build useful structures using polycrystalline or amorphous material thus far have had only spotty success. Amorphous silicon solar cells are now commercially feasible, and diodes utilized as fuses for programming are sometimes built into polycrystalline silicon integrated circuit leads. However, in all cases, single-crystal devices outperform comparable devices made in amorphous or polycrystalline material.

Although the point contact transistors were expensive and unreliable, they were, overall, superior in many respects to vacuum tubes.<sup>2</sup> Consequently, they went into production at Western Electric's Allentown plant in 1951 as replacements for vacuum tubes in some telephone exchange applications. However, by the time the transistor process was licensed to other manufacturers in 1952, William Shockley had invented the grown junction transistor, and Gordon Teal and Morgan Sparks had reduced it to practice (6). This transistor became the industry workhorse for several years. It differed from all subsequent transistor structures in that junction for-

<sup>&</sup>lt;sup>2</sup> Early guided missiles often used magnetic amplifiers instead of vacuum tubes in their control systems because of reliability considerations. The point contact transistor, which in retrospect appears to have been a very fragile and unreliable structure, was in fact good enough to replace the magnetic amplifiers and greatly improve the frequency response of the control systems.

### 1.2 THE TRANSISTOR 3

### FIGURE 1.1

Construction details of a silicon grown junction transistor. (Germanium transistors were pnp and used gold wire for the base contact.)



Walter Brattain was a co-inventor of the point contact transistor. (Source: Photograph courtesy of AT&T Archives.)

n-emitter region (heavily doped)



(a) Crystal cut in half for evaluation



(c) Section cut Into "bars" (size, varying with power rating of transistor, but typically 30× 30×150 mils)



(b) Section cut from mid-region of crystal that contains base layer and small section each of collector and emitter regions



(d) Base contact formed by fusing (alloying) aluminum wire to silicon so that wire made ohmic contact to base region and rectifying contacts to collector and emitter



William Shockley, co-inventor of the point contact transistor, was also the inventor of the junction transistor. (*Source:* Photograph courtesy of AT&T Archives.)

mation was completed during the crystal-growing operation. Fig. 1.1 shows the construction details of a silicon grown junction transistor. The remainder of the manufacturing steps were to affix leads, package, and test. Fig. 1.2 shows a transistor bar<sup>3</sup> mounted in a header. All that remained to be done here was to add a cover and final-test.

Even though both silicon and germanium were demonstrated at the beginning to be usable for transistors, germanium was much more tractable, and silicon transistors (desirable for their highertemperature capability) did not become commercially available until

<sup>&</sup>lt;sup>3</sup> As can be seen from Fig. 1.2, the bit of silicon used in each transistor really did look like a bar. For people and companies that worked extensively with grown junctions, the term *bar* became so deeply rooted that over 30 years later integrated circuits 1 cm on a side and 0.5 mm thick may still be called bars. Those whose experience started with alloyed or diffused units used the terms *chip* or *die*, which today are still common expressions.

### FIGURE 1.2

Grown junction transistor bar mounted in a header. (The collector and emitter connections support the bar; the fine wire is the base connection.)



1954 when grown junction silicon transistors were introduced by Texas Instruments Incorporated (7).

Alloyed junction germanium transistors were reported by Saby of General Electric in 1951 during the same meeting at which the grown junction transistor was first openly discussed (8). These transistors were made by alloying a dot of metal, usually indium, into each side of a chip of germanium, as shown in Fig. 1.3. A pn junction formed at each alloy–germanium interface and provided the collector–base and emitter–base junctions, respectively. Germanium alloyed transistors were very successful and became an alternative to the previously introduced germanium grown junction transistors. However, silicon alloyed transistors were difficult to construct and never commanded a very large market share.

The next important development was the use of gaseous diffusion at Bell Laboratories to make the necessary junctions from one side of a semiconductor slice (9). This activity gave rise to the diffused mesa transistors (10) that were commercially available in germanium in 1957 and in silicon in 1958. Fig. 1.4 shows the steps in their fabrication. There were actually several versions, only one of which is shown here. These mesa transistors had an advantage over either the grown junction or alloy transistors in that the diffusion process could give much narrower and more controlled base widths and thus allow higher-frequency operation. Since many transistors could be made at one time on each silicon or germanium slice, and since many slices could be cut from a single crystal, mesa transistors were less expensive to manufacture than grown junction transistors were. In the latter case, only one slice-or, if the crystal were rate grown (11), perhaps five or six—containing the grown junctions was available from each crystal. Diffused transistors were also less expensive than alloy transistors were since the alloy process required that each chip be processed individually rather than in slice form.



Gordon Teal pioneered the development of single-crystal growth of germanium and silicon and the use of high-purity single-crystal material for semiconductor applications. (*Source:* Photograph courtesy of Gordon K. Teal and AT&T Archives.)

### 1.2 THE TRANSISTOR 5





diffused in

Elm Exhibit 2159, Page 24

The point contact transistor went into production at Western Electric in 1951 and was licensed to other companies for manufacture. Although it was somewhat temperamental, this transistor was, in general, superior in performance to the vacuum tube. (*Source:* Photograph courtesy of AT&T Archives.)



Gordon Teal and Morgan Sparks made the first junction transistor, the construction of which eliminated many of the reliability problems of the point contact transistor. (*Source:* Photograph courtesy of AT & T Archives.)



### 1.2 THE TRANSISTOR 7





Jean Hoerni invented the planar process, which dramatically improved the reliability of silicon diodes and transistors. (Source: © Carolyn Caddes, 1986, from Portraits of Success.) Unfortunately, all three processes (grown junction, alloy, and mesa) continued to have one serious flaw, and that was the exposed junctions at the semiconductor surface. Various coatings were developed to minimize electrical leakage across those junctions, but they were never completely successful.

The silicon planar transistor, invented by Jean Hoerni of Fairchild Semiconductor, followed shortly thereafter (12) and was in production by 1961. As Fig. 1.5 shows, it very effectively solved the

| TA | RI | F | 11. | 1   |
|----|----|---|-----|-----|
| in | DL | - | 10  | ct: |

| Year | Germanium<br>Units | Silicon<br>Units |
|------|--------------------|------------------|
|      | 0.000              | onno             |
| 1954 | 1.3                | 0.02             |
| 1955 | 3.6                | 0.09             |
| 1956 | 12.4               | 0.42             |
| 1957 | 27.7               | 1.0              |
| 1958 | 45.0               | 2.1              |
| 1959 | 77.5               | 4.8              |
| 1960 | 119                | 8.8              |
| 1961 | 178                | 13.0             |
| 1962 | 214                | 26.6             |
| 1963 | 249                | 50.1             |
| 1964 | 289                | 117              |
| 1965 | 334                | 273              |
| 1966 | 369                | 481              |

Note: All units in millions.

Source: Electronic Industries Association Yearbook, 1967

1.3

THE INTEGRATED CIRCUIT junction protection problem for silicon by allowing the junctions to terminate under a silicon oxide thermally grown on the silicon surface. No other semiconductor has an oxide that can be grown in situ and that possesses the near ideal electrical characteristics of silicon dioxide. The introduction of the planar process, which increased reliability and improved yields, along with the inherently better hightemperature performance of silicon, marked the beginning of the decline of germanium devices. Thus, the four-year period between 1957 and 1961 included first the move to large-scale production with the introduction of the mesa process and then the beginning of the conversion from germanium to silicon with the advent of planar processing. Table 1.1 shows the growth of transistor production through those years, and while the yearly growth rate was no larger in those four years than in any other, it was because of those advances that the phenomenal rate could be sustained. This table also shows the dramatic manner in which silicon overtook germanium in the years following the introduction of the planar process.4 For a business view of the semiconductor world in this transition period between the times of the introduction of mesa and planar processing, see "Report on Semiconductors," Business Week, March 26, 1960.

Until the invention of the integrated circuit, complete circuits, whether they used transistors or vacuum tubes, were made by individually connecting the various components (tubes or transistors, diodes, capacitors, resistors, and inductors) together. Through the years, many improvements were introduced to increase packing density, simplify fabrication, and increase reliability of electronic circuits, but they all continued to depend on the wiring together of discrete and separately packaged devices. In an early effort to attack the problem from a fresh approach, G.W.A. Dummer of the Royal Radar Establishment of the United Kingdom suggested during the May 1952 Electronics Components Symposium in Washington, D.C., an integrated approach using a monolithic block comprising "layers of insulating, conducting, rectifying and amplifying mate-

Elm Exhibit 2159, Page 27

<sup>&</sup>lt;sup>6</sup> It should be appreciated that "production by" and "introduced in" dates are a bit difficult to pinpoint since these dates depend on the first time anyone made and reported such a device, the first time samples were shown to a potential customer, or the time at which reasonable quantities were available to customers. Similar problems arise in other areas, such as the year of introduction of larger crystal diameters.

### 1.3 THE INTEGRATED CIRCUIT 9



Jack Kilby is the inventor of the Solid Circuit® integrated circuit and a co-inventor of the pocket calculator. (Source: Photograph courtesy of Jack Kilby. ®Registered trademark of Texas Instruments Incorporated.)

rials, the electrical functions being connected directly by cutting out areas of the various layers." A metal model of how such a structure might be made was shown in 1957.

The figures included in a patent filed in May of 1953 by Harwick Johnson (U.S. Patent 2,816,228, issued December 10, 1957) bear a superficial resemblance to an integrated circuit. However, as expressed in the first sentence of the patent, both the discussion and claims relate only to a transistor phase-shift oscillator: "This invention pertains to semiconductor devices and particularly to semiconductor phase-shift oscillators and devices." Component isolation was not considered so that even if the concepts of the patent were extended to devices other than the phase-shift oscillator, the class of devices that could be made would be very limited.<sup>5</sup>

In February of 1959, Jack Kilby of Texas Instruments Incorporated filed a patent application describing a concept that allowed, using relatively simple steps, the fabrication of all of the necessary components of the desired circuit, both active and passive, in a single piece of semiconductor and their interconnection in situ. The patent states:

In contrast to the approaches to miniaturization that have been made in the past, the present invention has resulted from a new and totally different concept for miniaturization. . . In accordance with the principles of the invention, the ultimate in circuit miniaturization is attained using only one material for all circuit elements and a limited number of compatible processing steps for the production thereof. . . .

<sup>&</sup>lt;sup>3</sup> Considerable space is being devoted to a discussion of patents and subsequent patent litigation during the development of the integrated circuit. This discussion is provided because of the historical significance and to demonstrate several practical points of the patent world quite germane to the reader. One is that the fame and fortune of an inventor may depend on the precise wording of patent claims. Unfortunately, that wording is often considered to be the domain of the patent attorney and sometimes does not exactly reflect the inventor's intent. (See, for example, the discussion of the lack of a particularly specific figure in Kilby's patent.) Another is that in any new and developing technology, the key problems will be apparent to those active in the field, and they will be offering their individual solutions at about the same time. (Note the close grouping of the events of Table 1.2, which appears later in the chapter.) It is thus very important for the engineer or scientist to keep good records. Yet another point, perhaps not quite as obvious, is that the history of invention as depicted in the popular press, in the scientific literature, and in the issued patents may not be in agreement. To decide on the "true" inventor requires a careful study of all of the literature and also may depend on the particular definition of "inventor" that is chosen.

Up to this point, the goals are perhaps not much different from those expressed in 1952 by Dummer. However, to continue with the Kilby patent:

In a more specific conception of the invention, all components of an electric circuit are formed in or near one surface of a relatively thin semiconductor wafer characterized by a diffused p-n junction or junctions

It is a primary object of the invention to provide a miniaturized electronic circuit wherein the active and passive circuit components are integrated within a body of semiconductor material, the junctions of such components being near and/or extending to one face of the body, with components spaced or electrically separated from one another as necessary in the circuit . . . .

Figures 1–5a illustrate schematically various circuit components fabricated in accordance with the principles of the present invention in order that they may be integrated into, or as they constitute parts of, a single body of semiconductor material; . . .

The figures and text describe bulk resistors, diffused resistors, pn junction capacitors, MOS capacitors, transistors, and diodes. In the press coverage of the March 1959 announcement of the Kilby concept, this set of standard components was stressed (13). The patent text continues:

Because all of the circuit designs described above can be formed from a single material, a semiconductor, it is possible by physical and electrical shaping to integrate all of them into a single crystal semiconductor wafer containing a diffused p-n junction, or junctions, and to process the wafer to provide the proper circuit and the correct component values. . . .

### FIGURE 1.6

One of the first working germanium integrated circuits constructed by Jack Kilby in the summer of 1958. (Source: Courtesy of Texas Instruments Incorporated.)



### FIGURE 1.7

Advertisement for the first commercially available silicon integrated circuits appeared in the March 26, 1960 issue of *Business Week*: (Source: Courtesy of Texas Instruments Incorporated.)

### 1.3 THE INTEGRATED CIRCUIT 11

## AVAILABLE NOW FROM TI 100:1 miniaturization with SOLID CIRCUTT\*

semiconductor networks

Now-3 years ahead of industry's expectations- Only a Solid Circuit semiconductor networks from Texas instrution to ments for many of your high-reliability miniatorized systemations Solid

Solid Circuit networks are a major departure from conventional components because they integrate resistor, capacitor, diode, and transitor functions into a single figh-purity semiconductor water. Protection and packaging of discretic elements is eliminated, and contacts between dissimilar materials are minimized, reducing element interconnections as much as 80%. Fubricistion steps have been reduced to one-tenth those required for the same been reduced to non-tenth those required. Only a few process stops and time-proved T1 mass production tochniques permit a high degree of process control in Solid Circuit network fabrication. The result of these facts reliability is hailt into each Solid Circuit network.

If you need to reduce equipment size and weight—or to design a more complex system in the same size—invesingate Solid Circuit networks for your missile, satellitic, space vehicle, and other microelectronic programs. Ti equipments are ready to custom design this concept to your requirements. Contact your nearest TI Sales Engineer today. The TI Type 502 Solid Circuit network is immediately available for your evaluation,



The first working integrated circuit was built by Kilby in September of 1958 and was, for expediency, fabricated of germanium (Fig. 1.6). The first commercially available integrated circuits, announced in March of 1960 with the advertisement shown in Fig. 1.7, were made of silicon.

It was after the mesa process and before the planar process that Kilby's patent application was filed, and, because of the manner in



Robert Noyce is the inventor of the Microchip® Integrated circuit and a co-founder of both Fairchild Semiconductor Corporation and Intel Corporation. (Source: Photograph courtesy of Intel Corporation.) "Registered trademark of Fairchild Semiconductor Corporation. which the IC construction was described in the Kilby patent (14), when the planar process was introduced, a separate patent on an IC using planar technology and adherent leads was awarded to Robert Noyce of Fairchild Semiconductor (15). That award was the subject of a heated legal battle that arose over whether Kilby's patent statement as follows:

Instead of using the gold wires 70 in making electrical connections, connections may be provided in other ways. For example, an insulating and inert material such as silicon oxide may be evaporated onto the semiconductor circuit wafer through a mask either to cover the wafer completely except at the points where electrical contact is to be made thereto, or to cover only selected portions joining the points to be electrically connected. Electrically conducting material such as gold may then be laid down on the insulating material to make the necessary electrical connections.

covered the portion of the Noyce claims that described "... an electrical connection to one of said contacts comprising a conductor adherent to said layer ... " The disagreement centered around whether "laid down" was equivalent to "adherent to." The Board of Patent Interference, ruling in Kilby's favor, asserted that it was. However, a subsequent ruling by the Court of Customs and Patent Appeals (*Noyce v. Kilby*; *Kilby v. Noyce*, decided November 6, 1969) reversed the previous rulings and allowed the Noyce claims. (The Supreme Court then refused to review the case.) Contrary to assertions by some, the ruling did not depend on whether gold could or could not be made suitably adherent to silicon oxide. The Court specifically commented on that aspect. The ruling depended on the Court's assessment of whether or not someone reading Kilby's statement would be inevitably drawn to the conclusion that the lead should be adherent.

The Noyce patent application<sup>6</sup> was filed a few months after Kilby filed his, as well as after the Kilby concept was made public. However, it was issued April 25, 1961, three years before the Kilby patent. Perhaps because of this anomaly of issuing dates, and perhaps because of a mixing of the idea of a planar silicon integrated circuit with the already announced integrated circuit concept of Kilby, there has been considerable controversy in some circles over the true inventor of the integrated circuit. The sequence of events as portrayed in the patent and technical literature is shown in Table

<sup>&</sup>lt;sup>6</sup> His first notebook date was in January of 1959, before the public announcement of the integrated circuit concept by Texas Instruments, but well after Kilby's original notebook entry and after the Kilby approach had been described to various government agencies.

### 1.3 THE INTEGRATED CIRCUIT 13

1.2. Comments by some of the participants may be found in references 16–18. Additional comments are in references 19–20. Probably the most balanced assessment of Kilby's and Noyce's relative contributions is contained in the citations of the Franklin Institute's 1966 Ballantine Medal award, which they shared. Kilby was credited for "conceiving and constructing the first working monolithic circuit in 1958," and Noyce for "his sophistication of the monolithic circuit for more specialized use, particularly in industry."

One interesting aspect of the sequence shown in Table 1.2 is the issuance of the K. Lehovec patent for the use of a plurality of pn junctions for isolating integrated circuit components. It has been considered by some as being key to integrated circuit development in that it provided the necessary component isolation (21). Histori-

### TABLE 1.2

Key Events in Development of the Integrated Circuit

| Date             | Event                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | Responsible<br>Party                      |
|------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------|
| February 6, 1959 | File date of U.S. Patent 3,138,743, which stated: "This invention<br>relates to miniature electronic circuits, and more particularly to<br>unique integrated electronic circuits fabricated from<br>semiconductor material."                                                                                                                                                                                                                                                                                                                                 | J.S. Kilby                                |
| March 6, 1959    | Public announcement of concept described in Kilby patent<br>application.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Texas Instruments<br>Incorporated         |
| April 22, 1959   | File date of U.S. Patent 3,029,366, which covered electrical<br>isolation through the use of at least two pn junctions between<br>each component and described conductive ink interconnections<br>over deposited oxide.                                                                                                                                                                                                                                                                                                                                      | K. Lehovec                                |
| May 1, 1959      | File date of U.S. Patent 3,025,580, which covered the planar process for making planar transistors.                                                                                                                                                                                                                                                                                                                                                                                                                                                          | J.A. Hoerni                               |
| July 3, 1959     | File date of U.S. Patent 2,981,877, which stated: "Its principal<br>objects are these: to provide improved device-and-lead structures<br>for making electrical connections to various semiconductor<br>regions; to make unitary circuit structures more compact and<br>more easily fabricated in small sizes than has heretofore been<br>feasible; and to facilitate the inclusion of numerous<br>semiconductor devices within a single body of material." This<br>patent covered the use of planar transistors and metallization<br>running over the oxide. | R.N. Noyce                                |
| August 1959      | Public announcement of transistors built by planar process,                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | Fairchild<br>Semiconductor<br>Corporation |
| March 26, 1960   | Public announcement of availability of solid circuit silicon<br>multivibrator.                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Texas Instruments<br>Incorporated         |
| October 1960     | Technical meeting at which planar process was described.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | J.A. Hoerni                               |

cally, pn junction isolation of a single discrete device was mentioned in a 1954 Shockley patent describing the use of ion implantation in transistor fabrication (22). Fig. 1.8 is a circuit diagram taken from that patent. Shockley's discussion relating to the pn junction was as follows: "It will be noted that the junction between the N body and the P layer is biased in the reverse direction. Hence the body serves essentially as a passive support for the layer." Then, before the Lehovec patent, the Kilby patent had already addressed the general problem of isolation when several components were present in one wafer and suggested various ways of providing it. One was by the use of high-resistivity material (used in gallium arsenide circuits), and one was by the use of pn junction isolation. The pertinent Kilby patent text is as follows:

Of importance to this invention is the concept of shaping. This shaping concept makes it possible in a circuit to obtain the necessary isolation between components and to define the components or, stated differently, to limit the area which is utilized for a given component. Shaping may be accomplished in a given circuit in one or more of several different ways. These various ways include actual removal of portions of the semiconductor material, specialized configurations of the semiconductor material such as long and narrow, L-shaped, U-shaped, etc., selective conversion of intrinsic semiconductor material by diffusion of impurities thereinto to provide low resistivity paths for current flow, and selective conversion of semiconductor material of one conductivity type to conductivity of the opposite type wherein the p-n junction thereby formed acts as a barrier to current flow [italics added].

The Lehovec patent was filed after the filing of the Kilby patent and later was the subject of an interference proceeding by Kilby. Since the Kilby discussion supports pn junction isolation, if one of his figures had included *two* diffused resistor components instead of just one, the Lehovec patent probably would never have been is-

### FIGURE 1.8

Fig. 1 from U.S. Patent 2,666,814 (filed April 27, 1949; issued January 19, 1954) showing use of a reverse-biased pn junction for electrical isolation.



### 1.3 THE INTEGRATED CIRCUIT 15

### FIGURE 1.9

(a) Use of an epitaxially grown thin layer of n-type silicon on a p-type slice to provide part of the required electrical isolation. (b) Use of that layer combined with localized n-type diffusions to improve performance of transistors built in the layer over the diffused regions.



sued. The Board of Patent Interference ruled that Kilby's patent did not cover the Lehovec claims. It did not, however, rule on whether the Lehovec patent covered the methods typically used for IC isolation. In fact, the claims do not appear to support the case of ordinary bipolar IC isolation, although that aspect has not been addressed by the courts. Lehovec's text describes the use of two or more junctions for isolation that extend from one face of the semiconductor body to the other but does not mention the typical IC isolation case in which all junctions terminate on the same side of the wafer. His claims typically state:

A multiple semiconductor assembly comprising a semiconductor slice having a plurality of regions of alternating p and n conductivity types to thereby provide a plurality of p-n junctions, two of said regions being separated by at least two of said p-n junctions, ....

The "two of said regions being separated by at least two of said p-n junctions, . . ." would apply to an IC, but "a semiconductor slice having a plurality of regions of alternating p and n conductivity types to thereby provide a plurality of p-n junctions . . ." does not appear to apply since isolation for a whole chip is normally provided by just one p-type region and multiple n-type regions.

Despite the fact that there were no interconnection claims. Lehovec described in the body of his patent an interconnection method that used conductive ink over an evaporated quartz insulating layer. Since conductive inks are adherent and since his patent was filed before Noyce's, one can speculate that had a related claim been included in his patent application, he would now have the patent on IC interconnections.

Two later developments, while not directly related to IC development, contributed greatly to bipolar IC manufacturing ability and performance. One was the use of epitaxial overgrowth (23), which in IC applications allowed a thin n-type layer to be overgrown on a high-resistivity p-type substrate, as shown in Fig. 1.9a. The other was the use of localized high-concentration diffusions before the growth of the epitaxial layer, as shown in Fig. 1.9b. in order to reduce transistor collector resistance (24).

While early IC work was all bipolar, in 1960, a practical MOS transistor was announced (25, 26), and by 1962, a MOS IC consisting of 16 silicon n-channel transistors had been constructed (27). Efforts to use germanium, either by itself or overgrown onto semi-insulating gallium arsenide (GaAs), as a material for ICs continued for several years but never proved successful. GaAs was, however, by the mid-1980s being used for niche applications requiring very-high-frequency ICs.



Silicon thus remains the mainstream material, and as shown in Fig. 1.10, the U.S. usage of silicon ICs had reached astronomical proportions by 1980. As represented in Fig. 1.10, this usage is given in terms of AEGs, or active element groups, which might, for example, consist of a single-stage amplifier, a single gate, or a single bit of memory. The term was apparently first applied in a limited manner to vacuum tube circuits and then by Patrick Haggerty (*IEEE Spectrum 1*, pp. 80–82, June 1964) to transistor circuits. Each AEG is made up of various combinations of the following:

- Bipolar transistors, with or without multiple emitters or collectors
- 2. pn junction diodes, which can also be used as capacitors
- 3. Conventional metal-plate capacitors
- 4. MOS transistors
- 5. MOS capacitors
- 6. MIS transistors
- 7. Schottky diodes
- 8. Diffused resistors
- 9. Thin-film resistors
- 10. Interconnecting leads

Conspicuously missing from this list of available components are inductors. It is true that small inductors can be made by spiraling a conductor and that at some frequencies there can be enough phase shift to make a transistor behave as though inductance is present. However, for all practical purposes, the IC world remains inductorless.

### 4 SEMICONDUCTOR PROCESSES 17

## 1.4 SEMICONDUCTOR PROCESSES

The major semiconductor wafer fabrication steps in the mesa process of 1958 were diffusion, metallization, silicon etching, and various cleanups. It was with these steps that the first ICs were made. The planar process of 1960 introduced three more steps: thermal oxidation of silicon (as a step separate from diffusion), lithographic printing of a pattern in a photoresist layer on top of the oxide, and the etching of that pattern in the oxide. Since 1960, there have been only four processing innovations that allow new kinds of structures or circuits to be fabricated, and they were all developed in the 1960s. They were as follows:

- The use of epitaxial layers in bipolar circuits to provide an easy method of device isolation. (Later, epitaxial layers were deposited over previously diffused areas of the circuit to reduce collector saturation resistance and time, thus providing high-speed circuitry.)
- The use (in 1963) of epitaxial overgrowth of silicon onto insulating substrates such as sapphire.
- The use of chemically vapor-deposited (CVD) materials, such as silicon nitride, first (in 1965) for protective overcoats and later as diffusion masking material.
- The use of ion implantation (first practical application in 1968 for MOS threshold adjusting) as an adjunct to diffusion.

This is not to imply that the industry has been stagnant, however. Tremendous strides have been taken in increasing the complexity of IC circuitry available and in decreasing the cost of each AEG. However, these advances have been accomplished primarily through the use of more sophisticated equipment and through process variants such as the use of plasma etching instead of wet etching and optical steppers and the use of electron beam writers or X-ray printing instead of contact printers or the original handpainted black wax for patterning. Table 1.3 illustrates the growth in capability and cost of printing technology during the past 30 years. The increasing complexity of patterning equipment is typical of the other processes as well.

Besides the advances made in processing equipment, other parts of the industry have changed as well. The increase in circuit complexity can be judged by comparing the first commercially available ICs (like the one in the advertisement shown in Fig. 1.7, which consisted of a single flip-flop with two transistors, four diodes, four capacitors, and six resistors) with a large MOS memory such as the one megabit DRAM chip shown in Fig. 1.11, which contains a mil-

# 18 CHAPTER 1 HISTORICAL OVERVIEW

# TABLE 1.3

The Increasing Complexity of Patterning Equipment

| Year  | Size of Feature<br>Patterned | Steps in Operation                                                                                                                      | Equipment Cost<br>per Machine<br>in Year of<br>Purchase | Comments                                        |
|-------|------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------|-------------------------------------------------|
| 1957  | 10 mils*                     | Handpaint black wax on<br>area to be protected;<br>etch mesas                                                                           | \$0.10                                                  | Camel's hair brush                              |
| 1960  | 2 mils                       | Apply photosensitive layer<br>("resist"); use contact<br>printer and mask to<br>expose desired area;<br>develop resist; etch<br>feature | \$2,000                                                 | In-house constructed contact printer            |
| 1965  | 0.6 mils                     | Contact printer                                                                                                                         | \$5,000                                                 | Commercial contact printer                      |
| 1970  | 0.4 mils                     | Contact printer                                                                                                                         | \$15,000                                                | Commercial contact printer                      |
| 1975  | 0.15                         | Optical projection printing.                                                                                                            | \$80,000                                                | Optical projection printing                     |
| 1980  | 0.08                         | Optical projection printing                                                                                                             | \$120,000-500,000                                       | Optical projection printing                     |
| 1985  | 0.06 mils                    | Optical projection or<br>electron beam printing                                                                                         | \$500,000-3,000,000                                     | Optical projection or electron beam<br>printing |
| 1990† | 0.024                        | Optical projection,<br>electron beam, or X-ray<br>printing                                                                              | \$500,000-3,000,000                                     | Optical projection or electron beam<br>printing |

\*The semiconductor industry began by using mils, and that terminology is used for comparison throughout this table. †Projected.

Note: For a description of patterning equipment, see Chapters 2 and 5.

lion memory cells plus thousands of additional transistors for various other operations. In ICs of the 1960s, chip area was perhaps  $0.01 \text{ cm}^2$ ; it is now approaching  $1 \text{ cm}^2$ . Thus, while chip size has increased, it has by no means followed commensurately with circuit complexity. The  $100 \times$  increase in area has been accompanied by a millionfold increase in circuit elements.

The slice diameter has also steadily increased since the introduction of the IC, as shown in Table 1.4. In some cases, the impetus has been to improve yields; in some, to reduce equipment and facility costs; and in others, to reduce labor costs. Not shown by the table is the small dip in the slice diameter that occurred in 1960–1962 because of the difficulty in producing uniform epitaxial depositions on slices of diameters larger than 18–20 mm.

## 1.4 SEMICONDUCTOR PROCESSES 19

# FIGURE 1.11

One megabit CMOS DRAM chip. (Source: Courtesy of Texas Instruments Incorporated.)



Circuit design has become much more sophisticated and attuned to space-saving features since the first flip-flops were introduced. Two or three layers of interconnections between circuit elements are now used, and the number of active devices required to perform a given function has decreased. For example, some of the early DRAM memory cells required six transistors, whereas now one transistor and one capacitor suffice. Also, much more attention is given to designs that are more tolerant of the expected process variations. Simultaneously, process control has been continually tightened through better understanding of the processes and the introduction of microprocessor control of most of the equipment. With the recognition that most yield loss is associated with particulates on the slice, extensive effort has been expended to provide better cleanups, cleaner and purer chemicals, and a cleaner manufacturing environment.

Probably the most dramatic way of displaying the effect of all of the improvements on semiconductor manufacturing during the past three decades is to compare silicon usage with semiconductor output, as is done in Table 1.5. To dramatize the effect of the original inventions and the subsequent improvements on our daily lives can be done by mentally eliminating all of the things around us that depend on transistors for their successful operation.

# TABLE 1.4

| Year | Diameter (mm) |
|------|---------------|
| 1955 | 25*           |
| 1960 | 25            |
| 1965 | 37            |
| 1970 | 50            |
| 1975 | 75            |
| 1980 | 100           |
| 1985 | 150           |

1.1 01

# 20 CHAPTER 1 HISTORICAL OVERVIEW

| TABL | E 1 | 15   |
|------|-----|------|
| INDL | 100 | 0.00 |

Silicon Usage Compared with Semiconductor Output

| Year | Polycrystalline<br>Silicon (tons) | Silicon Transistors<br>(millions) | IC<br>AEGs<br>(billions) |
|------|-----------------------------------|-----------------------------------|--------------------------|
| 1955 | 5                                 | 0.09                              | -                        |
| 1960 | 30                                | 8.8                               | -                        |
| 1965 | 50                                | 273                               | -                        |
| 1970 | 500                               | -                                 | 2                        |
| 1975 | 1500                              |                                   | 200                      |
| 1980 | 3000                              | -                                 | 10,000                   |

Sources: Data on polycrystalline silicon from reference 2 and industry estimates. Data on silicon transistors from *Electronic Industries Association* Yearbook, 1967.

# REFERENCES 1

- G.W. Pickard, "Thermo-Electric Wave Detectors," *Electrical World* 4, p. 1003 (L), November 24, 1906.
- G.K. Teal et al., Chap. 6, "Semiconductor Materials," in James F. Young and Robert Shane, eds., *Material Processes*, Marcel Dekker, New York, 1985.
- Oskar Heil, "Improvement in or Relating to Electrical Amplifiers and Other Control Arrangements and Devices." British Patent 439,457, September. 26, 1939.
- Henry C. Torrey and Charles A. Whitmer, Crystal Rectifiers, McGraw-Hill Book Co., New York, 1948.
- J. Bardeen and W.H. Brattain, "The Transistor, A Semi-Conductor Triode," Phys. Rev. 74, pp. 130-231, 1948.
- W. Shockley, M. Sparks, and G.K. Teal, "p-n Junction Transistors," *Phys. Rev.* 83, pp. 151– 162, 1951.
- G.K. Teal, "Some Recent Developments in Silicon and Germanium Materials and Devices." paper presented at National IRE Conference, Dayton, May 10, 1954.
- J.S. Saby, "Fused Impurity p-n-p Junction Transistors," Proc. IRE 40, pp. 1358–1360, 1952.
- G.L. Pearson and C.S. Fuller, "Silicon p-n Junction Power Rectifiers and Lightning Protectors," *Proc. IRE* 42, p. 760 (L), 1954.

- M. Tanenbaum and D.E. Thomas, "Diffused Emitter and Base Silicon Transistors," *Bell* Syst. Tech. J. 35, pp. 1–22, 1956. Charles A. Lee, "A High-Frequency Diffused Base Germanium Transistor," *Bell Syst. Tech. J.* 35, pp. 23–24, 1956.
- R.N. Hall, "p-n Junctions Produced by Rate Growth Variations," *Phys. Rev.* 88, p. 139, 1952.
- Jean A. Hoerni, "Planar Silicon Transistors and Diodes," paper presented at IRE Electron Devices meeting, Washington, D.C., October 1960, J.A. Hoerni, "Method of Manufacturing Semiconductor Devices," U.S. Patent 3,025,589, March 20, 1962 (filed May 1, 1959).
- "Ultramicroscopic Circuits." Electronic Design, April 29, 1959.
- J.S. Kilby, "Miniaturized Electronic Circuits," U.S. Patent 3,138,743, June 23, 1964 (filed February 6, 1959).
- R.N. Noyce, "Semiconductor Device-and-Lead Structure," U.S. Patent 2,918,877, April 25, 1961 (filed July 30, 1959).
- Jack S. Kilby, "Invention of the Integrated Circuit," *IEEE Trans. on Electron Dev. ED-23*, pp. 648–654, 1976.
- Robert Noyce. "Microelectronics." Scientific American 237, pp. 63–69, 1977.
- 18. Kurt Lehovec, "Invention of p-n Junction Iso-

REFERENCES 21

lation in Integrated Circuits," *IEEE Trans. on Electron Dev. ED-25*, pp. 495–496, 1978.

- M.F. Wolff, "The Genesis of the Integrated Circuit," *IEEE Spectrum* 13, pp. 45–53, August 1976.
- C. Lester Hogan, "Reflections on the Past and Thoughts about the Future of Semiconductor Technology," *Interface Age 2*, pp. 24–36, March 1977.
- Sorab K. Ghandhi, *The Theory and Practice of* Microelectronics, John Wiley & Sons, New York, 1968.
- W. Shockley, "Semiconductor Translating Device," U.S. Patent 2,666,814, January 19, 1954 (filed April 27, 1949).
- 23. H.C. Theuerer et al., "Epitaxial Diffused Transistors," Proc. IRE 48, pp. 1642–1643, 1960.

- B.T. Murphy, "Monolithic Semiconductor Devices," U.S. Patent 3,237,062, February 22, 1966 (filed October 20, 1961).
- D. Kahng and M.M. Atalla, "Silicon-Silicon Dioxide Field Induced Surface Devices," paper presented at IRE-AIEE Solid-State Device Conference, Pittsburg, 1960.
- Dawon Kahng, "A Historical Perspective on the Development of MOS Transistors and Related Devices," *IEEE Trans. on Electron Dev. ED-23*, pp. 655–657, 1976.
- S.R. Hofstein and F.P. Heiman, "The Silicon Insulated-Gate Field-Effect Transistor," *Proc. IEEE 51*, pp. 1190–1202, 1963.



# Processing Overview

# 2.1 INTRODUCTION

In Chapter 1, several processing operations were referenced, and some steps were described to illustrate differences in construction of various early devices. In this chapter, the individual processes required to produce an IC will be briefly discussed, and then the manner in which they are combined to make the circuit elements required for a complete IC will be described.

The steps required to produce ICs, beginning with the smelting of either metallurgical-grade Si or metallurgical-grade Ga and As, are listed in Table 2.1. As indicated in the table, several operations are not considered to be part of the IC chip fabrication process.<sup>1</sup> These steps will be considered only briefly, and only in this chapter.

Typically, separate companies specialize in various aspects of silicon material preparation. Some manufacture semiconductorgrade Si, and some grow the single crystals and supply the slices polished and ready for the first oxidation. Similarly, GaAs compounding, crystal growth, and slice cutting and polishing are generally performed by a separate materials company.

While there must be a considerable amount of interaction between the slice supplier and the wafer-fab process engineers, substantial differences exist in the engineering disciplines required, and the division is a natural one. Tables 2.2 and 2.3 give typical silicon and gallium arsenide slice specifications and illustrate the multitude of slice parameters that are important to the performance and yield of ICs. The gross mechanical specifications relate to the need for the slice to fit standardized handling equipment. The primary and secondary flats (ground on the crystal before it is sliced) are used to indicate the orientation of the slice surface and to provide an azi-

<sup>&#</sup>x27;The manufacturing facility for IC fabrication is usually referred to as a "frontend" or a "wafer-fab" (wafer fabrication) area.

# 2.1 INTRODUCTION 23

| TABLE 2.1                   | Manufacturing Operation                                          | Discussion             |
|-----------------------------|------------------------------------------------------------------|------------------------|
| IC Manufacturing Operations | Material Preparation                                             |                        |
|                             | Semiconductor-grade Si and GaAs manufacturing                    | This chapter           |
|                             | Crystal growing                                                  | This chapter           |
|                             | Slice cutting and polishing                                      | This chapter           |
|                             | Photomask manufacturing                                          | Chapter 5              |
|                             | Wafer Fabrication                                                |                        |
|                             | Cleaning of surfaces                                             | Various chapter        |
|                             | Growth of epitaxial layer                                        | Chapter 7              |
|                             | Thermal oxidation of silicon                                     | Chapter 3              |
|                             | Patterning of the various layers (lithography)                   | Chapter 5              |
|                             | Diffusion of impurities into silicon                             | Chapter 8              |
|                             | Ion implantation of impurities                                   | Chapter 9              |
|                             | Chemical vapor deposition of polycrystalline<br>silicon          | Chapter 7              |
|                             | Etching of silicon and GaAs                                      | Chapter 6              |
|                             | Deposition of insulating layers (silicon oxide or nitride)       | Chapter 4              |
|                             | Etching of insulating layers (silicon oxide or<br>nitride)       | Chapter 6              |
|                             | Deposition of conductive layer (usually metal)                   | Chapter 10             |
|                             | Etching of conductive layers (metal, polysilicon, other)         | Chapter 6              |
|                             | Alloying (sintering) to form metal-silicon<br>electrical contact | Chapter 10             |
|                             | Backgrinding (thinning of wafer by grinding)                     | This chapter           |
|                             | Multiprobing (DC electrical testing of each IC on wafer)         | Chapter 11,<br>briefly |
|                             | Assembly/Test                                                    |                        |
|                             | Cutting or breaking of wafers into individual chips              | Not covered            |
|                             | Packaging of individual chips                                    | Not covered            |
|                             | Full AC and DC electrical testing of packaged<br>ICs             | Not covered            |

muthal orientation reference. The SEMI Standards for the orientation of these flats (which manufacturers usually adhere to) are shown in Fig. 2.1. Flatness affects the amount of pattern distortion produced during the printing operation. The (flatness terms used in the specification sheets are defined in the section on flatness in Chapter 5.) The oxygen content of silicon affects the slice resistance to thermal shock and the concentration of some kinds of crystal defects that form during processing. Resistivity is dictated by the elec-

# TABLE 2.2

# Silicon Slice Specifications: (100) and (111) Orientation

|                                     | 100 mm       | 100 mm Diameter 125 mm D |                          | Diameter | 150 mm             | 50 mm Diameter |  |
|-------------------------------------|--------------|--------------------------|--------------------------|----------|--------------------|----------------|--|
| Item                                | n-type       | p-type                   | n-type                   | p-type   | n-type             | p-type         |  |
| Resistivity (ohm-cm)                |              |                          |                          |          |                    |                |  |
| Max.                                | 20.0         | 40.0                     | 20.0                     | 40.0     | 20.0               | 40.0           |  |
| Min.                                | 0.008        | 0.010                    | 0.008                    | 0.010    | 0.008              | 0.010          |  |
| Resistivity gradient (%)            | 1            | 0                        | 10                       |          | 10                 |                |  |
| Carbon conc. (ppma)                 |              |                          |                          |          |                    |                |  |
| Standard                            | <            | 2.0                      | < 2.0                    |          | < 2.0              |                |  |
| Low                                 | <            | 0.3                      | < 0.3                    |          | < 0.3              |                |  |
| Oxygen conc. (ppma)                 |              |                          |                          |          |                    |                |  |
| High                                | 30-          | -40                      | 30-40                    |          | 30-40              |                |  |
| Medium                              | 26-33        |                          | 26-33                    |          | 26-33              |                |  |
| Low                                 | 24-30        |                          | 24-30                    |          | 24-30              |                |  |
| Dislocation density/cm <sup>2</sup> | <100         |                          | <100                     |          | <100               |                |  |
| Diameter tolerance                  | $\pm 0.5$    | 5 mm                     | ± 1.0 mm                 |          | ±1.                | 0 mm (         |  |
| Primary flat length                 | 30-3         | 5 mm                     | 40-45 mm                 |          | 55-6               | 0 mm           |  |
| Secondary flat length               | 16-20        | 0 mm                     | 25-30 mm                 |          | 35–40 mm           |                |  |
| Slice thickness                     |              |                          |                          |          |                    |                |  |
| Min. available                      | 375 ±        | 25 µm                    | $625 \pm 25 \mu\text{m}$ |          | $675 \pm 25 \mu m$ |                |  |
| Max. available                      | 710 ±        | 25 µm                    |                          |          |                    |                |  |
| Typical FPD* (microns)              | 3.5 ove      | er 100%                  | 4.0 over 100%            |          | 4.0 over 100%      |                |  |
|                                     | usabl        | e area                   | usable area              |          | usable area        |                |  |
| Max. FPD (microns)                  | 5.0 over 95% |                          | 5.5 over 95%             |          | 5.5 over 95%       |                |  |
|                                     | usable area  |                          | usable area              |          | usable area        |                |  |
| Max. taper (µm)                     | 10           | 0.0                      | 10.0                     |          | 15.0               |                |  |
| Max. bow (µm)                       | 35           | 5.0                      | 40.0                     |          | 50.0               |                |  |
| *Local flatness screening if re     | quested.     |                          |                          |          |                    |                |  |

trical properties desired of the circuit elements. Crystal orientation affects the semiconductor surface properties and thus device performance.

The material processing flows prior to entry into wafer fabrication are shown in Fig. 2.2 and Fig. 2.3. Metallurgical-grade Si is made in large quantities by reducing quartz sand with coke in an electric furnace. It is then used in the manufacture of some grades of steel and as an alloying material in high-strength aluminum. The reaction of anhydrous HCl with metallurgical-grade Si in the 800°C range gives compounds such as SiCl<sub>4</sub> (silicon tetrachloride) and SiHCl<sub>3</sub> (trichlorsilane). SiHCl<sub>3</sub> is used as raw material for the silicon industry and for semiconductor-grade Si manufacturing. Actually,

# 2.1 INTRODUCTION 25

# TABLE 2.3

Gallium Arsenide Slice Specifications

|                                      | Item*                                       |                                             |  |  |
|--------------------------------------|---------------------------------------------|---------------------------------------------|--|--|
| Parameter                            | 50.8 mm Diameter                            | 76.2 mm Diameter                            |  |  |
| Doping                               | Chromium                                    | Chromium                                    |  |  |
| Resistivity                          | $> 5 \times 10^7 \Omega$ -cm                | $> 5 \times 10^7 \Omega$ -cm                |  |  |
| Mobility                             | $> 4000 \text{ cm}^2/\text{V}\cdot\text{s}$ | $> 4000 \text{ cm}^2/\text{V}\cdot\text{s}$ |  |  |
| Etch pit density per cm <sup>2</sup> | $2 \times 10^{4} - 5 \times 10^{4}$         | $2 \times 10^{4} - 5 \times 10^{4}$         |  |  |
| Diameter tolerance                   | 0.4 mm                                      | 0.6 mm                                      |  |  |
| Primary flat length                  | 14.5-17.5 mm                                | 19-25 mm                                    |  |  |
| Secondary flat length                | 6.5–9.5 mm                                  | 10-13 mm                                    |  |  |
| Minimum slice thickness<br>available | $400 \pm 25 \mu\text{m}$                    | $500 \pm 25 \ \mu m$                        |  |  |
| Maximum slice thickness<br>available | $625 \pm 25 \ \mu m$                        | $750 \pm 25 \ \mu m$                        |  |  |
| Flatness                             | $< 0.8 \ \mu m/cm$                          | $< 0.8 \ \mu$ m/cm                          |  |  |
| Bow                                  | $> 20 \ \mu m$                              | > 30 µm                                     |  |  |
| *Round and grown by liquid-e         | ncapsulated Czochralski m                   | ethod.                                      |  |  |

# FIGURE 2.1

Location of major (primary) and minor (secondary) flats on Si and GaAs slices less than 200 mm in diameter when viewed from polished side. Two-hundred mm diameter Si slices may have either a flat or a notch to mark the (011) face. The other pertinent information is laser-scribed near the periphery on the polished side. (*Source:* Based on SEMI Specifications, Spring 1989.)







# 2.1 INTRODUCTION 27

either SiCl<sub>4</sub> or SiHCl<sub>3</sub> can be purified by distillation and then reduced by hydrogen to give high-purity Si. In addition, other silicon compounds, such as SiH<sub>4</sub> (silane) can be used. Silicon manufacture is characterized by high power usage and poor reduction efficiency—thus the emphasis on recycling of feed stock and the continuing search for more efficient reaction chambers. In the conventional Siemens process,<sup>2</sup> silicon rods of up to 2 m in length are electrically heated in an atmosphere of SiHCl<sub>3</sub> and hydrogen and grow in diameter from less than 10 mm to over 125 mm. Some of the newer, more efficient processes use fluid bed reactors, where small silicon particles increase to about pea size in the reactor and are then withdrawn.

Gallium and arsenic are both produced primarily as by-products of other refining operations, with gallium coming from aluminum smelting and arsenic from copper smelting. The commercial-grade product (element or compound) is then converted to an easy-topurify compound, is purified, is reduced, and, in the case of arsenic, is often rerefined by vacuum sublimation. The two elements are then combined (compounded) to give GaAs. The arsenic vapor pressure over gallium arsenide is high enough that compounding must be done under pressure. To directly combine the two elements at the same temperature, pressures of about 60 atm are required. Alternatively, the gallium/GaAs can be held slightly above the GaAs melting point of 1238°C, and the arsenic at about 600°C in the same closed tube. The reaction will then proceed slowly at about 1 atm as the arsenic vaporizes and diffuses to the liquid gallium/GaAs, where it reacts with the liquid gallium.

Silicon crystals are grown from the melt by either the pulling (Czochralski or Teal-Little)<sup>3</sup> or the float-zone (FZ) process. However, almost no float-zone crystals are used in IC manufacture. Gallium arsenide crystals are grown either by a variation of the Czochralski (CZ) method or by the horizontal Bridgman method. While it is in principle possible to grow semiconductor-material crystals

<sup>&#</sup>x27;So-called because Siemens and Halske, AG, of Germany introduced the process.

<sup>&#</sup>x27;Ceochralski, in 1918, introduced a method for growing single-crystal metal wires that involved dipping a glass capillary tube into the metal melt and then withdrawing it and the attached thin metal thread. He is said to have gotten the idea by having mistakenly dipped his ink pen into a pot of molten tin on his laboratory workbench and then noting the attached frozen tin strand when he pulled the pen from the tin. In 1950, Teal and Little devised "crystal pulling" to grow the germanium single crystals used for early transistor development. Conceptually, their puller was the same as machines used today. The only new functions added in the intervening years have been automatic diameter control and, in some machines, magnetic damping of the melt circulation.

from solutions-for example, silicon from a molten aluminum solvent-all commercial crystals are produced from an essentially pure semiconductor melt.

In crystal pulling, shown schematically in Fig. 2.4, a single-crystal seed is dipped into the melt and slowly rotated. By controlling the temperature of the melt and the amount of heat withdrawn from the seed, freezing onto the seed (crystal growth) is possible. If too much heat is extracted, the whole surface of the melt will freeze over. If the melt is too hot, the seed will melt off above the surface. When conditions are perfectly balanced, controlled growth will occur as the seed and attached crystal are slowly withdrawn (pulled) from the melt. Stirring of the melt via crystal rotation reduces unsymmetrical growth caused by uneven heating of the melt. To minimize the number of dislocations in the crystal, the seeds must be no more than a few millimeters in diameter. To provide for economical IC fabrication, the crystals (Si) should be 100-200 mm in diameter. The growing-crystal diameter must thus be changed from 2 or 3 mm to 100-200 mm. This change and the subsequent diameter control required for the rest of the growth are accomplished by changing the melt temperature, spin (rotation) rate, and pull rate. Increasing the melt temperature decreases the crystal diameter; increasing the pull rate decreases the diameter; and increasing the crystal rotation rate increases the diameter.

Since the melt must be protected from oxidation, the crucible and crystal, as shown in Fig. 2.4, are surrounded by a chamber containing an inert or reducing atmosphere. For silicon, either atmo-

# FIGURE 2.4



Various stages of crystal pulling from the melt.

#### 2.1 INTRODUCTION 29

spheric or reduced pressure can be used. Gallium arsenide requires higher pressures, usually in the 5-60 atm range, and in addition, a molten layer of boric oxide on top of the melt to keep arsenic evaporation at a tolerable level. The latter procedure is referred to as liquid encapsulation. Silicon is particularly reactive but can be contained in fused silica (SiO<sub>3</sub>) with only a small amount of the silica being dissolved. This small amount, however, is the source of a substantial amount of oxygen that becomes incorporated into silicon crystals. Silica is also used as the container for GaAs crystals and. in this case, is the source of a small amount of silicon doping.

The horizontal Bridgman method of crystal growing, shown schematically in Fig. 2.5a, consists of loading the material to be grown into a long, narrow container, usually of semicircular cross section, and moving the container from a temperature (T<sub>i</sub>) slightly above the material's melting point to a temperature (T<sub>2</sub>) slightly below the melting point. In order to provide for single-crystal growth. a seed crystal is placed in one end of the container and partially kept in the cool zone so that only a portion of it melts. Thus, when the rest of the container is slowly moved into the cool zone, the melt freezes onto the seed as a single crystal. This growth procedure can be used only when a container material is available that will not bond to the crystal when it freezes. In the case of silicon, none are available. For gallium arsenide, fused silica is satisfactory.

Float-zone crystal growing, as shown in Fig. 2.5b, is a method in which the melt avoids contact with any container material. Here. a rod of polycrystalline material with a short, single-crystal seed at one end is held vertically by being clamped at each end. A narrow

# FIGURE 2.5

(a) Horizontal Bridgman method of crystal growing. which is used when the crystal does not stick to the container material. (b) Float-zone method of crystal growing. where the molten region is touched only by the ambient gas.



30

region, originally partially in the seed crystal and partially in the polycrystalline rod, is inductively melted and slowly moved the length of the rod by moving the induction coil relative to the rod. Unlike crystal pulling, in which it appears that the only limit to crystal diameter is the strength of the supporting seed, float-zone crystal diameter is limited by a combination of surface tension and levitation from the heating coil. Float-zoned silicon is used when the highest purity possible is required and when (as of 1989) the slice diameter need not be greater than 150 mm.

Silicon and gallium arsenide are brittle materials and have hardnesses that are a little less than that of quartz. They can be sawed and polished by the same techniques used for quartz. Both have pronounced cleavage planes, (111) for Si and (110) for GaAs, and in thin sections can be easily broken after scribing along traces of these planes on the wafer surface.4 For sawing and grinding operations. diamond-bonded wheels are used. Lapping is usually done with either alumina or silicon carbide grit slurries. Polishing can be done mechanically, using, for example, diamond dust, alumina, or zirconia. It can also be done electrolytically or chemically or by a combination of chemical etching and mechanical abrasion. The process actually used is dictated by the flatness required and the need for a minimum of residual mechanical damage. In the case of silicon, a chemical-mechanical combination using fine silica powder in a hydroxide solution is commonly employed. The analogous process for gallium arsenide is silica in a sodium hypochlorite solution.

Fig. 2.6 shows the mechanical shaping operations performed on a silicon crystal. First, the undulations on the grown crystal are ground away so that all slices will have the same diameter. Next, orienting flats are ground on the crystal before it is sawed into slices. Then, in order to minimize the chipping of slice edges (which would produce yield-reducing particulates), the slice edges are rounded. Finally, each slice is lapped with progressively smaller grit in order to remove fracture damage and then is polished on one or both sides. inspected, and sent to the wafer-fab area.

Mask making, even though technologically quite closely akin to wafer fabrication, is done in external "mask shops." The manufacturing steps are briefly discussed in Chapter 5. The masks are made of an ultraviolet, transparent material (usually glass or fused quartz), with a metallic-layer overcoat into which a pattern has been etched. Masks differ conceptually from the starting slices in that a particular mask pattern is specific to a given IC. Further, not just

<sup>&</sup>quot;See Appendix A for a discussion of planes and traces,

# 2.2 WAFER FABRICATION FACILITY 31

# FIGURE 2.6 Mechanical shaping operations from as-grown crystal to polished slices. (a) As-grown crystal (b) Grind crystal to remove undulations and saw to remove portions not, in resistivity range (c) Saw into slices (with orienting flats ground before sawing) (d) Round edges of slice by grinding

(e) Lap and polish slice

one mask, but a set of masks, each with a different but related pattern, is required for each circuit.

As shown in Fig. 2.7, the slices, masks, and required chemicals are brought together in a wafer fabrication facility for conversion into IC chips. Much of the facility is specifically designed to meet the requirements necessary for high-yield IC processing.

The wafer-fab manufacturing space, besides housing the equipment required for semiconductor processing, must also supply the proper environment for the process and for the human operators. Perhaps foremost is the provision for exhausting toxic fumes. These fumes range from relatively mild organic vapors such as benzene to extremely toxic gases such as phosphine, diborane, and anhydrous HF (hydrogen fluoride). In addition, many organic fumes are highly flammable, and hydrogen and silane are quite explosive. Highcapacity exhausts, coupled with specific gas detectors and great care in maintaining leakproof plumbing, are required for safety. Rooms used for epitaxial depositions and some CVD operations are laid out for quick evacuation and very rapid removal of toxic fumes in case of leaks or explosions.

In rooms where photoresist is applied, the humidity must be controlled. To improve resist adhesion to the slice surface, low hu-

2.2

WAFER FABRICATION FACILITY



midity is desirable. However, as humidity decreases, problems with static discharge and operator discomfort emerge. As a compromise, a nominal 45% is often used. Ultraviolet (UV) resist is insensitive to longer wavelength light; consequently, yellow light is used in rooms where resist is applied, exposed, and developed to provide visibility and yet not expose the resist.

A substantial portion of the wafer-fab facility must be "cleanroom"—that is, have areas where the total airborne particles larger than, say, 0.1  $\mu$ m will be 1 to 10 per cubic foot.<sup>5</sup> The need for such a cleanroom can be appreciated from the data of Fig. 2.8, which shows the size of some common airborne particulates. The usual criterion is that particles on critical areas of the wafer must be no larger than one-fourth of a line width. Thus, for a one Mbit DRAM with minimum geometries of 0.8  $\mu$ m, even a large virus has the potential of causing a yield loss! The air in cleanrooms is cleaned by passing it through HEPA (high-efficiency particulate air) filters. To ensure that particulates generated in the room are swept out and not swirled around in eddys and deposited on wafer surfaces, the air flow must be slow and laminar. To accomplish this, the air usually comes in from the ceiling, moves down, and exits through a perforated floor.

High-purity deionized water (DI water) is essential to the semiconductor industry's slice-cleaning procedures. The purity of such water is generally specified in terms of its electrical conductivity or resistivity. Absolutely pure water, like intrinsic semiconductors, still has some conductivity, and for a similar reason—the partial disassociation of HOH into H and OH ions, which then conduct. Its resistivity at room temperature is slightly above 18 M $\Omega$ -cm, and

Cleanrooms are classified by the number of particles per cubic foot larger than some specified size. A class 10 room would be one with 10 or less particles greater than some specified size per cubic foot of air. (Office space typically has about 200,000 particles greater than 0.5µm per cubic foot.)

# 2.3 WAFER FABRICATION OPERATIONS 33



semiconductor-grade water is normally specified as 18 M $\Omega$ -cm. In addition to resistivity, the quantity of particulates and bacteria is also closely controlled. The water is irradiated with ultraviolet light to kill the bacteria, and several levels of filtration are used to remove the particulates and dead bacteria. The quantity of DI water needed is substantial, ranging from 100 to 200 gallons per completed wafer. Thus, a site producing a total of 50,000 wafers per month could use up to half a million gallons of water a day. A large and rather elaborate water treatment plant is required to produce the necessary quantity and quality from municipal water, and it is usually located in a separate building close to the wafer fabrication area.

Each of the wafer fabrication operations listed earlier in Table 2.1 have many steps and usually many variations, depending on exactly the kind of structure to be built. Furthermore, the order in which the steps are performed is by no means fixed, and a wafer may circulate back through some of them several times. Silicon IC wafer fabrication starts with the careful cleaning of a bare silicon slice of appropriate crystallographic orientation and the growing of a layer of thermal oxide on each surface. (Thermal oxide is SiO<sub>2</sub> formed by oxidizing the silicon surface.) Gallium arsenide, not being amenable to a native-grown protective oxide, is coated with a CVD silicon dioxide layer. The crystallographic orientation of the slice is important to the finished IC and is specified by the circuit designer. However, as Table 2.4 shows, the orientation also affects nearly all of the wafer-fab processing steps. As fabrication proceeds, other layers covering only one side of the wafer.<sup>6</sup> such as CVD oxide, poly-

# FIGURE 2.8

Size of various common airborne particles.

# 2.3

WAFER FABRICATION OPERATIONS

<sup>&</sup>quot;While the terms *slice* and *wafer* are often used interchangeably, the recommended terminology is *slice* for a structure prior to the wafer-fab operations of Table 2.1 and *wafer* for the slice and the various layers added to it.

| TABLE 2.4                                                                    | Operation                             | Effect of Orientation                                                                                                                                                                                                       |
|------------------------------------------------------------------------------|---------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Effects of Crystallographic<br>Orientation of Slice Surface<br>on Processing | Thermal oxidation<br>Epitaxial growth | Rate varies with orientation: (100) < (110) < (111).<br>Rate varies with orientation: is very low exactly<br>on (111). Depending on orientation, depressions<br>in surface may change shape and slightly shift<br>position. |
|                                                                              | Etching                               | With some etchants, both aqueous and molten<br>metal, rate is very low in [111] direction.                                                                                                                                  |
|                                                                              | Diffusion                             | Orientation is dependent in silicon when done in<br>conjunction with thermal oxidation; (100) ><br>(111).                                                                                                                   |
|                                                                              | Ion implantation                      | Range is greatest in [110] direction; also is greater<br>directly in other low-indices directions as<br>compared with slightly off-orientation<br>directions.                                                               |

crystalline silicon, silicon nitride, and various metals will be added. Intermixed with these steps will be patterning (lithography), etching, diffusion, and ion implantation.

2.3.1 Surface Cleaning Complete cleaning of semiconductor surfaces requires that particulates, organic films, and adsorbed metal ions be removed. Most cleaning procedures are based on immersion in liquid baths or liquid sprays. In addition, ultrasonic agitation or brush scrubbing may be required. In some cases, high-temperature vapor etching or lowpressure sputter etching may be used. A good cleanup is complicated by the fact that unless great care is taken, the materials used for cleaning may contain (and leave behind) more particulates and metals than were on the surface in the beginning. Consequently, semiconductor-grade chemicals that are specially filtered and purified are used by the semiconductor industry. Organic solvents are widely used, but since they sometimes leave residues themselves, high-purity water is ordinarily the last stage of a cleanup. This last stage is often immediately preceded by an acid to oxidize and remove any remaining organics. 2.3.2 Epitaxy Epitaxy (epitaxial growth or epi) is the overgrowth of a thin layer of

Epitaxy (epitaxial growth or epi) is the overgrowth of a thin layer of single-crystal material onto a single-crystal substrate. In the case of ordinary epitaxy, it is the growth of a doped silicon layer onto a silicon slice of different resistivity and perhaps type. The epi growth process is unique in that it is the only way in which a layer can be produced with *fewer* total impurities than the underlying slice. De-

## 2.3 WAFER FABRICATION OPERATIONS 35

pending on the specific device application, the layer may be from 1 or 2  $\mu$ m to 50  $\mu$ m in thickness.

Vapor phase growth is the most common silicon epitaxial method and is done in a temperature range of 1000°C–1200°C by the reduction of some silicon-bearing compound at the slice surface. Common reactions are the hydrogen reduction of SiCl<sub>4</sub>. SiHCl<sub>3</sub>, or SiH<sub>2</sub>Cl<sub>2</sub> (dichlorosilane). For lower-temperature growth, the thermal reduction of SiH<sub>4</sub> is often used. Doping of the layer is accomplished by the co-deposition of the appropriate impurity—for example, by the thermal reduction of phosphine. Gallium arsenide epi for 1C applications is also generally grown from the vapor, although the task is complicated by requiring sources for two elements instead of only one. For discrete light-emitting devices, growth is most often from the liquid phase (LPE). In this process, the gallium arsenide wafer surface is bathed in a gallium–arsenic melt, and the temperature is varied so that a thin gallium arsenide layer freezes (grows) onto the surface.

The operation of oxidizing the silicon surface is performed in a temperature range of 800°C–1250°C and in an atmosphere containing oxygen or steam in an inert carrier gas. The oxide thickness increases, although not linearly, with oxidation time. Silicon is consumed during the oxidation process at a rate such that for every micron of oxide grown, the silicon surface recedes by 0.45  $\mu$ m. The oxidation rate increases with increasing temperature and pressure. To reduce the oxidation time, or keep a reasonable time at a lower temperature, oxidations are sometimes made at several atmospheres pressure.

If a source of dopant is provided, the oxidation operation becomes a diffusion. The mechanics of transferring an impurity atom directly from a gas ambient into the wafer is rather inefficient. Therefore, a process is generally chosen that lets the dopant react with silicon oxide on the wafer surface to give a silicate glass that then acts as the source. In the case of gallium arsenide, a deposited silicon oxide already containing the dopant is used for the source. To achieve better control of surface concentration and diffusion depth, the overall diffusion operation is often broken into two steps. The first, a dopant deposition, or "dep," is a very thin, high-concentration diffusion, with the surface concentration set by the solubility limit of the dopant at the dep temperature. The second, the "drive," is at a different temperature and is used to adjust the overall diffusion to give both the desired surface concentration and diffusion depth. The wafers are held at their edges, separated by 1-2 mm, in a quartz boat and heated in a long, quartz or polysilicon tube that has the neces-

2.3.3 Thermal Oxidation of Silicon

2.3.4 Diffusion

sary gases flowing through it. Temperature control is required to about  $\pm 1^{\circ}$ C, and diffusion times range from minutes to hours.

2.3.5 Ion Implantation An alternative to the deposition step is to accelerate the dopant atoms to a high velocity with an accelerator so that after striking the surface, they will continue on into the body of the semiconductor. Since acceleration is by electric field, ions, and not atoms, must be used-hence, the name ion implanter. The acceleration voltages used range from a few thousand to a few million volts. The penetration depth of the ions increases with increasing voltage and is typically a few hundred angstroms. When depths greater than those conveniently obtainable by implantation are required, implanting followed by an ordinary thermal diffusion is often used. Even if no further depth is desired, some additional heat treatment is still needed because the implanting causes substantial crystal damage. which must be annealed out. In very high dose implants, damage is so severe that amorphous layers are formed, but under appropriate heat treatment, such layers will regrow epitaxially to form highquality single-crystal layers.

2.3.6 Lithography Lithography, also referred to as photolithography, and sometimes as pattern printing, is the means by which a pattern of masking material (usually organic) is applied to the surface of a wafer. That pattern then provides protection to the desired portions of the wafer and allows material to be removed from the remaining area by a suitable etchant. The steps in patterning oxide on a wafer are shown in Fig. 2.9. The photosensitive resist is liquid and is applied by dropping a metered amount onto the middle of a slowly rotated wafer. The speed is then increased to a few thousand revolutions per minute so that centrifugal force can force the resist to flow out over the wafer surface and uniformly coat it. The rotational speed and resist viscosity are adjusted to give the required thickness (usually about 1 µm). The pattern in the resist is produced by exposing the resist to a similar pattern of high-intensity ultraviolet light and then developing it. The light pattern is defined by the pattern on the mask mentioned earlier. The mask is thus the counterpart to the conventional negative of photography. Either contact printing, with the mask in direct contact with the wafer, or a projection lens system can be used to transfer the mask image to the resist on the wafer. Both of these procedures are very similar to ordinary photographic processing, except that in this case, the mask pattern is either the same size or larger than the image desired on the slice. In the developing process, the unwanted resist is removed, thus leaving a pattern of bare oxide or whatever other material was under the resist.

36

2.3 WAFER FABRICATION OPERATIONS 37



# 2.3.7 Etching

# 2.3.8 Chemical Vapor Deposition (CVD)

2.3.9 Metal Deposition

After the resist pattern is defined, the underlying material must be removed by etching. The most common methods are by using aqueous-based etches and by plasma etching. In either case, a means of stopping the etch at the bottom of the layer being etched is necessary. It is not satisfactory to depend on etch rate and time. Rather, the etch must be chosen so that the rate differential between the layer being etched and the layer immediately beneath it is great enough so that the desired layer can always be completely removed and an inconsequential amount of the one below it can also be removed. For fine geometries, plasma etching is preferred and is considerably more anisotropic<sup>7</sup> than is wet etching. Typical wet etchants are HF–HNO<sub>3</sub> for silicon, HF for SiO<sub>2</sub>, hot H<sub>3</sub>PO<sub>4</sub> for silicon nitride, and cold H<sub>3</sub>PO<sub>4</sub> for aluminum. Plasma etching gases usually contain fluorine or chlorine—for example, CF<sub>4</sub>.

CVD is used to deposit layers of materials such as silicon oxide, silicon nitride, aluminum oxide, and polycrystalline silicon onto the wafers. CVD processes usually operate in a temperature range of 300°C–900°C and proceed by reactions such as follows:

 $SiH_4 + 2O_2 \rightarrow SiO_2 + 2H_2O$  $SiH_4 + heat \rightarrow Si + 2H_2$ 

Oxides deposited in this manner do not bond to the silicon surface in the same manner as a thermal oxide would and, in addition, may have a lower density. Consequently, they are not an alternative to thermal oxides but rather an adjunct to be used when thermal oxidation temperatures cannot be tolerated or when an insulating layer is required and there is no silicon to be oxidized. By using a plasma to supply a portion of the energy required, CVD reactions such as those just listed can be done at substantially lower temperatures. In particular, usable silicon nitride films can be deposited at 300°C, whereas straight CVD requires over 800°C.

Metal depositions are accomplished by evaporation, sputtering, CVD, or electroplating. Evaporation is easy and inexpensive. Sputtering, with its higher pressures, is used to provide better step coverage. When refractory metals are used, either sputtering or CVD is

<sup>&</sup>lt;sup>1</sup>Anisotropic etching has two meanings. In reference to single-crystal silicon, it means crystallographic orientation dependent etch rates. In most other contexts, it refers to etching straight down through the pattern window without any lateral undercutting.

# 2.3 WAFER FABRICATION OPERATIONS 39

required in order to get reasonable deposition rates. Electroplating is sometimes used to build up thin deposited layers and eliminate the need for etching through very thick metal. Aluminum is the most commonly used metal, but gold, platinum, titanium, molybdenum, tungsten, tungsten-titanium alloys, and palladium are also used. In addition, various refractory-metal silicides are alternatives to polysilicon as a gate and interconnect metallization. In their case, either co-sputtering of the components or sputtering from a silicide target is used.

2.3.10 Metal (Contact) Sintering
In order to achieve good metal-to-semiconductor contact after deposition, heating the combination to a high enough temperature to allow some interdiffusion and alloying to take place is usually necessary. For aluminum, the temperature is usually between 450°C and 500°C. The operation is performed in a diffusion-like tube in either an inert or a hydrogen atmosphere.

2.3.11 Backgrinding For wafers to have enough strength to survive processing without any breakage, their thickness is usually in the 500-600 µm range. However, the chip separation operation works better with thinner wafers. Furthermore, thinner dies allow better heat transfer from the junctions to the lead frames on which the dies are mounted. Consequently, a thinning operation (backgrinding) is often used. This operation is done on a milling-type machine with a diamond-cup grinding wheel. Less modern equipment, based on optical industry processes, uses a large rotating table, on which many slices are mounted by hand, and a grinding wheel that may be 18–24 inches in diameter. Newer equipment is much smaller, grinds the slices individually, and loads automatically

2.3.12 Multiprobing Multiprobing is the testing operation used to sort good electrical dies from bad ones while they are still in wafer form. A series of sharp-pointed probes is mounted so that the probes can be brought down on the wafer to simultaneously contact all of the metallized bonding pads<sup>8</sup> of an individual chip. A series of electrical tests, microprocesser controlled, is then sent to the chip to determine whether it functions properly. An ink dot is placed on each bad chip so that it can be sorted out later.

<sup>&</sup>lt;sup>8</sup>When an IC is assembled, connecting wires from the lead frame to the chip attach to the chip at the bonding pads.

# 2.4

INTEGRATED CIRCUIT FABRICATION 2.4.1 Components and Component Construction The wafer fabrication operations just described are used to construct the various active and passive devices (components) that may be found in an IC. The components available are as follows:

> Bipolar transistors Junction field effect transistors (JFETs) Metal oxide semiconductor field effect transistors (MOSFETs) Metal semiconductor field effect transistors (MESFETs) pn junction diodes Schottky barrier diodes pn junction capacitors MOS capacitors Diffused resistors Thin-film resistors Interconnecting leads Interconnecting leads crossovers

However, that a single IC will contain all possible devices is exceedingly unlikely.

Transistors are the key components in ICs and require the most sophisticated processing. Fig. 2.10 shows simple bipolar, junction field effect, and MOS transistors in cross section. Multiple bipolar (and/or junction field effect) transistors on the same chip can be electrically isolated from one another by pn junctions or by a dielectric layer completely surrounding the transistor. The most common method is pn junction isolation, and it starts with a p-type slice and then adds an epitaxial n-layer on top of it. The transistors are then made in the n-layer and isolated from one another by diffused p-regions, as shown in Fig. 2.11. Transistors designed for highspeed operation may combine thermal oxide sidewall and pn junction isolation, as depicted in Fig. 2.12. Dielectrically isolated (DI) bipolar transistors are used in some high-voltage and radiationhardened applications. As shown in Fig. 2.13, the oxide completely surrounds the collector region and thus provides a higher collectorsubstrate breakdown voltage than is easily obtainable with a junction. Such oxide also prevents the possibility of a large current flow between collector and substrate that could occur with junction isolation in the event of high-intensity photon irradiation.

In some bipolar circuits, having both npn and pnp transistors on the same chip is helpful. By building lateral pnp transistors as shown in Fig. 2.14, rather than having the main portion of the junction parallel to the wafer surface and the current flow normal to the surface, current flow is parallel to the wafer surface. Because of the difficulty in spacing the two diffusions close together, gain will be quite low, but the process has the advantage of allowing the pnp emitter and collector to be made during an npn base diffusion, thus requiring no

# 2.4 INTEGRATED CIRCUIT FABRICATION 41

# FIGURE 2.10

Cross sections of bipolar, junction field effect, and MOS transistors. (Not to scale. Also, in these kinds of drawings, the vertical magnification is much greater than the horizontal magnification.)



(a) npn bipolar transistor



(b) n-channel junction field effect transistor



(c) p-channel MOS transistor





additional steps.<sup>9</sup> JFETs are used primarily in analog circuits where their relatively high input impedance and low noise figure are sometimes desirable.

The simple MOS transistor of Fig. 2.10 has largely been replaced by more complex structures with oxide sidewalls as shown in Fig. 2.15 in order to reduce source-drain capacitance to the substrate. Complementary MOS (CMOS) circuitry requires both n-channel and p-channel transistors in the same chip. To do this, some regions of the chip surface must be n-type and other regions p-type. The manner in which this is done leads to the n-well, p-well,

<sup>&</sup>lt;sup>9</sup>If a complementary transistor circuit is being built, the npn and pnp transistors must be more closely matched than is possible with this approach.

# 2.4 INTEGRATED CIRCUIT FABRICATION 43



or twin-well CMOS structures shown in Fig. 2.16. Note that MOS transistors are self-isolating, but in order to reduce capacitance from source-drain to substrate and thus improve high-frequency performance, MOS transistors on insulating substrates are sometimes used. They are usually built in silicon epitaxially overgrown on single-crystal sapphire (SOS), as shown in Fig. 2.17.

An alternative to the MOS transistor is the MESFET, shown in Fig. 2.18. The source and drain are made as in a conventional MOS transistor. However, instead of an electrode over a thin gate oxide, a Schottky diode placed where the gate oxide would normally be is



MESFET transistor such as is often used in GaAs ICs.



Schottky diode. (Rectification occurs at the metal-semiconductor interface.)





used to pinch off the channel between source and drain. The MES-FET is thus closely akin to the JFET, and like the JFET, it is not self-isolating. MESFETs are often used in GaAs ICs. in which case (as in Fig. 2.18), they can be made in a thin n-layer epitaxially grown on a semi-insulating GaAs substrate. Isolation is then achieved by etching grooves down to the semi-insulating material.

Diodes can be either pn junction or Schottky barrier. When pn junction diodes are used in bipolar circuits, they are generally made during either the base or the emitter diffusions so that processing steps are conserved. In order to minimize series resistance and maximize capacitance per unit area, emitter-base junctions are preferred over collector-base junctions in bipolar circuits. To minimize junction interactions, the collector is often shorted to either the base or the emitter. Schottky diodes, like the one shown in Fig. 2.19, are made by having a barrier material, such as platinum silicide, in contact with a bare, very clean semiconductor surface. The barrier height is lower on p-type than it is on n-type material of the same resistivity so that the reverse breakdown voltage of p-type Schottky diodes is substantially less than that of comparable n-type diodes. For the same diode area, the forward current at a given voltage is lower for a Schottky than it is for a pn junction. A Schottky diode can thus be used in parallel with the collector-base junction of a bipolar switching transistor to keep the transistor from saturating during that part of the switching operation when the collector-base junction is forward-biased.

IC resistors are usually made from a thin, high sheet resistance Si or GaAs diffused layer, but sometimes a high-resistance metal film such as nichrome is used. In the early days of the IC, the base diffusion was also the resistor diffusion. Now, in order to get highervalue resistors into smaller areas, separate higher sheet resistance diffusions are usually used.

Capacitors may be either pn junction, MOS, or some other metal-insulator-metal combination. In order to conserve space,

## 2.4 INTEGRATED CIRCUIT FABRICATION 45

they may be between two levels of leads or in the walls of grooves etched in the wafer, as is the case with trench capacitors (Fig. 2.20).

In order to interconnect the many components of an IC, some leads must cross others. Fig. 2.21 shows the three ways in which crossing is done. The oldest method is the diffused crossunder, in which current of one lead goes under the other by traveling through

FIGURE 2.20

Integrated circuit MOS capacitors: (a) conventional, (b) trench, and (c) stacked.



FIGURE 2.21

Methods of providing insulation between leads where they cross one another.



(a) Diffused crossunder method



(b) Conventional double-level method



(c) Air-bridge method

a high-conductivity path diffused into silicon. When a large number of leads exists, unless some leads are stacked on top of others, leads on the surface, and not active components in the semiconductor, limit IC size. Multiple layers of leads (double-level metal, triple-level metal) use either a CVD insulating layer such as SiO<sub>2</sub> or an organic layer such as a polyimide for insulation. When the capacitance of overlapping leads or a heavily diffused tunnel is too great, air crossovers are sometimes used. They make use of an unsupported loop (bridge) of one conductor where it crosses another.

As described in Table 2.1, the IC fabrication process consists of a number of steps, many repetitious, with wafers moving back and forth from operation to operation until an IC is completed. A tabulation of the sequence of steps for a particular IC may be referred to as a process flowchart. Each of the steps listed in the flowchart is more fully detailed in a process specification (the exact details of which are generally considered trade secrets by the semiconductor manufacturer). As examples of how a flowchart maps out the manner in which the processes of Table 2.1 are combined to produce the necessary circuit elements, four processing flows will be discussed:

- A simple silicon bipolar flow
- A simple aluminum PMOS flow
- A complex silicon CMOS flow
- A gallium arsenide digital IC flow

These flows include only the salient features; it should be understood that the processes followed in commercial manufacture may include many more steps, often proprietary, designed to improve yield and enhance device performance.

For the first example, suppose that a simple silicon TTL (transistor-transistor logic) gate circuit is to be made. The electrical circuit is shown in Fig. 2.22. This circuit has few components, and consequently, even with large feature sizes such as contact openings of  $5\mu m \times 5\mu m$ , it can be laid out as a square approximately 0.7 mm on a side so that several thousand can be placed on a 100 mm wafer. The components are few and simple; only transistors, resistors, and lead crossunders are used. The transistors are pn junction isolated and have a subcollector preepitaxial diffusion. The starting material is a 10-20  $\Omega$ -cm p-type slice with a (111) orientation. Table 2.5 shows the process flowchart for this simple TTL IC. Steps 1-7 of the flowchart produce the subcollector diffusion and the epitaxial layer. Steps 8-10 produce the localized p-diffusions that will provide electrical isolation for the various devices. Not shown on this flowchart is the fact that the base and the resistors each have a design value of 120  $\Omega/sq$ , so that they can both be made by the base diffu-

2.4.2 Steps in IC Fabrication

46

# 2.4 INTEGRATED CIRCUIT FABRICATION 47

# FIGURE 2.22

TTL gate. (There are no diodes in this circuit.)



# TABLE 2.5

Simple Silicon TTL IC Process Flowchart

1 Clean slice.

Step

2 First oxidation: Grow approximately 10,000 Å of thermal oxide.

Operation

- 3 Apply photoresist; expose, using first mask of set; develop to remove resist where subcollector diffusion will be.
- 4 Etch oxide; remove resist; clean wafer.
- 5 Diffuse antimony into openings (windows).
- 6 Remove oxide; clean wafer.
- 7 Grow a 5–6  $\mu$ m thick, 0.2–0.5  $\Omega$ -cm, n-type epitaxial layer.
- 8 Second oxidation: Grow approximately 10,000 Å of thermal oxide.
- 9 Repeat steps 3 and 4 using second mask (isolation mask) of set.
- 10 Diffuse boron for isolation (also to regrow oxide).
- 11 Repeat steps 3 and 4 using third mask (base mask).
- 12 Diffuse boron for bases and resistors (also to regrow oxide).
- 13 Repeat steps 3 and 4 using fourth mask (emitter mask).
- 14 Diffuse phosphorus for emitters, crossunders, and collector contacts (also to regrow oxide).
- 15 Repeat steps 3 and 4 using fifth mask (contacts mask).
- 16 Evaporate aluminum over whole slice.
- 17 Repeat step 3 using sixth mask (leads mask) to define aluminum pattern.
- 18 Etch to remove unwanted aluminum; remove resist.
- 19 Sinter metal to ensure good electrical contact; clean wafer.
- 20 Vapor-deposit protective overcoat (PO) layer of SiO<sub>2</sub>.
- 21 Repeat step 3 using seventh mask (PO mask) to define window openings over bonding pads.
- 22 Etch to remove oxide over bonding pads; remove resist.
- 23 Multiprobe wafer.

sion. This operation is performed in steps 11–12. The sheet resistance design center for the emitters is 5  $\Omega$ /sq. The crossunders are diffused resistors, and since a very low resistance is needed, they can be made at the same time as the emitter diffusion. In order to make good metal lead contacts to the n-type collectors, heavily doped n<sup>+</sup>-regions are required, which can be made at the same time as the emitter diffusions. Steps 13–14 thus provide emitters, crossunders, and heavily doped n<sup>+</sup>-type collector contact regions. Openings in the oxide for contacts are made in step 15. Aluminum is used for leads, and it is evaporated, patterned, and sintered in steps 16– 19. An overcoat to protect the chips from scratches and moistureinduced corrosion is added in steps 20–22. Finally, in step 23, the wafers are multiprobed, and faulty chips are marked with ink.

A comparably simple MOS circuit (now outdated) can be made with an aluminum gate p-channel MOS (PMOS) process, the flowchart of which is shown in Table 2.6. Because of the sensitivity of a MOS device to the cleanup before gate oxidation and the growth of the thin oxide itself, the pregate cleanup and the gate oxide growth steps are substantially more involved than the cleanups and oxide growth steps of the previously discussed TTL process flow. Step 8, to ion implant the threshold adjust, is a step not found in bipolar fabrication, although ion implantation for other purposes may be used.

# TABLE 2.6

48

Simple Aluminum Gate PMOS Process Flowchart

Step

1

- Clean slice.
- 2 Grow field oxide.
- 3 Perform source-drain OR.\*
- 4 Perform boron source-drain diffusion.

Operation

- 5 Perform gate OR.
- 6 Clean pregate.
- 7 Grow gate oxide.
- 8 Ion implant threshold adjust.
- 9 Perform contact OR.
- 10 Metallize.
- 11 Pattern and etch metal.
- 12 Sinter.
- 13 Deposit protective overcoat (PO).
- 14 Pattern and etch PO.
- 15 Multiprobe wafer.

\*OR is a commonly used abbreviation for oxide removal. An OR generally involves the steps of (1) applying the resist, (2) exposing and developing the resist, (3) etching the oxide, and (4) removing the resist and cleaning the wafer.

# 2.4 INTEGRATED CIRCUIT FABRICATION 49

The construction of a twin-well, self-aligned, silicon gate CMOS, as outlined in the process flowchart of Table 2.7, is quite complex compared to the processes covered by the first two example flows. The process flow of Table 2.7 does, however, still use the same basic processes described in Table 2.1, and no individual devices are used in the chip that were not described in section 2.4.1. The twin-well construction was shown in Fig. 2.16. Typically, the starting slice resistivity is about 50  $\Omega$ -cm. The orientation, as for essentially all MOS circuits, is (100). Steps 1-5 produce the n-well, and steps 6-10 the p-well. Oxide isolation at the sidewalls of the sources and drains, as was shown in Fig. 2.15, is used in this IC. The oxide pattern is defined and the oxide is grown in steps 11-16. The region where this oxide is grown is sometimes referred to as a "moat," although the oxide grown is the "field" oxide. Since the thermal oxide thickness is about twice that of the silicon consumed, the surface of a localized oxidation will rise above the rest of the wafer surface. If a planar surface after oxidation is desired, the surface before oxidation must thus be recessed. In this flow, step 14 provides for such a recess. To prevent inversion of p-silicon under the field oxide, a high-concentration boron implant (step 15) is made before field oxidation (channel stop). The circuit described by this flow uses trench capacitors as shown in Fig. 2.20b. They are built in steps 17-25. The gate oxide is grown and the polysilicon gates are formed in steps 26-31. After the gate regions are cleared of thick oxide, a sacrificial oxide (dummy oxide) is grown and stripped before gate oxidation to clean the silicon (step 27). In the self-aligned gate process described here, the polysilicon gate electrode material is used to help define the location of sources and drains, which are made in steps 32-38. This ensures that the gate does not substantially overlap the sources and drains and cause poor performance. After the sources and drains, an interlevel CVD oxide to cover the polysilicon gates and leads is added, and holes are cut in it where the top level of metal is to contact the polysilicon leads (steps 39-40). The top metal is added, patterned, and sintered in steps 41-43. The protective overcoat is deposited and opened up over the bonding pads in steps 41-42. The wafer is thinned, after which its backside is coated with gold to improve electrical conductivity and multiprobe is performed (steps 46-48).

For the final example, consider a gallium arsenide digital IC. High-performance bipolar and MOS transistors are difficult to fabricate in gallium arsenide. The most commonly used GaAs transistor is the MESFET (Fig. 2.18), and it is described in this flow. Both enhancement- and depletion-mode MESFETs can be constructed, and digital circuits are designed using only these transistors and diodes for circuit elements. Analog circuits, such as microwave am-

# TABLE 2.7

Twin-Well, Self-Aligned, Silicon Gate CMOS Process Flowchart

| Step | Operation                                                                     |
|------|-------------------------------------------------------------------------------|
| 1    | Clean slice.                                                                  |
| 2    | Grow thin oxide (stress buffer between silicon and nitride).                  |
| 3    | Deposit silicon nitride (oxidation mask).                                     |
| 4    | Perform n-well nitride removal.*                                              |
| 5    | Ion implant n-well (as an alternative to diffusion deposition).               |
| 6    | Grow oxide over n-well.                                                       |
| 7    | Remove nitride (and expose remainder of wafer, which will be p-well).         |
| 8    | Ion implant p-well.                                                           |
| 9    | Remove oxide; clean wafer.                                                    |
| 10   | Drive-in and anneal ion implants.                                             |
| 11   | Grow thin oxide (buffer).                                                     |
| 12   | Deposit nitride (mask for moat oxidation).                                    |
| 13   | Remove moat nitride.                                                          |
| 14   | Etch silicon to depth of half that of oxide to be grown.                      |
| 15   | Implant channel stop.                                                         |
| 16   | Grow field oxide.                                                             |
| 17   | Deposit oxide for trench mask (for trench capacitor).                         |
| 18   | Perform trench OR (do not remove resist).                                     |
| 19   | Etch trench.                                                                  |
| 20   | Remove resist.                                                                |
| 21   | Diffuse to dope trench walls (n <sup>+</sup> ).                               |
| 22   | Remove oxide grown during trench diffusion.                                   |
| 23   | Grow trench capacitor oxide.                                                  |
| 24   | Deposit polysilicon to fill in trench; form capacitor plate.                  |
| 25   | Pattern polysilicon.                                                          |
| 26   | Perform gate OR.                                                              |
| 27   | Grow dummy oxide.                                                             |
| 28   | Remove (strip) dummy oxide.                                                   |
| 29   | Grow gate oxide.                                                              |
| 30   | Deposit polysilicon for gate electrode and first level of<br>interconnection. |
| 31   | Pattern polysilicon.                                                          |
| 32   | Pattern n-tub sources and drains.                                             |
| 33   | Implant n-tub sources and drains.                                             |
| 34   | Strip resist.                                                                 |
| 35   | Pattern p-tub sources and drains.                                             |
| 36   | Implant p-tub sources and drains.                                             |
| 37   | Strip resist.                                                                 |
| 38   | Anneal sources and drains.                                                    |
| 39   | Deposit CVD interlevel oxide.                                                 |
| 40   | Via OR (cuts holes for contacts between two levels of                         |

- 40 Via OR (cuts holes for contacts between two levels of interconnection).
- 41 Deposit second level of metallization.
- 42 Perform second-level metal patterning and removal.
- 43 Sinter contacts.

plifiers, may use diffused and thin-film resistors, metal-insulatormetal capacitors, and inductors formed by pancake-spiraled leads. GaAs ICs are used only when very-high-frequency operation is required, and to minimize lead crossover capacitance, air bridges as in Fig. 2.21c are common. The starting material is a semi-insulating slice with a (100) orientation. In some processes, as, for example, that used in Fig. 2.18, an n-epitaxial layer is added, and individual devices are isolated by etching away the n-layer between them. In other processes, such as the one described by the flowchart of Table 2.8, localized ion implantation into the semi-insulating GaAs is used to provide the necessary thin n-regions (steps 1-4). Steps 5-6 produce the sources and drains. Excessive arsenic is lost when any processing temperature is over about 600°C unless the wafer is covered with an impervious layer such as CVD SiO, or silicon nitride. Thus, "capping" layers (step 7) are added from time to time during GaAs wafer fabrication. After the capping, the ion implants are annealed. and the cap is removed (steps 8-9). A nitride layer, used for surface passivation, is added in step 10. Resist is applied, contact openings are cut in the nitride, and metal is deposited. The resist is then removed, and with it, all of the metal except that in the contact openings. This "lift-off" process (described in Chapter 5) is done in steps 11-14. The contacts are sintered in step 15. To improve the MES-FET's electrical performance, the implanted channel (step 4) must be reduced in thickness, which is done by etching a slight depression in the GaAs where the gate electrodes are to be applied (steps 16-17). The gate metal is then deposited, and the resist defining the gate areas is stripped (another example of lift-off) in steps 18-19. The first-level metal pattern is then deposited and defined in steps 20-21. In this flow, the second level of metal is only used in the air-bridge crossovers. They are made in steps 22-28. First, resist is applied and patterned to open up points of contact to the first-level metal (step 22). A thin layer of metal is deposited over the resist, after which another coat of resist is applied and patterned to define the airbridge metal (steps 23-24). The continuous, thin metal coating provides electrical contact to all of the bridge sites on the wafer and allows them to be simultaneously electroplated. The electroplating is used to build up the thickness of the bridge metal and provide enough strength to keep it from sagging. After plating, the top layer of resist is removed, and the thin layer of metal between the thick bridge metal sites is etched away. Then, the remaining resist layer (the one of step 22) between the first-level metal and the crossover metal is removed. Finally, in step 29, the wafers are tested.

# 2.4 INTEGRATED CIRCUIT FABRICATION 51

# TABLE 2.7 (continued)

# 44 Deposit protective overcoat.

- 45 Remove protective overcoat at bonding pads.
- 46 Thin wafers (backgrind).
- 47 Deposit gold on backside of wafer (to improve electrical contact of chip to lead frame).

Operation

48 Multiprobe wafer.

Step

\*A nitride removal includes essentially the same steps as an oxide removal.

# TABLE 2.8

Gallium Arsenide Digital MES-FET Process Flowchart

# Step Operation

- I Clean slice.
- 2 Add CVD silicon oxide layer.
- 3 Apply photoresist; expose and develop to remove resist where transistors will be located.
- 4 Ion implant silicon through CVD oxide, using resist as mask (channel implant).
- 5 Repeat steps 3 and 4 for source and drain implants.
- 6 Strip resist; clean wafer.
- 7 Cap whole wafer with more CVD oxide.
- 8 Anneal to activate implanted ions.
- 9 Strip CVD oxide; clean wafer.
- 10 Add CVD silicon nitride layer.
- 11 Apply photoresist; expose and develop to remove resist where contact windows will be located.
- 12 Etch nitride from contact windows.
- 13 Deposit Ge: Au alloy; deposit Ni layer.
- 14 Remove resist along with metal on top of it; clean wafer.
- 15 Alloy to form contacts.
- 16 Apply photoresist; expose and develop to remove resist where transistor gates will be located.
- 17 Etch GaAs to thin gate region.
- 18 Deposit Ti:Pd:Au layer.
- 19 Remove resist along with metal on top of it.
- 20 Apply photoresist; expose and develop to remove resist where first-level leads will be located.
- 21 Deposit first-level metal; remove resist along with unwanted resist.
- 22 Apply photoresist; expose and develop to remove resist where second-level metal (air-bridge metal) will contact first level.
- 23 Deposit thin metal layer.
- 24 Apply photoresist; expose and develop to define air-bridge metal.
- 25 Electroplate exposed pattern.
  - 26 Remove top layer of resist (down to metal of step 23).
  - 27 Etch away exposed thin metal.
  - 28 Remove underlying resist (resist added in step 22).
  - 29 Electrically test circuits.



# Thermal Oxidation of Silicon

# 3.1 INTRODUCTION

Silicon dioxide (SiO<sub>2</sub>, silica) exists both in crystalline form and in vitreous form. The crystalline form occurs in nature (most commonly as the mineral quartz) and can be produced artificially from a high-temperature, high-pressure, alkaline, aqueous solution of SiO<sub>2</sub>. Vitreous silica (fused silica, fused quartz) forms when crystalline silica is melted and allowed to cool, when a silicon-bearing species such as SiH<sub>4</sub> or SiCl<sub>4</sub> reacts with oxygen at elevated temperature, or when silicon is directly oxidized in an atmosphere containing oxygen or water vapor. A silicon surface with its oxide removed will reoxidize in air at room temperature, but because the growth is self-limiting, the maximum thickness is only about 40Å.<sup>1</sup> This native oxide, while somewhat porous, forms a protective coating that prevents corrosion or further oxidation in most room-temperature ambients.

When the temperature is raised several hundred degrees C, the oxidation rate is much greater, and the oxide becomes denser and more durable. This higher-temperature thermal oxide is used in IC manufacturing. It is formed by the direct oxidation of the silicon wafer surface at elevated temperature in either an oxygen or a steam ambient through the following reaction:

 $Si + O_2 \rightarrow SiO_2$ 

or

 $Si + 2H_2O \rightarrow SiO_2 + 2H_2$ 

53

<sup>&</sup>lt;sup>1</sup>The thickness-time curve is dependent on surface cleanliness, the amount of impurities in the air, and the method of measurement. Typically, the thickness as measured by ellipsometer will be about  $5\text{\AA}$  in 5 minutes, 20Å in 15 hours, and perhaps 40Å after a year. Using X-ray photoelectron spectroscopy, the thickness is estimated to be about half as much. (See, for example, S.I. Raider et al., *J. Electrochem. Soc*, 122, p. 413 and references therein, 1975.)

## CHAPTER 3 THERMAL OXIDATION OF SILICON

54

While many materials oxidize by the metal's diffusing to the oxide surface and reacting with oxygen, in the case of silicon, the oxygen diffuses through the oxide and reacts at the silicon surface. This reaction has been deduced from the final location of radioactive materials placed on the oxide surface and of oxygen isotopes introduced into the oxidizing atmosphere partway through an oxidation cycle (1, 2, 3).

The oldest semiconductor use of thermal oxide has been as a diffusion mask.2 The requirements are that the oxide be impervious to the diffusion species for at least as long as the time required for the diffusion and that there be no pinholes in the oxide. Because of the nature of the diffusants, they can penetrate the oxide in two ways. One is by straight diffusion, and the other is by reacting with and gradually dissolving the oxide. Of the standard group IIIA and VA dopants, only gallium has a high enough diffusion coefficient in the oxide to prevent masking. Most of the other dopants react in time with the oxide to form silicates that are liquid at diffusion temperatures. The formation will begin at the outer surface, form a welldefined interface, and gradually move through the oxide. The silicate-glass phase will be rich in dopant, and when the glass has totally consumed the pure oxide, it will be touching the silicon and become a diffusion source. Thus, the thickness of the oxide must be tailored to the required diffusion time and temperature.

While other materials, such as CVD oxide or nitride, can be used as a mask, no commercially proven substitute for a thermal oxide currently exists to provide silicon pn junction passivation or MOS gate oxides.<sup>3</sup> The prime importance of thermal oxide stems from its ability to properly terminate the silicon bonds at the siliconoxide interface. Thus, the deleterious electrical effects of a bare silicon surface or of one covered only with room-temperature native oxide are minimized. The major requirement of such a passivating layer is that it have few electrical charges either in its bulk or at the silicon-layer interface.

Another major application of thermal oxide as a MOS gate insulator requires the same properties needed for good junction pas-

Elm Exhibit 2159, Page 73

<sup>&</sup>quot;The use of thermal oxide as a mask for diffusion predates the planar process by a few years. When concerns about masking first arose, the primary consideration was to keep diffusing impurities from the back of the slice. (See, for example, C.J. Frosch, Chap. 5, "Silicon Diffusion Technology," in F.J. Biondi, ed., *Transistor Technology*, Vol. III, D. Van Nostrand Co., Princeton, N.J., 1958.)

<sup>&</sup>quot;That CVD insulators may eventually be used for MOS gate insulation does seem possible. (See, for example, T. Hosaka et al., "Ultra-Thin Reliable CVD-SiO, Films for VLS1s." *Electrochem. Soc. Ext. Ahst. 86-2*, abst. no. 409, October 1986.)

### 3.2 STRUCTURE 55

sivation. In addition, because the electric field in a gate insulator can approach 10<sup>7</sup> V/cm, far fewer mobile ions in the insulator can be tolerated, and defects such as pinholes, thin spots, and localized high-conductivity regions will cause device failure.

The physical properties of fused silica and of the CVD oxide discussed in Chapter 4 closely match those of thermal oxide. However, for surface passivation, the thermal oxide is the only material that provides satisfactory electrical interface properties. The key factor that enables the thermal oxide to provide the necessary silicon bond termination is its in situ growth, in which the oxidizing reaction occurs at the silicon surface and minimizes the structural damage associated with the abrupt Si–SiO<sub>2</sub> lattice change. Chemically deposited silicon oxide, as well as other materials such as CVD silicon nitride, can, however, be used for diffusion masking.

As shown in Table 3.1, crystalline SiO<sub>2</sub> exists in ten or more crystalline forms. It is of interest in IC fabrication only in that, occasionally, contamination will cause small regions of thermal oxide to crystallize. Where crystallization occurs, the desirable interface properties of the thermal oxide are lost, and silicon dopants can more easily penetrate the oxide.

The structure of both single-crystal and amorphous silica is such that an assembly of silicon atoms is surrounded tetrahedrally by four oxygen atoms as shown in Fig. 3.1 (4). The various crystallographic forms arise from the relative positions of the tetrahedra when they are joined together in a three-dimensional lattice. Through the Si-O-Si link, substantial rotation of one tetrahedron with respect to an adjacent one can occur without changing either the bond length or the angle between bonds. It is thus possible to lose long-range order, become amorphous, and still have close resemblance to the basic crystalline structure. Also, this ability to twist and distort probably allows the silica lattice to match up with and properly terminate the silicon surface bonds. A comparison of angles and distances is given in Table 3.1.

The interface structure has been widely studied (5) and appears to make quite an abrupt transition from single-crystal silicon to amorphous SiO<sub>2</sub>. Fig. 3.2 is a transmission electron micrograph (TEM) of an interface. Over short distances, the surface appears atomically flat but does show steps. This step roughness increases with higher oxidation rates or lower oxidation temperatures. Early spectroscopic studies also led to the conclusion of a very abrupt transition. Knowing that the transition from silicon to amorphous oxide takes place in 5–10 Å still does not answer the question of exactly how the two bond together. Fig. 3.3 shows one speculation.

3.2 STRUCTURE

### 56 CHAPTER 3 THERMAL OXIDATION OF SILICON

TABLE 3.1

SiO<sub>2</sub> Crystallographic Data

| Crystal Class          |                                                      | Description  |        |
|------------------------|------------------------------------------------------|--------------|--------|
| $\alpha$ -quartz*      | Hexagonal; stable at room temperature                |              |        |
| β-quartz               | Hexagonal; stable from 575°C<br>870°C                |              |        |
| α-tridymite            | Orthorhombic; stable at room temperature             |              |        |
| β-tridymite            | Hexagonal; stable from 163°C to<br>1470°C            |              |        |
| $\alpha$ -cristobalite | Tetragonal; stable at room<br>temperature            |              |        |
| β-cristobalite         | Cubic; stable from 218°C to 1710°<br>(melting point) |              |        |
| Keatite                | Tetragonal; stable around 400°C<br>and 5000 psi      |              |        |
| Coesite                | Stable at 750°C and 35,000 atm                       |              |        |
| Fibrous                | Orthorhombic                                         |              |        |
| Bond Angle and Length  | Thermal Oxide                                        | Fused Silica | Quartz |
| Si-O length (Å)        | 1.62                                                 | 1.62         | 1.62   |

 SI=O length (A)
 1.62
 1.62

 O=Si=O angle (°)
 147  $\pm$  10†
 145  $\pm$  10†

\*May be either right- or left-handed.

<sup>†</sup>There is a  $\pm 10^{\circ}$  distribution about the mean.

Sources: Data on crystal class from M. Cannon Sneed and Robert C. Brasted, Comprehensive Inorganic Chemistry, Vol. 7, D. Van Nostrand Co., New York, 1958. Data on bond angles and lengths from Naoyuki Nagasima, Jap. J. Appl. Phys. 9, pp. 879–888, 1970.

## FIGURE 3.1

Si-O bonding details. (Source: Adapted from W.H. Zachariasen, "The atomic arrangement in glass," J. Am. Chem. Soc. 54, pp. 3841-3851, 1932.)



109.5° with the others. The separation of Si and O atoms is 1.62/Å.



(b) Linkage of tetrahedral elements gives a quartz-crystal form. When they are linked in a random manner, amorphous silica results.

High-resolution (lattice-image) transmission electron micrograph (TEM) showing transition from single-crystal Si to amorphous SiO<sub>2</sub>. (The section is normal to a (100) surface.) (*Source:* Courtesy of Dr. H.L. Tsai, Central Research Laboratories, Texas Instruments Incorporated.)



# FIGURE 3.3

Suggested model for Si–SiO<sub>2</sub> transition region, viewed looking into silicon (100) face. (*Source:* Adapted from C.R. Helms, *Semiconductor Silicon* 81, p. 455.)



3.3

NONELECTRICAL PROPERTIES 3.3.1 Density Table 3.2 summarizes many of the properties of thermal oxide and compares them to both silicon and other forms of silica.

The density value reported for thermal oxide is close to that of fused silica and is almost the same as that of silicon (2.22 versus 2.33 grams/cc for silicon). These densities, coupled with their respective molecular weights of 60 and 28, dictate that for every volume V of silicon oxidized, 2.2V of oxide will be generated. In terms of thickness, as shown in Fig. 3.4, for an oxide thickness of t, 0.45t of silicon

### 58 CHAPTER 3 THERMAL OXIDATION OF SILICON

### TABLE 3.2

### Room-Temperature Properties of Silicon and Silicon Dioxide

| Property                                 | Thermal                  | Fused                    | Quartz | Silicon                  |
|------------------------------------------|--------------------------|--------------------------|--------|--------------------------|
| Molecular weight                         | 60.08                    | 60.08                    | 60.08  | 28.06                    |
| Specific gravity                         | 2.22(1)                  | 2.22                     | 2.66†  | 2.33                     |
| Linear expansion coefficient<br>(per °C) | $0.5 \times 10^{-6*}$    | $0.5 \times 10^{-6}$     |        | $2.3 \times 10^{-1}$     |
| Young's modulus (dynes/cm2)              | $6.6 \times 10^{11}$ (2) | $7.3 \times 10^{11}$ (3) |        | 16.9 × 10 <sup>0</sup> ‡ |
| Poisson's ratio                          |                          | 0.17                     |        |                          |
|                                          |                          |                          |        |                          |

\*Fused silica value.

<sup>†</sup>Coesite, metastable at room ambient, has a specific gravity of 3.0. Cristobalite, at 2.2, is the same as fused silica. <sup>‡</sup>In any direction in a (111) plane. The range in other directions is from  $13-18.8 \times 10^{11}$  dynes/cm<sup>2</sup>.

Sources: (1) Naoyuki Nagasima, Jap. J. Appl. Phys. 9, pp. 879–888, 1970. (2) R.J. Jaccodine and W.A. Schlegel, J. Appl. Phys. 37, pp. 2429–2434, 1966. (3) H.J. McSkimin, J. Appl. Phys. 24, pp. 988–997, 1953.

### FIGURE 3.4

Consummation of silicon during thermal oxidation.



will be consumed, and the top of the oxide will rise 0.55t above the original silicon surface.

Because the oxide occupies more space as it grows than does the oxide consumed, appreciable compressive stress in the oxide near the interface is generated. The free top surface allows reduction in a direction normal to the growth surface, but since the oxide is tightly bonded to the silicon, no slipping can occur along the interface in the other two directions. Thus, high intrinsic oxide stresses can arise during growth (6, 7). However, above about 950°C, silicon dioxide has a low enough viscosity for the stress to relax, and little residual stress remains. For oxidations at temperatures below 960°C, the relaxation is much slower, and intrinsic stresses of up to  $7 \times 10^9$  dynes/cm<sup>2</sup> have been reported. Lower stresses have been found in wafers oxidized in wet oxygen than in those oxidized in dry oxygen (8). This fact is consistent with a lowering of silica viscosity when it contains H<sub>2</sub>O (9).

### TABLE 3.3

Effect of Oxidation Temperature on Oxide Density as Deduced from Index of Refraction Measurements

| Temperature ( $^{\circ}C$ ) | Density |
|-----------------------------|---------|
| 600                         | 2.286   |
| 700                         | 2.265   |
| 800                         | 2.253   |
| 900                         | 2.236   |
| 1000                        | 2.224   |
| 1150                        | 2,208   |

Source Adapted from E. A. Irene et al., J. Electrochem, Soc 129, p. 2594, 1982.

3.3.2 Expansion Coefficient 3.3 NONELECTRICAL PROPERTIES 59

As the oxidation temperature decreases, there is, based on index of refraction measurements, a slight increase in density (8-10). This change is shown in Table 3.3. By annealing after oxidation in an inert atmosphere at temperatures above about 1000°C, the density and index of refraction return to their high-temperature oxidation values (9). The density decrease after annealing can be independently deduced by measuring the increase in thickness, assuming that care has been taken to ensure that the ambient was indeed nonoxidizing. The thickness change after a 700°C oxidation is about 3%, which is in reasonable agreement with index of refraction measurements (9). The presumed reason for the increasing density at the lower growth temperatures is the reduced amount of viscous flow, which, in turn, prevents the relief of the strain in the oxide that occurs during growth. An increase in the density of oxides grown at high pressures has been observed, but they are also grown at low temperatures, which is apparently the reason for the density increase.

A very large difference exists between the thermal expansion coefficients of silicon and thermal oxide. Therefore, as an oxidized slice cools from oxidation temperature (usually in the  $800^{\circ}C-1200^{\circ}C$  range), the oxide film will be in compression. If residual internal stress occurs, which is also compressive, the two will be additive. If there is an oxide layer on each side of the wafer, the stress on the two sides will be balanced, and there will be no wafer bowing. However, if the oxide is stripped from one side, as is the usual case, substantial bowing may occur. The stress due to thermal mismatch can be up to  $3 \times 10^{\circ}$  dynes/cm<sup>2</sup> in the oxide when it is cooled from 1200°C to room temperature (12). Because of the much greater thickness of the slice, the maximum stress in it will be from 10 to 100 times less.

The stress  $\sigma_i$  in the oxide of a circular slice-oxide sandwich can be determined from the bow in the wafer from Eq. 3.1 (13) as follows:

$$\sigma_f = \frac{\delta E(T^2/t)}{3(1-v)r^2}$$
 3.1

where  $\delta$  is the deflection of the center of the wafer, *E* is Young's modulus of the wafer, *T* is the thickness of the wafer, t is the thickness of the oxide film,  $\nu$  is Poisson's ratio of the wafer, and *r* is the radius of the wafer. The stress  $\sigma_i$  in the slice due to bowing caused by the stress  $\sigma_i$  in the oxide film is given by Eq. 3.2 (12) as follows:

1

$$\sigma_{\rm c} = \frac{-4\sigma_f t}{T} \qquad 3.2$$

CHAPTER 3 THERMAL OXIDATION OF SILICON

Assuming no viscous flow occurs during cooldown (which would reduce the stress), the stress due to thermal mismatch alone can be calculated from Eq. 3.3 as follows:

$$\sigma_f = \frac{(\alpha_f - \alpha_s)E_f \Delta T}{1 - \nu_c}$$
3.3

where  $\alpha_i$  is the expansion coefficient of the wafer (substrate) and  $\alpha_i$ is the expansion coefficient of the oxide film. E, is Young's modulus of the film,  $\Delta T$  is the oxidation temperature minus room temperature, and  $v_i$  is Poisson's ratio of the oxide. If the cooldown is slow, or if subsequent annealing occurs at temperatures lower than the oxidation temperature, viscous flow relaxation can reduce the mismatch stress. Thus, for higher-temperature oxidations, Eq. 3.3 would be expected to overestimate stress.

EXAMPLE Deglecting viscous flow in the oxide or slip in the silicon, use Eq. 3.3 to calculate o, at room temperature if the oxidation were done at 1200°C.

> From Table 3.2,  $E_i = 6.6 \times 10^{11}$  dynes/cm<sup>2</sup> and  $v_i = 0.17$ . Also from the table,  $\alpha_i = 5 \times 10^{-7/\circ}$ C and can be used since the value changes little in going from room temperature to 1200°C. However, the room-temperature value given for silicon is too low. Looking up  $\alpha_s$  in a handbook shows that 4.5  $\times$  10<sup>-6</sup> is a better value over the whole temperature range than is 2.3.  $\Delta T = 1175^{\circ}$ C. Substituting these values in Eq. 3.3 gives

> > $\sigma_c = 3.7 \times 10^{\circ}$

which compares well with, but is higher than, the reference 12 value of  $3 \times 10^{\circ}$  dynes/cm<sup>2</sup>.

Any intrinsic stress remaining in the oxide can be determined by subtracting the calculated stress due to thermal mismatch (Eq. 3.3) from the measured total stress (Eq. 3.1).

The refractive index varies with oxidation temperature, as is shown in Fig. 3.5 for growth on two different crystallographic orientations. The value saturates at high growth temperature at 1.4620 for a wavelength of 5461 A, which is slightly higher than the value of 1.4601 for fused silica (10). The infrared absorption peaks associated with P-O, B-O, and As-O stretching frequencies can be used to estimate the amount of phosphorus, boron, or arsenic incorporated into the thermal oxide during the diffusion process. This procedure has, however, now been largely superseded by X-ray fluorescence (14). The O-H line at wavenumber 3650 can be used to estimate the amount of hydroxyl ions present in steam-grown oxides (15). The

3.3.3 Optical Properties

60

Index of refraction of thermal oxide as a function of growth temperature and orientation. (*Source:* Adapted from E.A. Taft, *J. Electrochem. Soc.* 125, p. 968, 1978.)



locations of both silica lattice and impurity absorption peaks are given in Table 3.4. The optical transmissivity is very low for wavelengths longer than about 5  $\mu$ m, but it is very high from 3.5  $\mu$ m down to less than 0.2  $\mu$ m.

Silica is only slightly soluble in water at room temperature. The percentage concentration C can be calculated from Eq. 3.4<sup>4</sup> as follows:

$$C = 0.382 \times (13.6 + T) \times 10^{-3} \qquad 3.4$$

where T is the temperature in degrees C. The dissolution rate is so slow that atmospheric pressure water-etching is normally inconsequential. However, at higher pressures, etching can become pronounced in a steam atmosphere and must be considered in highpressure oxidation processes. Silica will dissolve in aqueous HF and in heated solutions of bases such as KOH and can be plasma-etched in fluorine compounds such as  $CF_4$ . (Etching details are covered in Chapter 6.)

The reactivity of thermal oxide (and CVD oxide as well) with various metals is of concern in that IC metallization primarily consists of thin metal strips running over thin oxide. Slight interaction

3.3.4 Chemical Properties

<sup>4</sup>See G.W. Alexander et al., J. Phys. Chem. 58, p. 453, 1954.

### 62 CHAPTER 3 THERMAL OXIDATION OF SILICON

| TA | BI | F | 3 | 4    |
|----|----|---|---|------|
|    | ~~ | - | ~ | e 1. |

| Oxide | Optical |
|-------|---------|
|-------|---------|

Absorption Lines

| Wavenumber<br>(per cm) | Wavelength<br>(µm) | Assignment       | Reference |
|------------------------|--------------------|------------------|-----------|
| 458                    | 21.8               | Si-O-Si, bending | (1):      |
| 720                    | 13.9               | B-O-B            | (2)       |
| 810                    | 12.3               | O-Si-O, bending  | (2)       |
| 930                    | 10.8               | As-O, stretching | (3)       |
| 1075                   | 9.30               | Si-O             | (4)       |
| 1096                   | 9.12               | Si-O, stretching | (1)       |
| 1165                   | 8.58               | Si-O             | (4)       |
| 1190                   | 8.40               | Si-O, stretching | (1)       |
| 1220*                  | 8.20               | Si-O             | (4)       |
| 1330                   | 7.52               | P-O, stretching  | (3)       |
| 1390                   | 7.19               | B-O, stretching  | (3)       |
| 2240                   | 4.46               | Si-H, possibly   | (3)       |
| 2260                   | 4.42               | Si-O, overtone   | (3)       |
| 3650                   | 2.74               | O-H, stretching  | (3)       |

\*Observed in cristobalite. When a line is seen in silicon, it is taken as indication of small cristobalite regions (5).

References: (1) Naoyuki Nagasima, Jap. J. Appl. Phys. 9, pp. 879-888, 1970.
(2) D.M. Brown et al., J. Crystal Growth 17, pp. 276-287, 1972. (3) W.A. Pliskin. pp. 506-529, in Howard R. Huff and Ronald R. Burgess, eds., Semiconductor Silicon/73, The Electrochemical Society. Princeton, N.J., 1973. (4) M.L. Naiman et al., Electrochem. Soc. Ext. Abst. 83-1, abst. no. 124, pp. 196-197, 1983. (5) F. Shimura et al., Appl. Phys. Lett. 37, p. 483, 1980.

is desirable so that the metal will adhere well, but excessive reactions can consume the metal or the oxide. Molten aluminum (melting point = 660°C) will reduce silicon dioxide, and at temperatures as low as 500°C, aluminum leads will react enough with thermal oxide to noticeably reduce the oxide thickness. Titanium begins to react at 400°C and, by 500°C, will have reduced the oxide thickness (16). There is also evidence that electolytic action can occur at metalsilica interfaces when metallic ions from MOS electrodes are injected into the SiO<sub>2</sub> (17).

Oxides of arsenic, boron, phosphorus, and antimony, as well as oxides of many other metals, will interact with silicon oxide at elevated temperatures to give mixed oxides with softening temperatures well below that of pure  $SiO_3$ . These oxides are useful if, for example, rounding the sharp corners of an etched oxide by heating to around 1000°C is desired. However, if keeping a phosphorus-glass diffusion source separated from the silicon surface by a layer of thermal oxide is desired, it is a distinct disadvantage.

Heating thermal oxide in the 1000°C temperature range in an ammonia atmosphere will give a "nitrided" oxide with an index of refraction higher than that of the oxide. The mechanism appears to be the addition of nitrogen atoms to the lattice and/or a nitride growth at the silicon-oxide interface, and not an oxygen atom replacement. Thin "thermal nitride" films can also be grown directly on a bare silicon surface (18, 19). These films contain oxygen and perhaps should be characterized as oxynitrides.

As shown in Table 3.5, the electron mobility in silicon dioxide is reasonably high, but because of the large bandgap, current flow is limited by the ability to inject carriers into the oxide (electrode-limited conduction).<sup>5</sup> The emission near room temperature, with electric fields greater than about  $3 \times 10^6$  V/cm and for oxide thicknesses above about 50 Å, is best characterized as Fowler–Nordheim (20, 21). For lower fields, Schottky emission has been suggested (22). In many cases, in the low-field region, motion of ions already in the oxide or trap filling may allow a current flow relatively independent of voltage that will be large enough to mask other effects.

When Fowler-Nordheim (F-N) emission occurs,

where J is the current density, E is the electric field, and A and B are constants. Since this conduction depends on the electrode-toinsulator barrier height, A and B depend on the electrode material. For current flow described by Eq. 3.5, a plot of log  $(J/E^2)$  versus

| Property                                                                     | Value                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
|------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Dielectric constant                                                          | 3.9                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| Dielectric strength                                                          | ~ 107 V/cm*                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| Bandgap                                                                      | -9 eV                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
| Resistivity                                                                  | 10 <sup>12</sup> =10 <sup>16</sup> Ω-cm <sup>3</sup>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| Electron mobility                                                            | 20-40 cm <sup>2</sup> /V·s                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| Hole mobility                                                                | $-2 \times 10^{-3} \text{ cm}^2/\text{V} \cdot \text{s}$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| *Function of growth conditions and thickness.<br>†Function of applied field. |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|                                                                              | A CONTRACT OF A |

Source: Bandgap, electron mobility, and hole mobility from data listed in D.J. DiMaria, "Defects and Impurities in Thermal SiO<sub>2</sub>," in S.T. Pantelides, ed., *The Physics of SiO<sub>2</sub> and Its Interfaces*, Pergamon Press, New York, 1978.

# 3.4

ELECTRICAL PROPERTIES 3.4.1 Bulk Electrical Conduction

TABLE 3.5

**Oxide Electrical Properties** 

Another example of electrode-limited conduction is Schottky emission from metal contacts to high-resistivity silicon (discussed in Chapter 10). Bulk-limited conduction includes the familiar electronic conduction in semiconductors as well as ionic conduction observed in ionic crystals at elevated temperatures.

1/E will be a straight line. Fig. 3.6 shows typical room-temperature data for several electrode materials. For polysilicon, the most often used MOS gate electrode material, as empirically determined,  $A = 2 \times 10^{-6}$  A/V<sup>2</sup> and  $B = 2.7 \times 10^8$  V/cm (23). The temperature dependence, as shown in Fig. 3.7, is relatively insensitive to temperatures up to perhaps 200°C. It is this behavior that helps separate F–N tunneling from Schottky emission, which shows an exponential current rise with temperature. For very thin oxides, there are deviations from the F–N prediction of I–V behavior. However, by using the WKB approximation<sup>6</sup> to solve for electron tunneling through various barrier shapes, good agreement between predicted and experimentally observed I–V curves is obtained (24).

Despite the fact that current is electrode limited, an effective resistivity can be calculated for a given field strength and is useful in comparing the thermal oxide to other materials:

$$\rho' = \left(\frac{V}{L}\right) \left(\frac{A}{I}\right) = \frac{E}{J} = \frac{1/E}{J/E^2}$$
 3.6

where  $\rho'$  is the effective resistivity, *L* is the thickness of the oxide, and *A* is the cross-sectional area of the contact. From Fig. 3.6,<sup>7</sup> for an aluminum contact and a field strength *E* of 5.5 × 10<sup>6</sup> V/cm,  $\rho' =$ 1.8 × 10<sup>18</sup> Ω-cm. If the field were reduced to 10<sup>5</sup> V/cm, the effective



<sup>&</sup>lt;sup>b</sup>A method of approximating the solutions of a second-order linear equation that was first applied to the Schrödinger equation by G. Wentzel, H.A. Kramers, and L. Brillouin in 1926.

Fowler–Nordheim plot of thermal oxide *I–V* characteristics. (*Source:* Adapted from M. Lenzlinger and E.H. Snow, *J. Appl. Phys.* 40, p. 278, 1969.)

FIGURE 3.6

<sup>&</sup>lt;sup>7</sup> Values reported by different observers may vary from one another by a factor of at least 10.

Variation of current through oxide as a function of temperature. (Upper curve is for electron injection from silicon; lower curve, for injection from aluminum.) (Source: Data from C.M. Osburn and E.J. Weitzman, J. Electrochem. Soc. 49, p. 603, 1972.)



resistance would increase by several orders of magnitude. Alternatively, a specific contact resistance  $R_c$  in  $\Omega$ -cm<sup>2</sup> can be calculated from Eq. 3.7 as follows:

$$R_c = \frac{V}{J} = \frac{E \times L}{J} = \frac{(1/E) \times L}{J/E_2}$$
3.7

If the oxide in the example above were 1000 Å thick,  $R_e$  would be  $1.8 \times 10^{13} \Omega$ -cm<sup>2</sup>, compared to a value of about 10<sup>6</sup>  $\Omega$ -cm<sup>2</sup> for a Schottky barrier on 1  $\Omega$ -cm silicon. With such high values, the actual magnitude would ordinarily be of little concern, but EEPROM devices depend on injecting charge onto a floating gate and having that charge not leak away over the course of several years.

Above about 400°C, the conduction mechanism apparently changes, and the current becomes linearly dependent on voltage (25). Data are shown in Fig. 3.8.

The surface sheet resistance of thermal oxide is a strong function of humidity, as shown in Fig. 3.9. Because of a finite conductivity, charge can move out from a contact (charge spreading) and, if severe enough, create an inversion layer in the silicon beneath. It has been

3.4.2 Surface Electrical Conductivity



shown that the potential V on the outer oxide surface in the vicinity of an electrode is given by Eq. 3.8 (26) as follows:

$$V = V_o \operatorname{erfc}\left(\frac{x}{2\sqrt{t/RC}}\right)$$
 3.8

where  $V_{\phi}$  is the voltage on the contact, x is the distance out from the contact, t is the time after the voltage is applied to the contact, R is the sheet resistance of the surface, and C is the oxide capacitance.

For any part of the area where V is greater than the threshold voltage  $V_i$ , an inversion layer will occur.

# 3.4.3 Dielectric Breakdown Strength

Dielectric strength is on the order of 107 V/cm. Except under conditions of severe electrical overstress, bipolar ICs will not have fields close to this value. MOS devices, which depend on thin gate oxides for good electrical performance, usually operate very close to breakdown. Thus, any process fluctuations that cause a reduction in dielectric strength will generally cause yield losses and may cause long-term reliability problems as well. The distribution of breakdown fields for MOS capacitors in Fig. 3.10 shows the three peaks often observed. The cluster of very low breakdowns is ascribed to faulty fabrication involving, for example, pinholes in the oxide (23) and gross contamination on the surface before oxidation. The move to better cleanrooms has led to the continuing decrease of the number of units in this category. The next cluster, sometimes merging with and thus giving a long tail to the maximum breakdown distribution, is apparently caused by a reduction in barrier height brought about by such things as silicon surface asperities,8 small regions of metallic precipitates (often decorated crystal defects) such as Ni. Cu, and Sn in the underlying substrate (27), or locally high-sodium



<sup>\*</sup>This effect gives orders of magnitude-increased current for the same field, and it has been suggested that deliberate texturing could be used to reduce the voltage needed to program EEPROMs.



Breakdown field distribution for MOS capacitors.

concentrations in the oxide itself (28). Unlike the behavior of the low-breakdown peaks, the peak of the maximum-breakdown distribution increases as the oxide thickness decreases. Otherwise, it appears to be relatively insensitive to processing parameters (29, 30).

The effect of asperities is generally observed only when the oxide is grown on polysilicon (31). Crystal defects are minimized by oxidizing in a very clean environment and using wafer gettering whenever possible (see Chapter 8 for a discussion of gettering). Good slice surface cleanup before oxidation will minimize local areas of heavy metal precipitates (21). Further improvement can sometimes be obtained by oxidizing, stripping the oxide, and then reoxidizing (32). This step is built into most MOS processes in that either a dummy gate oxide or the thick field oxide is removed and a gate oxidation is then performed. Plasma cleaning of the silicon surface before oxidation may also help (33). Oxidizing in an ambient with a small amount of chlorine-bearing compound present' will also minimize low breakdowns due to metallic contamination (34). Wet oxidation shifts the high-field peak distribution up somewhat from those obtained with dry oxidation but gives oxides with higher mobile ion contamination (35).

In addition to the initial breakdown failures just described, when an otherwise good oxide is subjected to high fields for long periods of time, failures will occur. This phenomenon is referred to as timedependent dielectric breakdown and is one limit to MOS device lifetime. It is sometimes called a latent defect since it shows up only after prolonged operation. However, it is probably better characterized as a wear-out mechanism. The cause(s) is not clearly defined, but apparently it is a barrier height reduction caused by the accumulation of charges near an electrode (36–39). The charge can come from holes, electrons, or ions injected into the oxide from the contacts. Since the failure rate is more severe the thinner the oxide, and as devices have shrunk in size and the gate oxide has been scaled down in thickness, more and more attention has been given to this problem.

The failure rate appears to follow a log normal distribution<sup>10</sup> (40, 23, 41) so that cumulative failures give a straight line when they are plotted on a probability scale axis against time on a logarithmic

68

<sup>&</sup>quot;This process is almost always used in MOS manufacture in order to minimize sodium- and potassium-induced threshold drift.

<sup>&</sup>lt;sup>10</sup>A log normal distribution is one that shows the typical bell-shaped curve when the independent variable is plotted on a log scale. The log standard deviation is given by  $s_{log} = \log$  (time for 50% failures/time for 15.9% failures).

axis.<sup>11</sup> The failure rate increases exponentially both with operating temperature and gate voltage. A model of form

$$TF(x\%) = Ke^{U/kT}e^{h(k)} \qquad 3.9$$

fits thin-oxide data reasonably well (41). TF(x%) is the time for x% of the devices to fail. K is a constant. U is an activation energy that is dependent on the applied electric field E, k is Boltzman's constant, and f(E) is a temperature-dependent function of the applied field. Table 3.6 shows estimates for 100 Å oxides and illustrates the point that, with the current state of processing, thin oxides must have reduced applied voltages in order to exhibit the long lifetimes expected of semiconductor devices. An alternative to time-to-breakdown as a performance criterion is charge-to-breakdown. In this case, the total charge required for failure is used as a measure of performance.

It has long been observed that, under the influence of an electric field, some metal ions can move rather easily through glass<sup>12</sup> and can similarly move through high-purity silica. Na (sodium) was identified in the 1960s (42) as a contaminant that was usually found distributed throughout semiconductor oxides and that would easily migrate under the influence of an electric field. Fields such as those shown in Fig. 3.11, which are required for device operation, are large enough, and in many cases of the required polarity, to cause these mobile ions to pile up at the oxide–silicon interface, where the extra charge drastically affects device operation.

| Field   | Room Temperature | 85°C            |
|---------|------------------|-----------------|
| (MV/cm) | Time (hours)     | Time (hours)    |
| .8      | 1,5              | 0.66            |
| 7       | 250              | 25              |
| 6       | 20,000           | 550             |
| 5       | 3.250,000        | 2500 (3.5 month |

Source: Data extrapolated from information in reference 41 in end-of-chapter listing.

3.4.4 Mobile Ion Transport

# TABLE 3.6

Time for 10% of 100 Å Oxide Capacitors to Fail

<sup>&</sup>lt;sup>11</sup>This kind of distribution is often observed for semiconductor device failure rates. <sup>12</sup>A demonstration of 50 years ago was to immerse a burning light bulb in a container of a molten (300°C) mixture of NaNO, and NaNO, and then, by biasing the light filament negatively, deposit metallic sodium on the inside of the light bulb.



Origin of fields that may cause ion drift.



In addition, the other alkali metals K<sup>+</sup> (potassium), and Li (lithium) are quite mobile. Cs (cesium), apparently because of its large ionic radius, is virtually immobile. Lithium moves about as sodium but is not as widely distributed as sodium and potassium. Sodium is to be found nearly everywhere, and that fact combined with the fact that it is more mobile than potassium makes it the most troublesome of the three. About a hundred degree higher temperature is required for potassium to drift comparably to sodium (43, 44).

One-dimensional ionic motion by diffusion and field-induced drift is described by Eq. 3.10 as follows:

$$\frac{D\partial^2 N}{\partial x^2} - \frac{v\partial N}{\partial x} = \frac{\partial N}{\partial t}$$
3.10

where D is the diffusion coefficient, N is the ionic concentration, v is the velocity given to the ion by the field, and t is the time. The velocity  $v = \mu E$  where E is the electric field and  $\mu$  is the mobility. The mobility is related to the diffusion coefficient D through the expression  $\mu = qD/kT$ ; thus, for a singly charged ion, v = qDE/kTwhere k is Boltzmann's constant and T is the temperature. Experimentally, it has been demonstrated that field-aided diffusion is not what limits the transport of sodium and potassium ions in thermal silicon oxide. Rather, the ions are apparently initially trapped at the metal-oxide interface, and the rate of transfer to the silicon-oxide interface depends primarily on the rate of emission of ions into the oxide (42-46). Thus, diffusion coefficients determined without considering ion trapping at the interfaces will be substantially understated.

The number of ions transported from one interface to another  $N_i$  in ions/cm<sup>2</sup> is given by Eq. 3.11 as follows:

3.4 ELECTRICAL PROPERTIES 71

$$N_i = N_0 K t^{1/2}$$
 3.11

where K is a constant that depends on field, temperature, and the ion species. It should be remembered that in actual practice a fixed number of ions will usually be moved so that  $N_0$  of Eq. 3.11 will be decreasing with time. A MOS transistor threshold voltage will shift as the charge is added to the oxide-silicon interface and can be used to calculate the ion density.<sup>13</sup> The shift  $\Delta V$  is given by Eq. 3.12 as follows:

$$\Delta V = \frac{-N_i q}{C_{\rm ox}} \qquad 3.12$$

where  $C_{ox}$  is the capacitance per unit area of the oxide and q is the electronic charge. Note that electronic charge is measured in coulombs (C).

The first method developed to combat sodium migration (introduced before the actual ion causing the trouble was identified) was the use of heavy phosphorus doping in the top region of the oxide (49). The heavy phosphorus layer both getters existing Na to it and prevents subsequent contamination by blocking Na movement through it (50). In the making of bipolar circuits, getting occurs normally during the emitter cycle of most npn transistors. The only requirement is that the heavy phosphorus layer not be subsequently etched away. In order to maintain good long-term stability, a severalpercentage concentration of phosphorus should be present. However, if the concentration gets too high, reactions with moisture and aluminum lead systems can occur and cause a different kind of failure. The use of a thin layer of 4% P.O. will reduce the migration time by several orders of magnitude. The mechanism for retarding positive ion motion is thought to be ion trapping by a negatively charged P-O complex residing at an Si site in the SiO<sub>2</sub> lattice (46).

It was next discovered that growing oxides in the presence of chlorine<sup>14</sup> would produce an oxide free of sodium and with immunity from rapid sodium migration (51, 52). The chlorine sources first used were Cl<sub>2</sub> and HCl, but very shortly afterward trichloroethylene was

<sup>&</sup>quot;The change in the shape of a MOS capacitor C-V plot can also be used to determine  $N_c$ .

<sup>&</sup>lt;sup>19</sup>The development of chlorine treatments seems to be an example of independent studies in two different laboratories. Further, in both cases, these studies appear to have been initiated as the result of closely examining the results of some other investigation. Bell Northern (51a) introduced sodium contamination from NaCl and from Na<sub>2</sub>CO, and observed that there was not nearly as much contamination from the NaCl source, RCA (51b) grew oxides from a water source with HCl in it (looking for gettering of heavy metals) and observed a lower sodium contamination level.

suggested since it is not corrosive as is HCl (53). Considerably later, trichloroethane was also introduced as being equally as effective and less toxic (54). In general, the more chlorine incorporated into the oxide, the greater the protection, but too high a concentration will produce a chlorine-rich phase separation at the oxide–silicon interface and, in some cases, a blistering of the oxide (55). The region to be avoided is roughly the one bounded by oxidation temperatures above 1100°C and HCl concentrations greater than 8%. Even at low concentrations, the chlorine is still primarily segregated near the Si–SiO<sub>2</sub> interface (56). The chlorine incorporation does not prevent the sodium from moving through the oxide but rather apparently traps and neutralizes it in the high-chlorine concentration region near the interface.

Simultaneously with these activities, great effort has been made to provide a sodium-free processing environment. This effort includes the use of purer materials, HCl cleaning of furnace tubes, and improved cleanroom procedures to minimize contamination from manufacturing personnel. Another method for providing sodium protection is to use an overcoat material that is impervious to sodium migration. Silicon nitride is such a material and is often used. Low-temperature plasma-assisted CVD nitride that can be applied after metallization can be used as a combination scratch protection for leads and moderately effective sodium barrier. High-temperature CVD nitride is much more effective but must be used before metallization.

3.4.5 Interface Charges In addition to the mobile charges just discussed, a variety of charges also reside at or near the Si–SiO<sub>2</sub> interface. The most common of these charges are the fixed positive oxide charge  $Q_i$  (or  $Q_{in}$ )<sup>15</sup> and a variable interface charge  $Q_{in}$  located in traps variously referred to as surface traps, interface traps, surface states, or fast states (43). The number of charges  $N_{in}$  (or  $N_{in}$ ) associated with each can range from  $10^{10}-10^{12}$  charges/cm<sup>2</sup>, and anything above  $10^{10}$  charges/cm<sup>2</sup> will often adversely affect performance for both bipolar and MOS devices. In the case of MOS,  $Q_i$  causes the turn-on voltage  $V_i$  to shift by an amount  $-Q_i/C_{ox}$ , where  $C_{ox}$  is the capacitance per unit area of the oxide.

EXAMPLE

 $\Box \quad \text{Determine } \Delta V_i \text{ for } N_a = 10^{10} \text{ charges/cm}^2 \text{ if the gate oxide thick-ness} = 1000 \text{ Å}.$ 

<sup>&</sup>quot;Old terminology. New symbols were adopted as an outgrowth of a joint Electrochemical Society and IEEE committee. (See, for example, Bruce F. Deal, J. Electrochem, Soc. 127, p. 979, 1980.)

Capacitance is given by  $A \varepsilon \varepsilon_0 / X$  where  $\varepsilon$  is the oxide dielectric constant,  $\varepsilon_0$  is the permittivity of free space, X is the oxide thickness, and A is the area. A consistent set of units uses permittivity =  $8.85 \times 10^{-14}$  farads/cm (abbreviated F/cm), X in cm, and A in cm<sup>2</sup>. Thus,

$$C_{\rm os}/\rm{cm}^2 = \frac{3.9(8.85 \times 10^{-14} \text{ F/cm})}{10^{-5} \text{ cm}}$$
  
= 3.45 × 10<sup>-8</sup> F/cm<sup>2</sup>  
$$Q_a = qN_a = 1.6 \times 10^{-19} \text{ C} \times 10^{11}/\rm{cm}^2$$
  
= 1.6 × 10<sup>-8</sup> C/cm<sup>2</sup>  
$$\Delta V_i = -\frac{1.6 \times 10^{-8} \text{ C/cm}^2}{3.45 \times 10^{-8} \text{ F/cm}^2}$$
  
= -0.46 C/F = -0.46 V

This value is comparable to other terms in the expression for  $V_t$ and thus would not result in a catastrophic shift in  $V_t$ . However, if  $Q_t/q$  were  $10^{12}$ , then its contribution to  $V_t$  would be 4.6 V, and very poor transistors would result. If  $Q_t/q$  were in the  $10^{10}$  to  $5 \times 10^{10}$ range, then its effect would be negligible.

The kind and amount of charge at any given time due to the interface states will depend on the energy distribution of the states, whether they act as donors or acceptors, and the position of the Fermi level (57). These kinds of states give rise to surface recombination and thus can be particularly troublesome for bipolar devices requiring high values of  $H_{fe}$  at low collector currents. With no surface space charge region, the surface recombination velocity *S* is given by Eq. 3.13 (58) as follows:

$$S = \sigma_c v_{th} N_{it}$$
 3.13

where  $\sigma_c$  is the capture cross section of the state for a carrier (10<sup>-16</sup> to 10<sup>-15</sup> cm<sup>2</sup>),  $v_{th}$  is the thermal velocity of carriers ( $\approx 10^7$  cm/s at room temperature), and  $N_{it}$  is the density of states/cm<sup>2</sup>. It has been shown that, in general, when the fast states are uniformly distributed in energy over the central portion of the bandgap, only those within a few kT of mid-gap will contribute. Under these circumstances,  $\pi kTD_{it}$  where  $D_{it}$  is the density per unit area per unit energy is substituted for  $N_{it}$ .

EXAMPLE

If  $D_{ii} = 10^{12}$ /cm<sup>2</sup> – eV for a particular thermal oxide, calculate the surface recombination velocity when  $\sigma_i = 10^{-15}$  cm<sup>2</sup>.

 $k = 8.62 \times 10^{-5} \text{ eV/K}$  (since  $D_u$  is in terms of eV, this value must be used rather than  $1.38 \times 10^{-16} \text{ erg/K}$ ), and S = 811 cm/s, which is extraordinarily high. However, as will be seen shortly, various anneals are available that will reduce  $N_u$  by nearly two

orders of magnitude. *Also*, the high end of the estimated range of capture cross section was used in this calculation. The actual measured values at planar surfaces range from 250 cm/s down to 1–10 cm/s after the proper anneals. These small values found after annealing have little effect on device performance.

Both the fixed charge and the interface charge depend on the orientation of the surface on which the oxide is grown (59). Their values decrease in order from the (111) to (110) to (100) orientation, and since lower  $Q_i$  values allow lower-threshold MOS transistors to be built, most MOS ICs use (100) material. Typical  $Q_i$  values for a 1200°C dry oxygen oxidation (courtesy of George Brown, Texas Instruments Incorporated) are as follows:

| Orientation <sup>16</sup> | $Q_j/cm^2$           |  |  |
|---------------------------|----------------------|--|--|
| (100)                     | $1.1 \times 10^{10}$ |  |  |
| (115)                     | $1.3 \times 10^{10}$ |  |  |
| (110)                     | $4.2 \times 10^{10}$ |  |  |
| (111)                     | $1.5 \times 10^{10}$ |  |  |

However, depending on the oxidation temperature, oxidation ambient, cooldown ambient, and subsequent heat treatment, the values may be higher by at least one order of magnitude. The values may also decrease as processing improves. For example, since 1967 (60), the decrease has been in the 25%-50% range.

Line AC of Fig, 3.12 shows the effect of oxidation temperature using dry oxygen on  $Q_i$  for (111) orientation. Line BC shows the effect of a subsequent anneal in an inert atmosphere. The effect is reversible in that if, after an anneal, the wafer is heated in dry oxygen,  $Q_i$  will increase to the value given by line AC for the new heattreatment temperature. The inset in Fig. 3.12 shows how long it takes at temperature to move from a value set by curve AC to the BC value. Also, at higher temperatures, continued heating in nitrogen will cause  $Q_i$  to again increase in value (43). Fig. 3.13 shows the effect of oxidizing and cooldown atmospheres on  $Q_i$  for (100) orientation wafers. The trend is the same for (111), although the differences are less (57).

 $N_{\mu}$  usually shows the same dependence on oxide growth conditions as does  $Q_{f}$ . However, it can be reduced to a very low level by a subsequent hydrogen anneal in the 300°C-450°C range. Such anneals can be done directly in hydrogen or forming gas or can take place during the aluminum sintering operation commonly used to

74

<sup>&</sup>quot;See Appendix A for a discussion of crystal plane notation.

Effect of heat treatment on fixed charge density for (111) surfaces. (*Source:* Bruce E. Deal, *J. Electrochem. Soc. 121*, p. 198C, 1974. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



produce good ohmic contacts. In the latter case, it is presumed that hydrogen is produced by the reduction of moisture trapped on the oxide surface before the aluminum was deposited (61). Since silicon nitride, often used as a sodium barrier, is also relatively impervious to hydrogen, anneals to reduce  $N_{\mu}$  after its application are quite difficult (62). For temperatures in the 700°C and above range, noticeable amounts of hydrogen will diffuse through nitride, but, of course, such high temperatures cannot be combined with aluminum metallization.

Effect of processing on  $Q_f$  for (100) wafers. (*Source:* Adapted from Reda R. Razouk and B.E. Deal, *J. Electrochem. Soc. 126*, p. 1573, 1979.)



3.5

# KINETICS OF OXIDATION

Over much of the oxide thickness range of interest, the thickness depends roughly on the square root of the oxidation time t. Early workers therefore used the simple parabolic relation  $AX = \sqrt{t}$  to predict thickness. In this equation, A is a constant, and X is the thickness of oxide grown in a time t. While not strictly accurate, this expression sufficed for the needs of most bipolar processing. However, as early as 1959, it was realized that there was an initial region of about 60 Å where the growth appeared parabolic, followed by a linear region, and then by another parabolic region (1). The very thin region was of little interest at that time and was generally disregarded. However, with the advent of VLSI, the requirement for smaller devices has now led to reduction of MOS gate oxide thicknesses to the 200 Å range and the desire to further reduce them to 100-150 Å. Early efforts to include the nonparabolic behavior over part of the growth range of interest led to a power law expression of the form  $AX^n = t$  where *n* ranges from 1 to 2. The expression was

supplanted by an expression of the form  $AX + BX^2 = t$  where A and B are constants, which for small X gives a linear behavior and for large X, a parabolic behavior. However, until the Deal–Grove model, which is discussed in the next section, there was no good explanation of why these two regions should exist.

The oxide growth rate depends on the crystallographic orientation of the surface being oxidized; the oxidizing ambient—for example, oxygen, steam, or a mixture thereof—and its partial pressures; the temperature; the amount of group IIIA or VA impurity in the silicon; the amount and kind of impurity in the oxidizer; the kind of preoxidation silicon surface cleanup; and the amount of stress in the oxide and the silicon. A full kinetic model must provide for all of these effects, as well as the initial rapid oxidation region. The original basic Deal–Grove model neglected orientation, doping, stress, cleanups, oxidizing gas impurities, and the thin, initially high oxidation rate region but through the years has been modified to account for most of these effects.

In 1965, B.E. Deal and A.S. Grove (63) developed the model mentioned in the previous section. It predicted very well the oxide growth rate over a wide range of conditions. The system that was assumed is shown in Fig. 3.14. As mentioned earlier, it has been shown that the oxidizing species diffuses through the oxide to the silicon surface and reacts. The equilibrium concentration of diffusant (oxidant) in the oxide is assumed to be C', and the actual concentration at the ambient-oxide surface is  $C_0$ . The concentration at the silicon surface is  $C_i$ . The flux of material crossing the ambientoxide interface will be proportional to  $C' - C_0$ ; that diffusing from one side of the oxide to the other is given by  $-D\partial C/\partial x$  where D is





FIGURE 3.14

Geometry used in development of Deal-Grove model.

the diffusion coefficient in the oxide. In the steady-state case, assumed in the derivation,  $\partial C/\partial x = (C_i - C_0)/x$  where x is in a direction normal to the silicon surface. The amount of oxidant consumed at the silicon interface is assumed to be proportional to  $C_i$ —that is, by a first-order reaction. For steady state, all three fluxes must be equal. Thus,

$$h(C^* - C_0) = \frac{D(C_0 - C_i)}{x} = kC_i \qquad 3.14$$

where h and k are rate constants.

Eq. 3.14 can be solved for the flux F as follows:

$$F = \frac{kC^{*}}{1 + (k/h) + (kx/D)}$$
3.15

The oxide thickness increase with time (dx/dt) is proportional to the flux, and the thickness at a time t is given by

$$t = \frac{1}{N} \int_0^x F(x) dt \qquad 3.16$$

where N is the number of oxidant molecules per unit area of oxide. Since an initial native oxide exists on the surface when high-temperature growth begins, and since the derivation is based on steadystate conditions, which may require a small amount of oxide growth, let  $X = X_i$  at  $t = 0^+$ . Eq. 3.16 has as a solution

$$t = \frac{X - X_i}{K_i} + \frac{X^2 - X_i^2}{K_p}$$
 3.17

where

$$\frac{1}{K_l} = \frac{N[(1/h) + (1/k)]}{C^*}$$

and

$$\frac{1}{K_p} = \frac{N}{2DC^*}$$

 $K_l$  is the linear rate constant, and  $K_p$  is the parabolic rate constant. An alternative form of Eq. 3.17

$$X^2 + AX = Bt + X_i^2 + AX_i 3.18$$

is usually used, and it is the one originally formulated by Deal and Grove. It can be rewritten as follows:

$$X^2 + AX = B(t + \tau)$$
 3.19

where  $\tau = (X_i^2 + AX_i)/B$  and is the time corresponding to the initial thickness<sup>17</sup>  $X_i$ . A comparison of Eq. 3.19 to Eq. 3.17 shows that B/A is the linear rate constant  $K_i$  and that B is the parabolic rate constant  $K_a$ .

Steam (or wet  $O_2$ ) oxidation proceeds at a more rapid rate than does dry oxidation and can be explained on the basis of the relative values of C and D for the two processes. Both C and C D over the entire range of practical oxidation temperatures are larger for H<sub>2</sub>O than they are for O<sub>2</sub>, even though  $D_{oxygen} > D_{water}$  (63). The combination of Eq. 3.17 and Eq. 3.19 predicts that under these circumstances, the rate of oxidation in steam will be greater than the rate in dry oxygen,

The nature of the oxygen diffusion mechanism is somewhat uncertain. Some data indicate that it is composed of a molecular oxygen permeation component and a lattice diffusion component and that their relative magnitudes change with temperature (64). Small deviations from Eq. 3.20 in the oxidation rate in the thick-oxide regime have been ascribed to a two-component transport, one of which is independent of oxide thickness (65).

When X is very large,  $X^2 >> AX$  and  $t >> \tau$  so that Eq. 3.19 reduces to

Growth is in the parabolic region, and the rate is diffusion limited. When X is small, Eq. 3.19 is of the form

$$X = \left(\frac{B}{A}\right)(t+\tau) \tag{3.21}$$

and the thickness increases linearly with time. A characteristic time  $A^{2}/4B$  is often taken as the demarcation between the two regimes (63).

Unlike oxides grown in steam, which closely follow the model just described over their whole thickness range, oxides grown in dry oxygen initially behave differently. A thickness-time plot of an oxide grown in dry oxygen has the general character shown in Fig. 3.15. Point 0', which marks the low end of the linear portion of the curve, may be at thicknesses of 50-100 Å. For less thickness, the oxide

3.5.2 Initial Stage of Oxidation

<sup>&</sup>lt;sup>17</sup>The native oxide thickness is probably no more than 20 Å, but when fitted to the experimental data, X for dry oxidations is sometimes in the 100–200 Å range. The growth behavior of thinner oxides is not properly portrayed by Eq. 3.17.



grows in a nonlinear fashion and faster than predicted by Eq. 3.19 when values of A and B determined from thicker oxides are used. The 50-100 Å is an appreciable portion of the total thickness of the thin gate oxides used in high-density MOS circuits, so models that provide a better prediction of thickness versus time in this region are useful. Several alternatives have been proposed. Some are just fitted parameters, and the list includes linear, parabolic, cubic, logarithmic, and inverse logarithmic forms (66). Others, based on various assumptions about the oxidation mechanism, have been proposed (67-72). The region of high-stressed oxide near the interface that was discussed earlier has been suggested as also being a region in which oxygen transport is restricted. Thus, as growth progresses and the layer gets thicker, oxygen reaching the interface will decrease more rapidly than predicted by Eq. 3.14 (67-70). For layers thicker than that of the high-stress region, growth will then proceed as described by Eq. 3.19.

Approaches involving alternate oxidation transport paths or reactions have also been proposed (65, 71–73). It has been pointed out that since the alternate reactions for thicker oxides reduce to Eq. 3.19, the corrections tend to be either terms added to or multiplied by Eq. 3.19 (74). In such form, all of the suggested equations look much alike and have enough constants to fit the experimental data reasonably well. Thus, using a "best fit" criterion to choose the mechanism actually responsible for the enhanced oxidation is difficult. An empirical expression for the oxidation rate that gives a good fit to the data and is used in the computer program SUPREM III (75, 76) is as follows:

Elm Exhibit 2159, Page 99

$$\frac{dX}{dt} = Ce^{-X/L} + \frac{B}{2X + A}$$
 3.22

where L is a characteristic length of about 70 Å. Based on Eq. 3.19, the rate would be

$$\frac{dX}{dt} = \frac{B}{2X + A}$$
 3.23

3.5.3 Reaction Path

At the beginning of this chapter, oxidation was said to occur by the reaction  $Si + O_2 \rightarrow SiO_2$ . However, various paths ultimately result in this reaction. The three parallel paths along which oxidation is currently thought to proceed (77) are as follows:

(1) Si + O<sub>2</sub> = SiO<sub>2</sub> + dislocations + lattice strain (2) Si + Si<sub>V</sub> = (Si - Si<sub>V</sub>)<sub>complex</sub> (Si - Si<sub>V</sub>) + O<sub>2</sub> = O + (Si - O)<sub>complex</sub> (Si - O) + Si<sub>V</sub> = (Si - O - Si<sub>V</sub>)<sub>complex</sub> (Si - O - Si<sub>V</sub>) + O = SiO<sub>2</sub> (3) 2Si + O<sub>2</sub> = O + Si<sub>1</sub> + (Si - O)<sub>complex</sub> Si + (Si - O) = (Si - O - Si)<sub>complex</sub> (Si - O - Si) + O = SiO<sub>2</sub> + Si<sub>1</sub>

where Siv is a silicon vacancy and Si<sub>1</sub> is an interstitial silicon atom.

If some portion of the silicon is oxidized through path 2, the rate will depend on the availability of vacancies, and anything that will change their concentration should affect the oxidation rate. Since some vacancies carry a charge, their population will depend on the Fermi level, which can be changed, even at oxidation temperature, by heavy doping of the silicon. As will be discussed in a later section, heavily doped silicon does have a different oxidation rate than does lightly doped silicon. This rate difference can be explained on the basis of vacancies and is considered to be good evidence of path 2's existence. Simultaneous oxidation and diffusion can produce an enhanced diffusion rate that has been explained on the basis of silicon interstitials that are generated during oxidation. Path 3 is a method by which the interstitials could be produced.

To determine A and B from experimental curves of oxide thickness versus time using Eq. 3.17, 3.18, or 3.19 is not straightforward. However, as an example, Eq. 3.19 can be rewritten as follows:

$$X = \frac{B(t+\tau)}{X} - A \tag{3.24}$$

Elm Exhibit 2159, Page 100

Then, in a plot of X versus  $(t + \tau)/X$ , -A will be the intercept with the X axis, and B will be the slope of the line. Since the line should be straight, a standard least-squares linear curve fitting routine can be used. To neglect  $\tau$  may be tempting, but unfortunately, to do so leads to substantial error in A.  $\tau$  can, however, be estimated by extrapolating from the linear region back to the time axis as shown in Fig. 3.15.

An alternative method is based on the fact that the only reason for concern in the extrapolation of  $X_i$  or  $\tau$  is that, for thin oxides grown in dry oxygen, a deviation from the basic linear-parabolic equation occurs. If it is assumed that the departure occurs for the portion of the curve below 0' in Fig. 3.15, then if the origin of the axes were shifted to O', the points for the remainder of the curve should have a best fit to Eq. 3.24 with  $\tau = 0$ . Thus, if the data points a, b, c, and so on, are sequentially used as the beginning of the curve, and if the data are fitted sequentially to equations of the form (78)

$$X - X_{a} = \frac{B(t - t_{a})}{(X - X_{a})} - A$$
 3.25a

$$X - X_{\rm b} = \frac{B(t - t_{\rm b})}{(X - X_{\rm b})} - A$$
 3.25b

the correlation coefficient should become progressively better until the starting point moves out of the box, at which point it should become constant. This method allows A, B, and  $X_i$  or  $\tau$  to be determined.

Yet another approach (79) is to differentiate Eq. 3.18 or 3.19 to get the expressions

$$\frac{dX}{dt} = \frac{B}{A + 2X}$$
3.26a
$$\frac{dt}{dX} = \frac{A}{B} + \frac{2X}{B}$$
3.26b

Now, independently of  $X_i$  or  $\tau$ , a plot of dt/dX versus X gives a line with a dt/dX intercept of A/B and a slope of 2/B. This method is appropriate, however, only if the data are good enough to allow dependable values of dX/dt to be determined.

3.5.5 Calculation of Oxide Thickness Solving for X from Eq. 3.18 gives

$$X = \sqrt{Bt + \frac{A^2}{4} + X_i^2 + AX_i} - \frac{A}{2}$$
 3.27a

$$X = \sqrt{B(t + \tau) + \frac{A^2}{4} - \frac{A}{2}}$$
 3.27b

Thus, if the rate constants just discussed are to be used, either Eq. 3.27a or 3.27b can be used to calculate the thickness X versus time.

For straight modeling work, equations such as the ones just described, which relate to the physical processes that occur, need not be used. One could, for example, use something as simple as  $t = a + bX + cX^2$  after having first evaluated the constants from experimental data by using straightforward polynomial curve fitting routines. Table 3.7 lists representative rate constants for several conditions.

The parabolic constant is given by  $2DC^*/N$  and hence would be expected to have the same temperature dependence as the combination of solubility limit and diffusion coefficient of the oxidizer in silica. In the case of dry oxygen, the evidence is that the diffusant

| Temperature<br>(°C) | $\begin{array}{c} B_{100} \\ (\AA^2/min.) \end{array}$ | $\frac{B/A_{100}}{(\AA/min.)}$ | $\begin{pmatrix} X_i \\ (\mathring{A}) \end{pmatrix}$ | $\begin{array}{c} B_{111} \\ (\overset{\circ}{A}^2/min.) \end{array}$ | $B/A_{III}$<br>(Å/min.) | $\begin{pmatrix} X_i \\ (\mathring{A}) \end{pmatrix}$ |
|---------------------|--------------------------------------------------------|--------------------------------|-------------------------------------------------------|-----------------------------------------------------------------------|-------------------------|-------------------------------------------------------|
| Dry Oxidation       |                                                        |                                |                                                       |                                                                       |                         |                                                       |
| 800                 | 660                                                    | 0.43                           | 70                                                    | 1,350                                                                 | 0.9                     | 68                                                    |
| 900                 | 5,590                                                  | 2.08                           | 85                                                    | 6,500                                                                 | 4.6                     | 88                                                    |
| 1000                | 28,600                                                 | 8.65                           | 113                                                   | 26,500                                                                | 16.4                    | 118                                                   |
| Net Oxidation       | (640 mm H,C                                            | ))                             |                                                       |                                                                       |                         |                                                       |
| 900                 | $0.24 \times 10^{6}$                                   | 25                             | 0                                                     | $0.25 \times 10^{6}$                                                  | 42                      | 0                                                     |
| 1000                | $0.52 \times 10^{6}$                                   | 111                            | 0                                                     | $0.52 \times 10^{6}$                                                  | 194                     | 0                                                     |
| 1100                | $0.87 \times 10^{6}$                                   | 497                            | 0                                                     | $0.86 \times 10^{6}$                                                  | 821                     | 0                                                     |

'See reference 63.

3.5.6 Temperature Behavior of Rate Constants\*

### TABLE 3.7

**Oxidation Rate Constants** 

TABLE 3.8

Activation Energy

| hem                                | Activation<br>Energy (eV)* | Reference |
|------------------------------------|----------------------------|-----------|
| O <sub>3</sub> diffusion in silica | 1.17                       | (1)       |
| H.O diffusion in silica            | 0.79                       | (2)       |
| B (dry O <sub>3</sub> )            | 1.23                       | (3)       |
| B (steam)                          | 0.78                       | (4)       |
| B/A (steam)                        | 1.94                       | (3, 4)    |
| B/A (dry O <sub>2</sub> )          | 2.0                        | .(3)      |

"The activation energy in eV can be converted to kcal/mol by multiplying by 23.1.

References: (1) A.J. Moulson and J.P. Roberts, *Trans. Furuday Soc.* 57, p. 1208, 1961, (2) F.J. Norton, *Nature* 171, p. 701, 1961, (3) B.E. Deal and A.S. Grove, *J. Appl. Phys.* 36, p. 3770, 1965, (4) Don L, Kendall et al., *J. Electrochem. Soc* 125, p. 1514, 1978.

is O<sub>2</sub>; in wet oxygen or steam, it is H<sub>2</sub>O. The solubility of water in fused silica (C') was found to be essentially constant over the 900°C-1200°C temperature range. No comparable data exists for oxygen, but since the activation energy of B for oxidation with oxygen and water closely matches the activation energies for oxygen and water diffusion coefficients (Table 3.8), it is concluded that C' is relatively temperature-insensitive in both cases. The linear constant B/A, given by (C'/N)(hk)/(h + k), appears not to depend on transfer from the gas stream to the oxide. Therefore, h >> k, and the temperature behavior will follow that of k, the rate constant at the silicon-oxide interface. The observed value of approximately 2 eV matches reasonably well with that of the energy required to break an Si-Si bond (1.8 eV).

More detailed observations have suggested that dry oxides grown above 1000°C behave differently than do those at lower temperatures and that there are different activation energies for the two ranges (80). Since 1000°C is roughly the temperature at which silicon dioxide begins to show appreciable plastic flow, it has been suggested that the increased low-temperature interface and oxide strains that occur during growth are the probable cause.

Crystal orientation will affect only the linear rate constant since, in the parabolic region, growth is limited by nutrient diffusion through the growing oxide. However, orientation effects, which are substantial, can be observed throughout the total growth cycle. Fig. 3.16 shows, for a dry oxidation, how the thickness varies over a range of orientations. The plot is based on thickness versus the angle that the oxidized surface was tilted away from the (100) plane toward the

# 3.5.7 Effect of Crystal Orientation on Rate





(011) plane. The location of this series of planes can be visualized from the standard projection shown in the inset. (For more details on interpreting standard projections, see Appendix A.)

The effects of crystal orientation on oxidation rate become important in the development of high-density memory designs, wherein use is made of the third dimension—that is, of going into the bulk silicon—for capacitors and transistors. Fig. 3.17 shows both an engineer's drawing (left) and a high-magnification cross-sec-

Engineer's drawing (left) and high-magnification cross-sectional SEM micrograph of a trench transistor crosspoint DRAM cell. (*Source:* Courtesy of Texas Instruments Incorporated.)



tional SEM<sup>18</sup> micrograph of a trench transistor crosspoint DRAM cell. In this structure, a capacitor and a transistor are both fabricated in an ~1 µm diameter well (trench) that is etched vertically ~7 µm deep into the silicon. The storage (capacitor) dielectric is silicon oxide that is thermally grown on the inner walls of the approximately 1 µm diameter trench. During the growth of this dielectric, all of the crystallographic planes shown along the periphery of a (001) stereographic projection (see Appendix A), and also shown in the stereographic projection in Fig. 3.16, are being oxidized at different growth rates. Fig. 3.18 (top) is a horizontal cross-sectional view of the storage node area-that is, the view looking from the top (100) surface down, but at the storage node level. The polycrystalline silicon storage node is readily visible as is the silicon dioxide dielectric around the trench wall. It is also evident that this oxide is much thicker in the four <110> directions than in the four <100>directions. Thickness differences corresponding to oxidation on (100), (510), (310), and (110) planes can be seen. Fig. 3.18 (bottom) shows another horizontal cross-sectional view of a defective (leaky) storage capacitor. The defects occur as would be expected-at the thin-oxide areas, near the <100> directions.

The oxidation rate versus orientation function was examined before 1961 in the hope that it would shed light on the oxidation reaction. The oxidations were done with pressures of from 40 atm to 150 atm and at temperatures between 500°C and 800°C. Over this range,

<sup>&</sup>lt;sup>18</sup>SEM work by Dr. H.L. Tsai, Central Research Laboratories, Texas Instruments Incorporated.

*Top:* Horizontal cross-sectional view of trench capacitor showing oxidation of inside wall of trench etched into (100) silicon. *Bottom:* Same view of trench capacitor showing defects in the oxide, located azimuthally near the thin oxide (100) direction.



87

the oxidation rates observed were in the order (110) > (311) > (111)(81). These rates do not correlate with the number of bonds leaving the surface or the number of atoms on the surface. Rather, they are apparently related to the number of sites actually available to the oxidizing specie (which will be less than the total number because of steric hindrance) and to the relative ease of bonding to various sites.

While the linear rate constants measured at atmospheric pressure show  $(B/A_{100}) < (B/A_{111}) < (B/A_{100})$  (82), most actual rates increase in the order (100) < (110) < (111), as shown in Fig. 3.16, and are apparently so ordered due to the impact of the parabolic rate constant, which should be independent of orientation. Under conditions that accentuate the effect of the linear rate constant—for example, oxidation at low temperature or the growth of only a very thin oxide—higher growth rates for (110) orientation are observed (83–84). At partial oxygen pressures below 0.07 atm, a reversal of the (111) and (100) oxidation rates has been reported (85).

Polycrystalline or amorphous<sup>19</sup> silicon oxidation rates are somewhat different from that of single crystals. The overall rate is due to the combination of the rates of the variously oriented small crystallites and, in some cases, to doping segregation along the grain boundaries. Further, polycrystalline or amorphous silicon has a much rougher surface than does the single-crystal surface generally used. During oxidation, rather than getting smoother, the oxide-polysilicon interface gets yet rougher (86–88). An example of the amount of roughness is shown in Fig. 3.19. The exact amount depends on the starting roughness and the doping level. It is this roughness that gives rise to increased electrical conduction of thermal oxide grown on polycrystalline silicon by locally increasing the electric field.

While the various published rate behaviors may disagree somewhat in detail, in general, for both wet and dry oxidation over a substantial temperature range, the thickness-time curve for undoped polysilicon will lie between the undoped (111) and the (100) curves as shown in Fig 3.20. For thin oxides, the thickness will be closest to the (111) orientation and, for longer times, will approach the (100) orientation thickness (89–91). The thickness can be approximated by Eq. 3.28 as follows:

 $X = at^n$ 

3.28

3.5.8 Polycrystalline Silicon Oxidation

88

<sup>&</sup>quot;Even if the starting material is amorphous, it will have crystallized by the time the oxidation temperature is reached. LPCVD silicon layers grown below about 575°C are amorphous and above about 625°C are polycrystalline.

FIGURE 3.20

dry oxidation.)

General trends for oxidation

behavior of polysilicon. (Val-

ues are reasonable for a 900°C

Trend of increasing roughness for thermal oxide grown on phosphorus-doped polysilicon at 800°C pyrogenic oxidation. (*Source:* Data from L. Faraone and G. Harbeke, J. Electrochem. Soc. 133, p. 1410, 1986.)



Elm Exhibit 2159, Page 108

where *a* and *n* are empirically determined constants. Note that *n* ranges from near 1 to under 0.5 have been reported. Boron doping increases the polysilicon rate a small amount and is comparable to that observed under similar circumstances for single crystals.<sup>20</sup> Phosphorus, which, unlike boron, segregates at the grain boundaries, may produce rate enhancements of ten times or more.

Increasing the ambient pressure can substantially increase the oxidation rate for any given temperature. Thus, low-temperature oxidations that might not otherwise be feasible because of excessively slow rates are quite practical when the pressures are raised to 10-20atm. The pressure will affect both rate constants through C<sup>'</sup>. Over the normal range of temperatures, and up to at least 10 atm, the constants for steam are directly proportional to the partial pressure or, in the case of elevated pressures, to the total pressure P (92). In dry oxygen, however, only the parabolic constant B is linear with pressure above 1 atm. The linear constant can be approximated by Eq. 3.29 (93, 94) as follows:

$$\frac{B}{A} = \left(\frac{B}{A}\right)_{1\text{ atm}} P^n \qquad 3.29$$

with *n* between 0.7 and 0.8 for *P* between 1 atm and 1000 atm. For partial pressures less than 1 atm, *n* is also <1 for temperatures  $<1100^{\circ}$ C. The activation energies for both the linear and parabolic constants show a change in slope in the 900°C–950°C range, which coincides with the change in the oxide viscous behavior.

The inclusion of other gases may substantially affect the oxidation rate. For example, because of the large difference in oxidation rates between dry oxygen and wet oxygen or steam, even very small amounts of moisture in the "dry" oxygen can substantially increase the rate (82, 95). The general trend of the effect of moisture on the oxidation rate using oxygen is shown in Fig. 3.21. Some data suggest that the presence of sodium during oxidation will change the rate (96), which, in the 1960s, was of some concern since many oxidation tubes introduced substantial amounts of sodium. Since sodium cannot be tolerated in MOS circuits, techniques have been developed to virtually eliminate it, and the impact on the oxidation rate is now only of academic interest.

Chlorine or a chlorine-bearing species such as HCl, trichloroethane, or trichloroethylene is often used during oxidation. All af-

# 3.5.9 Effect of Pressure on Oxidation

3.5.10 Effect of Ambient Impurities on Oxidation Rate

<sup>&</sup>quot;Impurity-enhanced oxidation is discussed in more detail in a later section.

General trend of effect of moisture on oxidation rate using oxygen for a 1000 minute oxidation at 900°C, (*Source:* Data from E.A. Irene and R. Ghez, *J. Electrochem. Soc. 124*, p. 1757, 1977.)

# 3.5.11 Effect of Silicon Doping on Oxidation Rate



fect the rate. Representative behaviors are shown in Figs. 3.22–3.24. They have similar characteristics in that a small amount enhances the rate, but the effect soon saturates. Small quantities of fluorine are reported to affect the rate much more than does HCl. When NF<sub>3</sub> is used, quantities of 100–500 ppm, depending on temperature, will produce a peak in the oxidation rate that can be as much as five or ten times as great as without the NF<sub>3</sub>. Once the peak is passed, a competing fluorine etching reaction apparently begins (97).

Heavily n- or p-doped silicon has been observed to increase the oxidation rate (98–102). The effect of n-doping is much more pronounced than is the effect of p-doping. In either case, the effect is primarily on the linear rate constant B/A. For n-dopants, the ratio of  $(B/A)_{doped}$  to  $(B/A)_{undoped}$  may be as much as 20. For p-dopants, the ratio is in the range of 2. The n-doped parabolic constant B increases, but by only about a factor of 2 near 800°C and by less at higher temperatures (101). For p-doping, B increases a few percent as the temperature goes from 800°C to 1150°C (100). Within experimental error, that all of the enhancements are independent of orientation is apparent.

A reasonably satisfactory model for n-behavior is based on the supposition that the linear rate constant can be expressed as the sum of two terms, one of which represents the effect of silicon vacancies (102). Thus,

$$\frac{B}{A} = R + KC_{v} \qquad 3.30$$

Oxide thickness versus time for oxidation in various O<sub>2</sub>/HCl mixtures. (*Source:* D.W. Hess and B.E. Deal, *J. Electrochem. Soc.* 124, p. 735, 1977. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



where *R* is the term that does not depend on vacancy concentration,  $C_v$  is the total concentration of vacancies, and *K* is a constant of proportionality. Vacancies exist as neutrals, single-charged deep acceptors, double-charged deep acceptors, and single-charged donors. The concentration of neutrals is a function only of temperature. The charged vacancy concentration will also depend on the net concentration of electrically active impurities when the doping level is in-



creased to the point where it is greater than  $n_i$  at oxidation temperature.<sup>21</sup> For the intrinsic case (no heavy doping), Eq. 3.30 can be written as

$$\left(\frac{B}{A}\right)^{\prime} = R + KC_{\nu}^{i} = C_{1}e^{-2/kT} \qquad 3.31$$

 $^{21}n_i$  is the intrinsic carrier concentration. Thus, vacancy enhancement occurs when doping is heavy enough for the silicon to be extrinsic at the oxidation temperature.

where  $C_i^*$  is the vacancy concentration while the silicon is intrinsic and  $C_1$  is a constant. The 2 eV activation energy for B/A was discussed earlier. Combining the intrinsic case expression (Eq. 3.31) and the general expression (Eq. 3.30) gives

$$\frac{B}{A} = \left(\frac{B}{A}\right)^{\prime} \left[1 + \left(\frac{K}{C_{i}}\right) \left\{\frac{C_{v}}{C_{v}^{i}} - 1\right\} C_{v}^{i} e^{2ikT}\right]$$
 3.32

By calculating the increase in  $C_v$  and using one set of experimental data to determine the  $KC_v^{j}$  product, curves such as the ones shown in Fig. 3.25a may be plotted. For p-doping, the agreement is not as good as it is for n-doping. Experimentally, a larger effect is observed



(b)

# FIGURE 3.25

(a) Change of B/A with silicon doping level. (Source: C.P. Ho and J.D. Plummer, J. Electrochem. Soc. 126, p. 1523, 1979.
Reprinted by permission of the publisher, The Electrochemical Society, Inc.) (b) Change of oxide thickness with doping level. (Source: Adapted from C.P. Ho et al., J. Electrochem. Soc. 125, p. 665, 1978.)

than that predicted by Eq. 3.32 (102). It is possible that in this case the dopant pileup in the oxide is the driving force rather than vacancies (remember that for p-material, heavier doping will only increase the density of the singly charged donor vacancy). (Dopant pileup is discussed in the next section.)

Even though the change of B/A appears to be independent of wet or dry oxidation and of orientation, the actual oxide thickness will still depend on these things because the B/A to B ratio differs from case to case. Fig. 3.25b shows the effect of doping on oxide thickness for two temperatures and illustrates the difference in rates observed even though the B/A increase with doping is virtually the same (Fig. 3.25a) for the two cases.

The polycrystalline silicon oxidation rate increases with doping. All data regarding p-doping seem to agree that polycrystalline behavior and single-crystal behavior are similar. In the case of very heavy n-doping, some data indicate comparable behavior (90, 103), while other data show an increased enhancement over that predicted by Fig. 3.25 (104). The oxidation rate at high pressure also shows an n-doping dependence. On the basis of very limited data, it does not appear to be as large as the atmospheric pressure effect (105).

Table 3.9 shows some trends of heavy doping effects on oxide thickness compared to a lightly doped wafer under the same conditions. Note that if, for example, the effect of heavy doping is to be minimized, high temperatures, (111) orientation, and a dry oxygen ambient should be chosen (101). Conversely, if the differences in oxide thickness over lightly and heavily doped regions are to be accentuated, then steam oxidations at low temperatures on (100) material are more appropriate. By taking advantage of a maximized effect, it is possible, for example, to grow substantially thicker oxide over an n-channel source and drain than over the gate or to simultaneously grow a thin gate oxide and a much thicker interlevel polysilicon oxide (101, 104).

| n | р | (111) | (100) | High<br>Temp. | Low<br>Temp. | Dry | Wet | Thin | Thick | Effect             |
|---|---|-------|-------|---------------|--------------|-----|-----|------|-------|--------------------|
| x |   | x     |       |               | х            | x   |     | x    |       | Medium<br>increase |
| X |   | X     |       | X             |              | X   |     |      | X     | None               |
| x |   |       | х     |               | x            |     | х   | x    |       | Large<br>increase  |
|   | Х | X     |       |               | X            |     | X   | X    |       | None               |
|   | X | x     |       |               | x            | х   |     | Х    |       | Small<br>increase  |

TABLE 3.9

Impact of Heavy Doping on Oxide Thickness

### 3.5.12 Impurity Profile at the Oxide Interface

While the oxide is growing, silicon is being consumed and the impurities that were in the silicon must be redistributed. The manner in which this distribution occurs depends on the segregation coefficient m of the impurity and the relative diffusion coefficients of the impurity in silicon and in SiO<sub>2</sub> (106, 107). m is defined as the ratio of the equilibrium concentration of impurity on the silicon side to that on the oxide side of the interface. In principle, as oxidation proceeds, three cases are possible. The impurities could be rejected by the oxide (m > 1), and a pileup of impurity in the silicon would then occur (Fig. 3.26a). If m < 1, dopant will be depleted from the silicon and built up in the oxide (Fig. 3.26b). If m = 1, the dopant in the oxide and silicon will be uniform across the interface, but a depletion of impurity from the silicon will still occur (Fig. 3.26c) because the oxide volume is greater than that of the consumed silicon. A high-oxide diffusivity, combined with a rapid transfer across the oxide-ambient boundary, can keep the concentration in the oxide at the interface well below the equilibrium value, which can result in a reduction of the concentration in the silicon near the interface (Fig. 3.26d) even though m > 1.



| TABLE 3.10                                              |                                        |                         |                      |                         | Diffusi                                                                                                            | on Coeff  | icient* (c   | $m^2/s$ )     |
|---------------------------------------------------------|----------------------------------------|-------------------------|----------------------|-------------------------|--------------------------------------------------------------------------------------------------------------------|-----------|--------------|---------------|
| Segregation and Diffusion<br>Coefficients of Impurities | Element                                | m Referen               |                      | ence                    | 1100°C                                                                                                             |           | 1200°C       |               |
| in SiO <sub>2</sub>                                     | Ga                                     | 20                      | (1                   | )                       | 5.3 × 1                                                                                                            | 10-11     | 5.8 ×        | 10 .          |
|                                                         | В                                      | Ť                       | (2                   | )                       | 4.6 × 1                                                                                                            | 10 17     | 3.2 ×        | 10 16         |
|                                                         | In                                     | >1                      | (1                   | )                       |                                                                                                                    |           |              |               |
|                                                         | Р                                      | 10                      | (1                   | )                       | $1.4 \times 1$                                                                                                     | 10 15     | $2.9 \times$ | 10 15         |
|                                                         | Sb                                     | 10                      | (1                   | )                       | 9.9 × 1                                                                                                            | 10 17     | 1.5 ×        | 10 1          |
|                                                         | As                                     | 10                      | (1                   | )                       | 2.6 ×                                                                                                              | 10 16     | 4.4 ×        | 10 15         |
|                                                         | tm values for b                        | †m values for boron:    |                      |                         |                                                                                                                    |           |              |               |
|                                                         |                                        | 900°C                   |                      | 1200°C                  |                                                                                                                    |           |              |               |
|                                                         |                                        | 0.16                    |                      | 0.6                     | wet of                                                                                                             | xidation. | (100)        |               |
|                                                         |                                        | 0.1                     |                      | 0.5                     | wet oxidation, (111)                                                                                               |           |              |               |
|                                                         |                                        | 0.5                     |                      | 1                       | dry or                                                                                                             | kidation, | (100)        |               |
|                                                         | Pressure (bar)                         | 1                       | 5                    | 10                      | 15                                                                                                                 | 20        | (100),       | 920°C         |
|                                                         | m                                      | 0.3                     | 0.55                 | 0.7                     | 0.9                                                                                                                | 0.95      |              |               |
|                                                         | reported values i<br>146, 1973. See al | n M. Ghea<br>so Yasuo Y | zo and E<br>Vada and | D.M. Brown<br>Dimitri A | en chosen arbitrarily from<br>.M. Brown, J. Electrocher<br>Dimitri A. Antoniadis, J.<br>. Electrochem. Soc. 131, p |           |              | p.<br>1. Soc. |

> References: (1) A.S. Grove et al., J. Appl. Phys. 35, p. 2695, 1965. (2) Richard B. Fair and J.C.C. Tsai, J. Electrochem. Soc. 125, p. 2050, 1978; P. Deroux-Dauphin and J.P. Gonchond, J. Electrochem. Soc. 131, p. 1418, 1984.

that the various values listed in the literature may vary orders of magnitude.

often because of sensitivity to concentration or diffusing ambient,

Table 3.10 gives m values and diffusivities of some common silicon dopants. Gallium is one that has out-diffusion from the silicon. even though the high value of m would imply a pileup (108). Because of its importance in device fabrication, boron segregation has been extensively studied. mboron is dependent on the phase that forms at the interface, which, in turn, is dependent on the oxidation conditions. m equals 0.6 at 1200°C for wet oxidation, 1 for very dry oxidation, and for a high-boron-content glass at the surface<sup>22</sup> will be over 2. It is also silicon wafer orientation dependent and increases with oxidation temperature and pressure (109, 110). As in most other process-related data, a substantial spread in the m values is reported.

<sup>&</sup>quot;This case will not normally occur during an oxidation alone but is likely to occur during a diffusion cycle.

3.6

# OXIDE THICKNESS CHARTS

Figs. 3.27-3.31 show representative curves of oxide thickness for oxidations under various conditions. The curves do not cover all circumstances of interest, but they do show the trends. Eq. 3.27 and values of B and B/A will allow behavior outside the range of the curves to be calculated as long as the thickness is greater than perhaps 300 Å. Alternatively, a computer program such as SUPREM. which calculates in a similar fashion, can be used. Fig. 3.27 shows thicknesses for oxidation temperatures in the 700°C-1200°C range in dry oxygen. It also shows the difference in thickness due to slice surface orientation. Fig. 3.28 shows similar curves for steam oxidation at 640 torr. In this case, the steam is generated by burning hydrogen in oxygen directly in the oxidation tube. However, whether the moisture comes from steam or from oxygen bubbled through hot water, for the same pressure, the behavior seems to be the same. Fig. 3.29 shows curves for dry oxygen oxidation at high pressures, and the curves in Fig. 3.30 pertain to high-pressure steam oxidation. Fig. 3.31 shows thicknesses for very short times for the three most common slice surface crystallographic orientations. It also shows the (111)-(100) crossover discussed in the section on orientation effects.



Oxide thickness versus oxidation time for silicon oxidation in dry oxygen. (Smirce: Courtesy of Bruce E. Deal.)



Oxide thickness versus oxidation time for silicon oxidation in pyrogenic H<sub>2</sub>O (640 torr). (*Source:* Bruce E. Deal, *J. Electrochem. Soc. 125*, p. 576, 1978. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



3.7

PREPARATION OF THERMAL OXIDE 3.7.1 Preoxidation Surface Cleanup\* The kind of cleanup given the surface before oxidation can affect not only the quality of oxide but also the thickness. Some of the many varieties of surface contaminants that may be found on slices before oxidation, along with probable effects on the oxide, are listed in Table 3.11. How many contaminants remain on the oxide depends on the efficiency of the cleanup. A cleaning sequence should proceed in the following order:

- 1. Remove gross organics and particulates.
- 2. Remove organic films (to prevent masking of next step).
- 3. Remove surface oxide.
- Remove surface-adsorbed ions and plated-metal contaminants.
- Reform contamination-free native oxide (usually done as part of step 4).

See references 111-113.

Oxide thickness versus oxidation time for silicon oxidation in dry oxygen at high pressure. (Source: Liang N. Lie et al., J. Electrochem. Soc. 129, p. 2828, 1982. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



Elm Exhibit 2159, Page 119

Oxide thickness versus time for thermal oxidation of silicon in pyrogenic steam at high pressure. (*Source:* Reda R. Razouk et al., *J. Electrochem. Soc.* 128, p. 2214, 1981. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)





FIGURE 3.31

Oxide thickness versus oxidation time during initial stages of silicon thermal oxidation. (Source: H.Z. Massoud, "Thermal Oxidation of Silicon in Dry Oxygen-Growth Kinetics and Charge Characterization in the Thin Regime," Ph.D. dissertation, Stanford University, 1983.)



101

#### **TABLE 3.11**

| Summary of Slice     |  |
|----------------------|--|
| Surface Contaminants |  |

| Contaminant                  | Possible Sources                                                  | Effect                                             |  |  |
|------------------------------|-------------------------------------------------------------------|----------------------------------------------------|--|--|
| Heavy metals                 | Grinding, polishing, any prior cleanup                            | Reduced lifetime, S-pits<br>(saucer pits) in slice |  |  |
| Low-concentration sodium     | Any prior operation                                               | Charge in oxide                                    |  |  |
| High-concentration<br>sodium |                                                                   | Crystalline areas in oxide                         |  |  |
| Alkali metals                | Grinding, polishing                                               | Reduced oxide field strength                       |  |  |
| Organic films                | Plasticizer from storage<br>containers, organic<br>vapors in room | Change in oxidation rate                           |  |  |
| Adsorbed fluorine            | Etch containing HF                                                | Change in oxidation rate                           |  |  |
| Particulates                 | Particles in air, in<br>cleaning solutions, or<br>from equipment  | Pinholes in oxide                                  |  |  |

Remove all traces of cleaning solutions and all particulates from the surface of the reformed native oxide film.

Heavy metals in particular are difficult to remove since they tend to replate onto the surface from the cleaning solutions. Small particles also are very difficult to remove because of the increasing importance of attractive forces between the particle and the surface relative to the forces that can be exerted on the particle by liquid flowing across the surface. Thus, mechanical scrubbing is almost a necessity for complete small-particle removal. However, mechanical scrubbing has its pitfalls in that the scrub brush can damage the wafer surface and may only slide particulate matter from side to side in oxide windows and not remove it at all. An alternative in many circumstances is ultrasonic agitation of the cleaning bath (114). The intensity must be high enough to produce cavitation but not so high that wafers are broken.

One old, but reasonably good, all-purpose cleaning solution is a sulfuric acid and hydrogen peroxide mixture (affectionately called "piranha"). However, a much better cleaning routine, often referred to as the "RCA cleanup," is as follows (108):

- Remove gross organics with perchloroethylene (tetrachloroethylene).
- Remove residual organic films with an H<sub>2</sub>O<sub>2</sub>-NH<sub>4</sub>O (basic) solution.
- 3. Remove metallics with an H<sub>2</sub>O<sub>2</sub>-HCl (acidic) solution.

- 4. Rinse thoroughly in deionized water.
- 5. Dry.

A single solution that removes organics and some heavy metals very well is choline, water, and a surfactant. However, the acidic RCA solution is more effective for other metals, so that the choline solution followed by  $HCl-H_2O_2$  is potentially better than the RCA sequence. Table 3.12 gives typical compositions of these cleaning solutions. In general, the relative amounts can be changed substantially and not affect performance. Solutions containing hydrogen peroxide lose their peroxide and thus must be periodically replaced.

Hot, concentrated sulfuric acid alone can be used for cleaning but, for maximum performance, must be so hot that a significant amount of acid fumes are released. By mixing it with hydrogen peroxide, the operating temperature can be reduced to a little over  $100^{\circ}$ C. Usually no external heat is required since the H<sub>2</sub>SO<sub>4</sub> heat of solution will provide the initial heating and the heat of decomposition of the H<sub>2</sub>O<sub>2</sub> will keep the solution hot over its working lifetime (115).

Depending on the final cleanup, substantial differences in the composition of the native oxide are observed. For example, HF leaves a thin oxide on the surface that quickly adsorbs large quantities of carbon. A hydrogen-peroxide-based cleaner will leave a

#### **TABLE 3.12**

#### Preoxidation Surface Cleaning Solutions

| Cleanup                     | Cor                                                                                | Conditions                    |                  |                         |                                                           |
|-----------------------------|------------------------------------------------------------------------------------|-------------------------------|------------------|-------------------------|-----------------------------------------------------------|
| Choline clean*<br>(parts by | Choline<br>(CH <sub>3</sub> ) <sub>3</sub> N(CH <sub>2</sub> Ch <sub>2</sub> OH)OH | $H_2O_2$                      | H <sub>2</sub> O | Surfactant<br>(NCW-601) | 50°C ± 5°, 10 min.<br>ultrasonic clean,                   |
| volume)                     | 3                                                                                  | 1                             | 95               | 1                       | 10 min. DI water<br>rinse, spin dry.<br>Room temperature, |
| RCA clean, basic†           | NH4OH                                                                              | H <sub>2</sub> O <sub>2</sub> | H <sub>2</sub> O |                         |                                                           |
| (parts by                   | 1                                                                                  | 1                             | 5                |                         | 10 min. clean time.                                       |
| volume)                     | T                                                                                  | 2                             | 7                |                         | 10 min. DI water<br>rinse, spin dry.                      |
| RCA clean, acid†            | HCI                                                                                | H.O.                          | H <sub>2</sub> O |                         | Room temperature,                                         |
| (parts by                   | 1                                                                                  | 1                             | 6                |                         | 10 min. clean time.                                       |
| volume)                     | 1                                                                                  | 2                             | 8                |                         | 10 min. DI water<br>rinse, spin dry.                      |
| Sulfuric acid clean         | H,SO,                                                                              | H <sub>2</sub> O <sub>2</sub> |                  |                         |                                                           |
| (percent)                   | 60                                                                                 | 40                            |                  |                         |                                                           |
| 241211110                   | 70                                                                                 | 30                            |                  |                         |                                                           |

Oxidation rate versus preoxidation cleanup. (Source: Adapted from F.N. Schweltmann et al., The Electrochemical Soc. Ext. Abst., Vol. 78-1, and Ti-Ken Bean, The Electrochemical Soc. Ext. Abst., Vol. 81-2.)





thinner, cleaner, and apparently rather volatile oxide (116). Presumably through the kind of oxide left, the cleanup can affect the linear oxidation rate constant, as shown by the examples in Fig. 3.32.

The equipment used with the cleaning solutions can be broken into two general categories: Spin-spray and tank immersion. Tank immersion may or may not be combined with ultrasonic agitation. Drying is generally by hot nitrogen, which, in the case of spin-spray cleaning, can be introduced into the spin rinser immediately after the last water rinse. Great care must be taken to ensure that no equipment, including that used for water rinsing and drying, generates and distributes particles onto the slice surface. The cleaning solutions, water, and nitrogen should all pass through point-of-use filters.

Atmospheric pressure reactions are primarily carried out in oxidation tubes heated in a tubular diffusion furnace (for details on construction of furnaces, see Chapter 8). Fig. 3.33 is a schematic of an oxidation tube and the normal input oxidizing gases. The combination of large-diameter wafers and the popularity of rapid thermal anneal processing has prompted investigations of lamp-heated oxidations (117–119). Such a heat source seems feasible and appears particularly applicable to thin oxides where the high thermal inertia of ordinary oxidation furnaces makes the control of short oxidation times difficult.

Wet oxidation is done by bubbling oxygen through water held near the boiling point. Steam oxidation can be done either by having



an external steam generator or by introducing oxygen and hydrogen into the front of the oxidation tube in stoichiometric proportions (pyrogenic or burnt hydrogen system). In either a wet oxygen or a steam system, the line from the water source to the furnace must be wrapped with heating tape and kept hot so that no condensation occurs. Assuming that the carrier gas (oxygen or an inert gas such as helium or argon) is saturated, the partial pressure of water vapor ( $P_w$ ) can be determined from the temperature of water in the bubbler (see Table 3.13). If there is a question of saturation, it can be calculated from Eq. 3.33 (120) as follows:

$$P_{w} = \left(\frac{n_{w}}{n_{w} + n_{\text{gas}}}\right) P_{\text{total}}$$
3.33

where  $P_{\text{total}}$  will be essentially atmospheric pressure if the tube exhaust is not throttled. The *n*'s are the number of moles of water vapor and carrier gas, respectively, and can be measured.

The slices are held perpendicular to the gas flow in either silicon or fused silica boats. The separation of slices is much less than it is for diffusion and is usually only enough to keep the wafers from physically contacting each other. The tubes are sized for the slice diameter, and the gas flows are similarly scaled. The boats must be kept scrupulously clean and not used in any diffusion operation since they would pick up enough diffusant to contaminate both the wafers and the oxidation tubes. The tubes are usually of high-purity fused silica, although silicon carbide or polysilicon tubes are sometimes used (121–122). Both of the latter tubes are more impervious to contamination transfer through their walls, and both sag less at temperatures above 1100°C. Silicon tubes can be given an internal CVD coating of silicon nitride to protect them from oxidation.

Sodium moves very rapidly through the walls of fused silica tubes at oxidation temperature, and early in the development of MOS devices, when it became clear that sodium in the thermal oxide was deleterious, a concentric tube arrangement was introduced. Thus, as sodium and other impurities diffused through the outside tube to the space between the tubes, they could be swept away by gas flowing between the tubes. Sodium also causes the tubes to devitrify and weaken with time at oxidation temperature, and thus impurities can move through the tube walls with even greater ease.<sup>23</sup> Table 3.14 shows wafer contamination levels that can occur from impurities originally in the furnace.

Contamination can also come in through the gas lines. As already discussed, small amounts of water in the oxygen of a "dry" oxidation system will cause a substantial increase in the oxidation rate. Oxygen, water vapor, and hydrocarbons in the inert gas must also be considered. Studies of flatband voltage versus these contaminants have indicated that little shift occurs when oxygen is below 200 ppm, H<sub>2</sub>O is below 120 ppm, and total CO<sub>2</sub> is below 60 ppm (123). These numbers are rather high, and as the shift to thinner oxides and higher operating fields continues, they will probably need to be reduced. Contamination of the oxidation chamber by oxygen, water, and hydrocarbons can occur not only from contaminated incoming gases or plumbing leaks but also from backstreaming from the end of the oxidation tube. Tubes operated with no endcaps or with loose-fitting caps may have enough leakage to exceed the numbers just given. When HCl is used in the system, small leaks can let in enough moisture to allow the HCl to corrode metal plumbing and transport the metallic chlorides into the furnace. Thus, other chlorine-containing gases are often used during oxidation instead of HCL.

#### **TABLE 3.14**

Wafer Contamination from Impurity Transfer through Tube Walls

| Impurity | Contamination<br>Level<br>(atoms/cc) |  |  |  |  |  |
|----------|--------------------------------------|--|--|--|--|--|
| Cu       | $1.4 \times 10^{14}$                 |  |  |  |  |  |
| Fe       | $1.4 	imes 10^{11}$                  |  |  |  |  |  |
| Ni       | 1.3 × 104                            |  |  |  |  |  |
| Cr       | $3.2 \times 10^{11}$                 |  |  |  |  |  |
| Zr       | $2.0	imes10^{10}$                    |  |  |  |  |  |
| Ta       | $6.0 	imes 10^{(i)}$                 |  |  |  |  |  |

Source. Data from Paul F. Schmidt, J. Electrochem. Soc. 130, pp. 196–199, 1983

<sup>&</sup>lt;sup>2</sup>Therefore, tubes should never be handled with bare hands. Enough sodium from the body will transfer where the fingers touch to cause devitrification and indeed will often produce identifiable fingerprint devitrification patterns.

# 3.7.3 High-Pressure Oxidation Processing

# 3.7.4 High-Pressure Equipment

A major processing disadvantage of atmospheric pressure oxidation is the time required at high temperature to grow the thick oxides used, for example, in the oxide isolation described in the next section. From Fig. 3.27, note that a 1  $\mu$ m thick oxide grown in steam on a (100) surface requires approximately 2 hours at 1100°C, 5 hours at 1000°C, or 15 hours at 900°C. The desire to minimize cycle time leads to higher-temperature oxidations. In turn, more diffusion of impurities that may already have been diffused into the wafer results, the incidence and size of stacking faults increases, and the likelihood of slip occurring during heatup or cooldown also increases.

To provide reduced time and/or temperature during oxidation and minimize the effects just discussed, high-pressure oxidation can be used. The oxidation time for a fixed thickness at a given temperature will be inversely proportional to the pressure. Thus, the 1  $\mu$ m layer previously alluded to would take only 3 hours to grow at 5 atm pressure or 1.5 hours at 10 atm. Alternatively, for a given oxidation time, the temperature can be reduced about 30°C for each atmosphere of pressure increase. Again, referring to the earlier example, if 5 hours is an acceptable time, by using 10 atm, the growth temperature could be dropped from 1000°C to about 700°C.

Early high-pressure oxidation studies, beginning in 1959, were made using a bomb-type apparatus in which the slices and a measured amount of water were sealed in a heavy-walled metal container and heated to the desired oxidation temperature (1, 2, 81). The amount of water included would then determine the operating pressure. The attainable pressure can be quite high, and studies up to 500 atm have been made. More recent versions of the bomb approach have used externally connected high-pressure gases to supply the internal pressure (124). By using this approach, dry as well as wet oxidations can be performed. Such equipment is cumbersome, however, and not very amenable to production usage. To achieve the rates and temperatures currently deemed appropriate for production, pressures of only 10-20 atm are required. By using a fairly conventionally designed oxidation furnace inside a cold-walled high-pressure container, much more satisfactory production equipment for this pressure range is obtained (105, 125-128). An alternative method, suitable for a few atmospheres, is to use a thick-walled fused silica tube in an ordinary furnace (129). Extra safety precautions should be taken since mishandling of the tube can introduce surface damage that will reduce the breaking strength. For dry oxidation, high-pressure oxygen can be introduced directly into the high-pressure chamber. For steam oxidation, an external high-pressure steam generator or pyrogenic steam internally formed can be used. Pyrogenic steam is currently the most common source.

When a pn junction is used for component isolation, as shown in Fig. 3.34a, the heavy doping near the surface allows only a very narrow depletion layer to form and hence forms a high-capacitance junction. Such high capacitance at bipolar collector-base junctions. MOS source and drain junctions, and CMOS well isolation junctions can seriously degrade high-frequency performance. On the bottom of the diffusion, where the concentration is lighter and the space charge region much wider, the total capacitance is usually less than it is on the periphery, even though the area may be greater. If a sidewall oxide isolation, as shown in Fig. 3.34b, can be achieved, device performance will improve. The process used in this case is usually referred to as LOCOS (local oxidation of silicon) (130) or dielectric isolation.24 It depends on the fact that silicon nitride is impervious to oxygen diffusion and can be used as a mask against oxidation. Thus, if a bare silicon wafer is covered with a nitride layer that then has openings etched in it, a subsequent oxidation step will grow oxide only where the nitride is removed.

Silicon nitride is assumed to oxidize by one or both of the following reactions:

 $\begin{array}{l} Si_3N_4 \ + \ 6H_2O \rightarrow 3SiO_2 \ + \ 6H_2 \ + \ 2N_2 \\ Si_3N_4 \ + \ 6H_2O \rightarrow 3SiO_2 \ + \ 4NH_3 \end{array}$ 

Examples are shown in Fig. 3.35. The rate is slow enough for nitride to be a very practical masking material. In order to speed up the silicon oxidation cycle at low temperatures where little dopant diffusion occurs, high pressure is often used. At pressures of about 150 atm, the nitride oxidizes too rapidly to be practical, but at the lower

FIGURE 3.34

3.7.5 Selective

Oxidation

Two methods of isolating components.



<sup>4</sup>Dielectric isolation is somewhat of a misnomer since the term usually refers to a completely different process that provides for full oxide isolation.

Amount of silicon nitride oxidized under various conditions. (Nitride used in highpressure study was deposited at 800°C; the other was annealed at 1200°C after deposition.) (Source: Atmospheric data from 1. Franz and W. Langheinrich, Solid State Electronics 14, p. 499, 1971; high-pressure data from H. Miyoshi et al., J. Electrochem. Soc. 125, p. 1824, 1978.)



pressures typically used, the ratio of silicon to nitride oxidation rates is close to that at atmospheric pressure (131-132).

Because of the high stress at the nitride-silicon interface, dislocations will often be generated in the silicon at the corners of windows during oxidation (133-136). To prevent this problem, a thin layer of thermal oxide (pad or buffer oxide) is generally used between the nitride and the silicon. Unfortunately, oxygen can diffuse through this oxide along the interface and cause additional oxide growth under the nitride, giving rise to the enhanced "birdbeak" structures shown in Fig. 3.36. Reducing the pad oxide thickness reduces the length of the birdbeak (136). Since the oxide is thicker than the silicon it consumes, a recess is often first etched in the silicon so that after oxidation the surface is approximately flat. The recess changes the character of the oxide profile as is shown in Fig. 3.36c. Minimizing the effect hinges on an optimum choice of recess depth, nitride thickness, and pad oxide thickness. In general, thin pads help but may lead to more crystal defects. Typically, thicknesses of 300-500 Å are used. Thick nitride causes more defects, but a minimum thickness is dictated by the amount of nitride oxidized during the silicon oxidation cycle. Typical thicknesses of 1000-2000 Å are used. By using an additional nitride sidewall protection layer, the enhanced birdbeak can be largely eliminated (135, 137-138), as shown in Fig. 3.36d. However, additional defects may be introduced into the silicon with such a process (139).

In addition to crystal defects and an exaggerated birdbeak profile, a "white ribbon" is also sometimes seen just inside the thick oxide frame after a MOS gate oxidation. It is caused by a very narrow band of nitride that prevents oxidation. The origin of the band



has been ascribed to a transport of nitrogen from the masking nitride to the silicon-oxide interface (140). Thus, if the nitride is stripped and the pad oxide is removed in dilute HF, the nitride at the oxidesilicon interface remains unless substantial overetching occurs during the oxide removal step. Alternatively, the pad oxide removal could be followed by a subsequent nitride etch. However, such a second nitride removal step is seldom used. The growth and subsequent removal of a sacrificial oxide to remove the nitride is a more common step.

When a silicon wafer is uniformly oxidized, the oxidation process can be described by means of one-dimensional equations such as Eq. 3.15. When holes are cut in a uniformly thick oxide and then reoxidized; when local areas are initially oxidized, as was described in the preceding section; and when trenches (grooves) are being oxidized for either trench capacitors or trench isolation, the geometries are such that at least two- and sometimes three-dimensional oxidation solutions are required to fully describe the behavior. During local reoxidation, which occurs, for example, when a gate oxide is grown, an observable thickness variation near the thick/thin step is at least partially described by a two-dimensional analysis (141). During birdbeak development, oxygen must diffuse laterally along the pad oxide and react with silicon beneath the oxide (131, 142). When

3.7.6 Two- and Three-Dimensional Oxidation protuberances or the walls of small holes or grooves are oxidizing, two- or three-dimensional solutions are required if the oxide thickness is more than perhaps 25% of the initial radius (143). Much of the difference between one- and two- or three-dimensional solutions arises because of the difference in area between growing surface and the surface presented to the ambient. In addition, growth in confined space usually increases stress and cannot be handled in one dimension.

In addition to the various ionic and interface electrical charges, which are defects in their own right, a variety of other defects exist as well. Pinholes in the oxide are generally introduced during lithographic steps, but they can also be produced during oxide growth wherever nonvolatile particulates are attached to the wafer. Even if a particulate does not produce a discernible hole, it may well produce a region in the oxide that has a lower breakdown strength than the main body of the oxide (144) (as was discussed in an earlier section). Pinholes in a previous oxidation can allow small heavily doped regions to occur in unwanted positions during a subsequent diffusion. During the next oxidation, these areas will oxidize more rapidly and produce oxide "spots." These defects are not a result of any misprocessing during thermal oxidation.

> Heavy sodium contamination, followed by long anneals, will produce devitrification (145). Oxide defects that look like small flowers can be formed during subsequent deposition steps if local high concentration of the dopant oxides form separate phases. Also, long anneals after a high-concentration phosphorus diffusion have been shown to promote local SiO, crystallization (146). Oxides grown on heavily boron-doped wafers have small separate phase regions identified as amorphous B<sub>1</sub>O<sub>1</sub>, and high-temperature oxidations have some occluded silicon regions of perhaps 0.05 µm diameter (100). During oxidations in the 1100°C-1200°C range in atmospheres containing 10% HCl, small regions containing a separate, chlorine-rich phase have been reported.25 After several-hour oxidations, bubbles in the oxide sometimes occur and can be severe enough to cause oxide-silicon separation (147-148). The separation is apparently caused by a chlorine gas buildup at the interface.

> Heavy metals, as discussed in the next section, lead to crystallographic defects in the silicon wafer. In addition, however, metal precipitates that form near the silicon surface may extend into the oxide and cause a premature breakdown of MOS gate oxides, apparently because of an electric field enhancement from the small.

# 3.7.7 Oxide Physical Defects

<sup>&</sup>quot;To minimize this problem, the normal concentration range is from 2% to 4%.

rod-like precipitates. Both iron and copper have been shown to cause this problem (149).

3.7.8 Defects Generated in Silicon during Oxidation Two kinds of crystallographic defects commonly occur in silicon wafers after oxidation. First, saucer pits (S-pits) (150) that have densities in the  $10^4$ – $10^6$ /cm<sup>2</sup> range may occur. Second, oxidation-induced stacking faults (OSF or OISF) may occur, and although they occur at a much lower density, they are more deleterious to device yield.

S-pits, also sometimes referred to as haze or fog, can be seen after the oxide is stripped and the wafer is given a light etch in a delineant such as Wright etch.28 Their presence is a sign of furnace or wafer contamination with a precipitate-forming metal. For a given amount of contamination, the haze density will vary with the furnace removal rate. Slow rates will allow more and larger precipitates to grow. Thus, there is a conflict in processing requirements. Slip minimization requires a slow cooldown cycle, while S-pit suppression requires rapid cooldown. Cu, Ni, S, Ca, Fe, K, and Zn all have been reported as possible S-pit initiators (151). A small portion of Spit defects have, in turn, been shown to initiate the more damaging oxidation-induced stacking faults (152). Methods to minimize or eliminate S-pits and stacking faults are to keep the wafer and furnace as free as possible of contaminants and to use heavy phosphorus doping, mechanical damage, misfit dislocation, or oxygen precipitate intrinsic gettering to trap the metals (32, 154).

Stacking faults are more likely to occur in (100) material than in (111), and n-type (100) material has more stacking faults than p-type. It has been suggested that the higher density of bulk defects in the p-type (100) material provides for more internal gettering (32). High-temperature oxidations favor a high fault density. By the use of high-pressure oxidation, the temperature can be lowered, and as a consequence, the OSF density may be lowered as well (155–156). The growth of the faults is fed by the supply of silicon interstitials generated during oxidation. The experimentally observed fault length  $L_{cf}$  is described by Eq. 3.34 as follows:

$$L_{tr} = Kt^{2/3} e^{-2.3 \pi V/kT} \qquad 3.34$$

where K is a constant and t is the oxidation time (151). Thus, lowering the temperature or shortening the oxidation time minimizes the size of the faults.

<sup>&</sup>quot;For the formulation of this and other suitable etchants, see Chapter 6.

# KEY IDEAS 3

- Despite the fact that the bulk chemical, physical, and electrical properties of vapordeposited SiO<sub>2</sub> are close to those of thermally grown SiO<sub>2</sub>, the electrical properties of the oxide-silicon interface are substantially different, and, thus far, only the thermal oxide is adequate for pn junction passivation and MOS transistor gates.
- Under the influence of an electric field, sodium, potassium, and a few other ions are particularly mobile in SiO<sub>2</sub>.
- If mobile ions such as sodium are present in the oxide, the electrical fields present in devices during normal operation are often enough to cause the ions to migrate to the silicon-oxide interface and adversely affect device performance.
- For oxide growth using water vapor as the

PROBLEMS 3

- If a (111) silicon wafer is oxidized, how much silicon is consumed while 3000 Å of thermal oxide is grown? How much is consumed if the orientation is (100)?
- 2. If a sidewall oxide isolated transistor (Fig. 3.34b) is to be made with an isolating oxide thickness of  $1.2 \ \mu m$ , how deep should the depression in the silicon be if the oxide surface is to be level with the main silicon surface? Draw a sketch to show the steps.
- 3. A (100) 150 mm diameter 600 μm thick silicon wafer has 1 μm of oxide thermally grown on one side. Calculate the center deflection of the wafer if the stress in the oxide is 2 × 10<sup>s</sup> dynes/cm<sup>2</sup>. Note that neither Young's modulus *E* nor Poisson's ratio ν for silicon is constant for all directions lying in a (100) plane. For purposes of this problem, use values of *E* = 1.5 × 10<sup>s</sup> dynes/cm<sup>2</sup> and ν = 0.2. (For more details on the ani-

oxidant or when  $O_2$  is the oxidant and oxides more than a few hundred Å thick are grown, the Deal–Grove model is quite satisfactory.

- For dry oxides thinner than a few hundred Å, marked deviations from the Deal-Grove model occur, and several alternatives have been proposed.
- The oxidation rate depends on the wafer surface orientation, and (111) surfaces oxidize more rapidly than (100) surfaces.
- Heavily doped silicon oxidizes more rapidly than does high-resistivity silicon.
- The most common defects introduced into silicon during oxidation arise from metallic contamination either on the surface or in the silicon bulk.

sotropy of these properties, see J.J. Wortman and R.A. Evans, J. Appl. Phys. 36, p. 153, 1965.)

- Calculate the capacitance of a MOS gate that has a cross section of 2 μm by 2μm if the gate oxide is 400 Å thick and there are no spurious interface or oxide charges.
- If a MOS transistor is contaminated with potassium, how much, in atoms/cm<sup>2</sup>, is present at the oxide-silicon interface if the threshold voltage has changed by -0.2 V? Assume that the gate oxide is 400 Å thick.
- Using the data from Fig. 3.28, determine A and B for a 1100°C wet oxidation of a (100) wafer. Are the numbers consistent with the data of Table 3.7?
- Using the data from Fig. 3.27. determine A. B. and τ for a 1000°C dry oxidation of a (100) wafer.

- If the only uncontrolled variable in an oxidation is the moisture content of the oxygen used in a "dry" oxidation, estimate the maximum amount of water vapor that would be tolerable if a nominal 1200 Å thick oxide is to be reproducible to within 10%.
- Calculate the time required to grow a 1 μm thick layer of dry oxide at 1100°C on a wafer doped with 2 × 10<sup>30</sup> atoms/cc of phosphorus.

# REFERENCES 3

- M.M. Atalla, "Semiconductor Surfaces and Films; the Silicon–Silicon Dioxide System," in Harry C. Gatos, ed., Properties of Elemental and Compound Semiconductors, Interscience Publishers, New York, 1959.
- J.R. Ligenza and W.G. Spitzer, "The Mechanism for Silicon Oxidation in Steam and Oxygen," J. Phys. Chem. Solids 14, p. 131, 1960.
- W.A. Pliskin and R.P. Gnall. "Evidence for Oxidation Growth at the Oxide–Silicon Interface from Controlled Etch Studies." J. Electrochem. Soc. 111, pp. 872–873, 1964.
- W.H. Zachariasen, "The Atomic Arrangement in Glass," J. Am. Chem. Soc. 54, pp. 3841–3851, 1932.
- S.A. Schwarz and M.J. Schulz, Chap. 2, "Characterization of the Si–SiO, Interface," in Norman G. Einspruch and Robert S. Bauer, eds., VLSI Microelectronics Microstructure Science, Vol. 10, Academic Press, New York, 1985.
- J.R. Patel and Norio Kato, "X-Ray Diffraction Topographs of Silicon Crystals with Superposed Oxide Films II. Pendellosung Fringes: Comparison of Experiment with Theory," J. Appl. Phys. 44, pp. 971–977, 1977.
- E.P. EerNisse, "Stress in Thermal SiO<sub>2</sub> during Growth," Appl. Phys. Lett. 35, pp. 8–10, 1979.
- M.V. Whelan et al., "Residual Stresses at an Oxide-Silicon Interface," *Appl. Phys. Lett.* 10, pp. 262–264, 1967.
- E.A. Irene et al., "A Viscous Flow Model To Explain the Appearance of a High Density Thermal Oxide at Low Oxidation Temperatures," J. Electrochem. Soc. 129, pp. 2594–2597, 1982.

10. If 10 μm by 10 μm holes are cut in a 1 μm thick oxide grown on a (100) high-resistivity silicon wafer, how thick will the initially 1 μm thick oxide be by the time 1500 Å has been regrown in the windows? The second oxide is grown at 1000°C in dry O<sub>2</sub>.

- F.A. Taft, "The Optical Constants of Silicon and Dry Oxygen Oxides of Silicon at 5461 Å," J. Electrochem. Soc. 125, pp. 968–971, 1978.
- E.A. Irene et al., "Residual Stress, Chemical Etch Rate, Refractive Index, and Density Measurements on SiO, Films Prepared Using High Pressure Oxygen," J. Electrochem. Soc. 127, pp. 396–399, 1980.
- R.J. Jaccodine and W.A. Schlegel, "Measurement of Strains at the Si-SiO, Interface," J. Appl. Phys. 37, pp. 2429–2434, 1966.
- R.Glang et al., "Determination of Stress in Films on Single Crystal Silicon Substrates," *Rev. Sci. Iust.* 36, pp. 7–10, 1965.
- Carlo Grilletto, "An X-Ray Fluorescence Techmique for the Rapid Determination of Phosphorus in PSG Film," *Solid State Technology*, pp. 27–30, February 1977.
- W.A. Pliskin, "Use of Infrared Spectroscopy for Characterization of Dielectric Films on Silicon," in Howard R. Huff and Ronald R. Burgess, eds., *Semiconductor Silicon*/73, Electrochemical Society, Princeton, N.J., 1973.
- C.Y. Ting et al., "Interaction between Ti and SiO.," in Kenneth E. Bean and George A. Rozgonyi, eds., VLSI Science and Technology/ 84, Electrochemical Society, Princeton, N.J., 1984.
- D.J. Silversmith, "Dielectric Loss, Electrode Deterioration, and the Sodium Problem in MOS Structures," *J. Electrochem. Soc.* 119, pp. 121– 122 and earlier references therein, 1972.
- Judith A. Nemetz and Richard E. Tressler, "Thermal Nitridation of Silicon and Silicon

Dioxide for Thin Gate Insulators," Solid State Technology, pp. 209-216 and references therein, September 1983.

- R.P. Vasquez and A. Madhukar. "A Kinetic Model for the Thermal Nitridation of SiO<sub>2</sub>/Si," J. Appl. Phys. 60, pp. 234–242, 1986.
- M. Lenzlinger and E.H. Snow, "Fowler-Nordheim Tunneling into Thermally Grown SiO<sub>2</sub>," J. Appl. Phys. 40, pp. 278–283, 1969.
- C. M. Osburn and E.J. Weitzman, "Electrical Conduction and Dielectric Breakdown in Silicon Dioxide Films on Silicon," J. Electrochem, Soc. 119, pp. 603–609, 1972.
- N.J. Chou, "D.C. Conduction in SiO<sub>2</sub> Films at Elevated Temperatures," J. Elect. Mat. 1, pp. 344–347, 1972.
- Yu-Pin Han et al., "Ultra-Thin Gate–Oxide Characteristics and MOS/VLSI Scaling Implications," *IEDM*, pp. 98–102, 1982.
- Chi Chang et al., "Charge Tunneling and Impact on Thin Oxide Device Reliability," *Electrochem. Soc. Ext. Abst.* 83–1, abst. no. 394, 1983.
- J.K. Srivastava et al., "Electrical Conductivity of Silicon Dioxide Thermally Grown on Silicon," J. Electrochem. Soc. 132, pp. 955–963, 1985.
- 26 W. Shockley et al., "Mobile Electric Charge on Insulating Oxides with Application to Oxide Covered P-N Junctions," *Surface Science 2*, pp. 277–287, 1964.
- P.S.D. Lin et al., "Adverse Effects of Substrate Stacking Faults on Leakage and Breakdown Strength of Thin Oxide Capacitors," *Electrochem. Soc. Ext. Abst.* 82–2, abst. no. 215, 1982.
- C.M. Osburn and D.W. Ormond, "Sodium Induced Barrier Height Lowering and Dielectric Breakdown on SiO<sub>2</sub> Films on Silicon," J. Electrochem, Soc. 121, pp. 1195–1198, 1974.
- C. M. Osburn and D.W. Ormond, "Dielectric Breakdown in Silicon Dioxide Films on Silicon I: Measurement and Interpretation," J. Electrochem. Soc. 119, pp. 591–597, 1972.
- C.M. Osburn and D.W. Ormond, "Dielectric Breakdown in Silicon Dioxide Films on Silicon II. Influence of Processing and Materials," J. Electrochem. Soc. 119, pp. 597-603, 1972.
- M. Sternheim et al., "Properties of Thermal Oxides Grown on Phosphorus In-Situ Doped Polysilicon," J. Electrochem. Soc. 130, pp. 1735– 1739, 1983.

- P.W. Koob et al., "Reduction of Surface Stacking Faults on N-Type (100) Silicon Wafers," J. Electrochem. Soc. 133, pp. 806–810, 1986.
- S. Iwamatsu, "Effects of Plasma Cleaning on the Dielectric Breakdown in SiO, Film on Si." J. Electrochem. Soc. 129, pp. 224–225, 1982.
- C. M. Osburn, "Dielectric Breakdown Properties of SiO<sub>2</sub> Films Grown in Halogen and Hydrogen-Containing Environments," J. Electrochem. Soc. 121, pp. 809–815, 1974.
- Richard G. Cosway, "Effect of Oxidation Ambient on the Electrical Properties of Thin Silicon Dioxide," *J. Electrochem. Soc.* 132, pp. 3052– 3054, 1985.
- C.M. Osburn and N.J. Chou, "Accelerated Dielectric Breakdown of Silicon Dioxide Films," J. Electrochem. Soc., 120, pp. 1377– 1384, 1973.
- S. Holland et al., "Time-Dependent Breakdown of Thin Oxides," *Electrochem. Soc. Ext. Abst.* 86–2, abst. no. 396, 1986.
- J.J. Tzou et al., "Charge Generation and Breakdown in SiO<sub>2</sub>," *Electrochem. Soc. Ext. Abst.* 86-2, abst. no. 397, 1986.
- T.S. Taylor et al., "Wearout of Thin Silicon Dioxide Layers due to Trap Generation Associated with Tunneling Currents," *Electrochem. Soc. Ext. Abst.* 86–2, abst. no. 398, 1986.
- D.L. Crook, "Method of Determining Reliability Screens for Time-Dependent Dielectric Breakdown," Proc. Reliability and Physics of Failure Symposium, pp. 1–7, 1979.
- J.W. McPherson and D.A. Baglee, "Acceleration Factors for Thin Oxide Breakdown," J. Electrochem. Soc. 132, pp. 1903–1908, 1985.
- E.H. Snow et al., "Ion Transport Phenomena in Insulating Films," J. Appl. Phys. 36, pp. 1664– 1673, 1965.
- Bruce E. Deal, "The Current Understanding of Charges in the Thermally Oxidized Silicon Structure," J. Electrochem. Soc. 121, pp. 198C– 205C, 1974.
- G.F. Derbenwick, "Mobile Ions in SiO<sub>2</sub>: Potassium," J. Appl. Phys. 48, pp. 1127-1130, 1977.
- S.R. Hofstein, "An Investigation of Instability and Charge Motion in Metal-Silicon Oxide-Silicon Structures," *IEEE Trans. on Electron Dev. ED*-13, pp. 222-227, 1966.
- 46. M. Kuhn and D.J. Silversmith, "Ionic Contami-

nation and Transport of Mobile Ions in MOS Structures," J. Electrochem. Soc. 118, pp. 966– 970, 1971.

- J.M. Eldridge and D.R. Kerr, "Sodium Ion Drift through Phosphosilicate Glass-SiO<sub>2</sub> Films," J. Electrochem. Soc. 118, pp. 986–991, 1971.
- T.W. Hickmott, "Thermally Stimulated Ionic Conductivity of Sodium in Thermal SiO<sub>2</sub>," J. Appl. Phys. 46, pp. 2583–2595, 1975.
- D.R. Kerr et al., "Stabilization of SiO<sub>2</sub> Passivation Layers with P<sub>2</sub>O<sub>4</sub>," *IBM J. of Res. Dev.* 8, pp. 376–384, 1964.
- P. Balk and J.M. Eldridge, "Phosphosilicate Glass Stabilization of FET Devices." Proc. IEEE 57, pp. 1558–1563, 1969.
- (a) R.J. Kriegler et al., "The Effect of HCI and Cl<sub>2</sub> on the Thermal Oxidation of Silicon." J. Electrochem. Soc. 119, pp. 388–392, 1972. (b) P.H. Robinson and F.P. Heiman, "Use of HCI Gettering in Silicon Device Processing," J. Electrochem. Soc. 118, pp. 141–143, 1971.
- References in J. Monkowski, "Role of Chlorine in Silicon Oxidation: Part I," Solid State Technology, July 1979; "Part II," August 1979.
- Mao-Chieh Chen and John W. Hile, "Oxide Charge Reduction by Chemical Gettering with Trichloroethylene during Thermal Oxidation," J. Electrochem. Soc. 119, pp. 223-225, 1972.
- Edmond J. Janssens and Gilbert J. Declerck. "The Use of 1,1,1,-Trichloroethane as an Optimized Additive To Improve the Silicon Thermal Oxidation Technology." J. Electrochem. Soc. 125, pp. 1696–1703, 1978.
- D.W. Hess and R.C. McDonald, "Investigation of Silicon Etching and Silicon Dioxide Bubble Formation during Silicon Oxidation in HCI– Oxygen Atmospheres," *Thin Solid Films* 42, pp. 127–131, 1977.
- Ronald L. Meek, "Residual Chlorine in O<sub>2</sub>:HCT Grown SiO<sub>2</sub>," J. Electrochem. Soc. 120, pp. 308–310, 1973.
- Reda R. Razouk and Bruce E. Deal, "Dependence of Interface State Density on Silicon Oxidation Process Variables," J. Electrochem. Soc. 126, pp. 1573–1581, 1979.
- A.S. Grove, Physics and Technology of Semiconductor Devices, John Wiley & Sons, New York, 1967.
- 59. P. Balk et al., "Orientation Dependence of Built-

In Surface Charge on Thermally Oxidized Silicon." Trans. IEEE 53, pp. 2133-2134 (L), 1965.

- B.E. Deal et al., "Characteristics of the Surface-State Charge (Q<sub>s</sub>) of Thermally Oxidized Silicon," J. Electrochem. Soc. 114, pp. 266–274, 1967.
- (a) P. Balk, "Effect of Hydrogen Annealing on Silicon Surfaces," *Electrochem, Soc. Ext. Abst.* 14 (1), abst. no. 109, 1965. (b) P. Balk, "Low-Temperature Annealing in the Al-SiO<sub>2</sub>-Si System," *Electrochem. Soc. Ext. Abst.* 14 (2), abst. no. 111, 1965.
- B.E. Deal, E.L. MacKenna, and P.L. Castro, "Characteristics of Fast Surface States Associated with SiO<sub>2</sub>-Si and Si<sub>1</sub>N<sub>4</sub>-Si Structures," *J. Electrochem. Soc. 116*, pp. 997–1005, 1969.
- B.E. Deal and A.S. Grove, "General Relationship for the Thermal Oxidation of Silicon," J. Appl. Phys. 36, pp. 3770–3778, 1965.
- J.A. Costello and R.E. Tressler, "Isotope Labeling Studies of the Oxidation of Silicon at 1000°C and 1300°C." J. Electrochem. Soc. 131, pp. 1944–1947, 1984.
- A.G. Revesz et al., "Structure of SiO. Films as Revealed by Oxygen Transport," J. Electrochem. Soc. 133, pp. 586–592, 1986.
- A.C. Adams et al., "The Growth and Characterization of Very Thin Silicon Dioxide Films," J. Electrochem. Soc. 127, pp. 1787–1793, 1980
- William A. Tiller, "On the Kinetics of the Thermal Oxidation of Silicon IV: The Two-Layer Film Approximation," J. Electrochem, Soc. 130, pp. 501–506, 1983.
- A, Fargeix and G. Ghibaudo, "Dry Oxidation of Silicon: A New Model of Growth Including Relaxation of Stress by Viscous Flow," J. Appl. Phys. 54, pp. 7153–7158, 1983.
- J.K. Srivastava and E.A. Irene. "Measurement of the Effect of Intrinsic Film Stress on the Overall Rate of Thermal Oxidation of Silicon," *J. Electrochem. Soc.* 132, pp. 2815–2816, 1985.
- Eugene A. Irene et al., "Silicon Oxidation Studies: Orientation Effects on Thermal Oxidation," J. Electrochem. Soc. 133, pp. 1253–1256, 1986.
- J. Blanc, "A Revised Model for the Oxidation of Si by Oxygen," Appl. Phys. Lett. 33, pp. 424– 426, 1978.
- 72. S.M. Hu, "Thermal Oxidation of Silicon: Chem-

isorption and Linear Rate Constants." J. Appl. Phys. 55, pp. 4095–4105, 1984.

- 73. C.J. Han and C.R. Helms, "A Physical Model for Si Oxidation Kinetics in the Thickness Range from 30 Å to 1 μm," *Electrochem, Soc. Ext. Abst.* 86-1, abst. no. 157, 1986.
- Joseph Blanc, "On Modeling the Oxidation of Silicon by Dry Oxygen," J. Electrochem. Soc. 133, pp. 1981–1982, 1986.
- Hisham Z. Massoud et al., "Thermal Oxidation of Silicon in Dry Oxygen Growth-Rate Enhancement in the Thin Regime: Parts I and II," J. Electrochem. Soc. 132, pp. 2685–2700, 1985.
- H.Z. Massoud, C.P. Ho, and J.D. Plummer, Stanford University Tech. Rpt., in J.D. Plummer, ed., Computer Aided Design of Integrated Circuit Fabrication Processes for VLSI Devices, 1982.
- 77, J.D. Plummer. "The Role of the Si/SiO<sub>2</sub> Interface in Silicon Oxidation Kinetics," in Howard R, Huff and Rudolph J. Kriegler, eds., Semiconductor Silicon/81, Electrochemical Society, Pennington, N.J., 1981.
- E.A. Irene and Y.J. van der Meulen, "Silicon Oxidation Studies: Analysis of SiO, Film Growth Data," J. Electrochem. Soc. 23, pp. 1380–1384, 1976.
- M.A. Hopper et al., "Thermal Oxidation of Silicon," J. Electrochem. Soc. 122, pp. 1216–1222, 1975.
- Hisham Z. Massoud, James D. Plummer, and Eugene A. Irene, "Thermal Oxidation of Silicon in Dry Oxygen—Accurate Determination of the Kinetic Rate Constants," J. Electrochem. Soc. 132, pp. 1745–1753, 1985.
- Joseph R. Ligenza, "Effect of Crystal Orientation on Oxidation Rates of Silicon in High Pressure Steam," J. Phys. Chem. 65, pp. 2011–2014, 1961.
- E.A. Irene, "The Effects of Trace Amounts of Water on the Thermal Oxidation of Silicon in Oxygen," J. Electrochem. Soc. 121, pp. 1613– 1616, 1974.
- Hideo Sunami, "Thermal Oxidation of Phosphorus-Doped Polycrystalline Silicon in Wet Oxygen," J. Electrochem. Soc. 125, pp. 892–897, 1978.
- Eugene A. Irene et al., "Silicon Oxidation Studies: Silicon Orientation Effects on Thermal Ox-

idation," J. Electrochem. Soc. 133, pp. 1253-1256, 1986.

- S.I. Raider and L.E. Forget, "Reversal of Relative Oxidation Rates of <111> and <100> Oriented Silicon Substrates at Low Oxygen Partial Pressure," J. Electrochem. Soc. 127. pp. 1783– 1787, 1980.
- M. Sternheim et al., "Properties of Thermal Oxides Grown on Phosphorus In-Situ Doped Polysilicon," J. Electrochem. Soc. 130, pp. 1735– 1740, 1983.
- E.A. Irene, E. Tierney, and D.W. Dong, "Silicon Oxidation Studies: Morphological Aspects of the Oxidation of Polycrystalline Silicon," *J. Electrochem. Soc.* 127, pp. 705–713, 1980.
- L. Faraone and G. Harbeke, "Surface Roughness and Electrical Conduction of Oxide/Polysilicon Interfaces," J. Electrochem. Soc. 133, pp. 1410– 1413, 1986.
- Hideo Sunami, "Thermal Oxidation of Phosphorus-Doped Polycrystalline Silicon in Wet Oxygen," J. Electrochem. Soc. 125. pp. 892–897, 1978.
- C.Y. Lu and N.S. Tsai, "Thermal Oxidation of Undoped LPCVD Polycrystalline Silicon Films," *J. Electrochem. Soc.* 133, pp. 446–447, 1986.
- C.Y. Lu and N.S. Tsai. "Thermal Oxidation of Heavily Boron Implanted Polycrystalline Silicon Films," J. Appl. Phys. 59, pp. 3574–3576, 1986.
- Reda R. Razouk et al., "Kinetics of High Pressure Oxidation of Silicon in Pyrogenic Steam," J. Electrochem, Soc. 128, pp. 2214– 2220, 1981.
- Liang N. Lie et al., "High Pressure Oxidation of Silicon in Dry Oxygen," J. Electrochem. Soc. 129, pp. 2828–2834, 1982.
- C. Camelin and G. Demazeau, "High Pressure Dry Oxidation Kinetics of Silicon-Evidence of a Highly Stressed Structure," *Appl. Phys. Lett.* 48, pp. 1211–1213, 1986.
- E.A. Irene and R. Ghez, "Silicon Oxidation Studies: The Role of H.O." J. Electrochem. Soc. 124, pp. 1757–1761, 1977.
- A.G. Revesz and R.J. Evans, "Kinetics and Mechanism of Thermal Oxidation of Silicon with Special Emphasis on Impurity Effects." *J. Phys. Chem. Solids* 30, pp. 551–564, 1969.
- 97. M. Morita et al., "Fluorine Enhanced Thermal

Oxidation of Silicon in the Presence of NF,." Appl. Phys. Lett. 45, pp. 1312-1314, 1984.

- B.E. Deal and M. Selar, "Thermal Oxidation of Heavily Doped Silicon," J. Electrochem. Soc. 112, pp. 430–435, 1965.
- C.P. Ho et al., "Thermal Oxidation of Heavily Phosphorus-Doped Silicon," J. Electrochem, Soc. 125, pp. 665–671, 1978.
- 100. E.A. Irene and D.W. Dong, "Silicon Oxidation Studies: The Oxidation of Heavily B- and P-Doped Single Crystal Silicon," J. Electrochem. Soc. 125, pp. 1146–1151, 1978.
- 101. Charles P. Ho and James D. Plummer, "Improved MOS Device Performance through the Enhanced Oxidation of Heavily Doped n. Silicon," *IEEE Trans. on Electron Dev. ED-26*, pp. 623–630, 1979.
- 102. C.P. Ho and J.D. Plummer, "Si/SiO, Interface Oxidation Kinetics: A Physical Model for the Influence of High Substrate Doping Levels, I. Theory, II. Comparison with Experiment and Discussion," J. Electrochem. Soc., 126, pp. 1516– 1522 and pp. 1523–1530, 1979.
- 103. Krishna C. Saraswat and Harinder Singh. "Thermal Oxidation of Heavily Phosphorus-Doped Thin Films of Polycrystalline Silicon," J. Electrochem. Soc. 129, pp. 2321–2326, 1982.
- 104. John, J. Barnes et al., "Low Temperature Differential Oxidation for Double Polysilicon VLSI Devices," J. Electrochem. Soc. 126, pp. 1779– 1785, 1979.
- 105. Makoto Hirayama et al., "High-Pressure Oxidation for Thin Gate Oxide Insulator Process." *IEEE Trans. on Electron Dev. ED-29*, pp. 503– 507, 1982.
- 106 A.S. Grove et al., "Redistribution of Acceptor and Donor Impurities during Thermal Oxidation of Si," J. Appl. Phys. 35, pp. 2695–2707, 1964.
- 107. B.E. Deal et al., "Observation of Impurity Redistribution during Thermal Oxidation of Silicon Using the MOS Structure," J. Electrochem. Soc. 112, pp. 308–314, 1965.
- 108. A.S. Grove et al., "Diffusion of Gallium through a Silicon Dioxide Layer," J. Phys. Chem. Solids 25, pp. 985–992, 1964.
- 109. Richard B. Fair and J.C.C. Tsai, "Theory and Direct Measurement of Boron Segregation in SiO, during Dry, Near Dry, and Wet O. Oxidation," J. Electrochem. Soc. 125, pp. 2050–2058 and references therein, 1978.

- 110. P. Deroux-Dauphin and J.P. Gonchond, "The Influence of High Pressure Oxidation on Boron Redistribution in LOCOS Structures," J. Electrochem. Soc. 131, pp. 1418–1423, 1984.
- J.A. Amick, "Cleanliness and the Cleaning of Silicon Wafers," Solid State Technology, pp. 47–52, November 1976.
- 112, F.N. Schwettmann et al., "Variation of Silicon Dioxide Growth Rate with Preoxidation Clean," *Electrochem. Soc. Ext. Abst.* 78–1, abst. no. 276, 1978.
- Werner Kern and David, A. Puotinen, "Cleaning Solutions Based on Hydrogen Peroxide for Use in Silicon Semiconductor Technology," *RCA Review 31*, pp. 187–206, 1970.
- 114. A. Mayer and S. Shwartzman, "Megasonic Cleaning: A New Cleaning and Drying System for Use in Semiconductor Processing," J. Elect. Mat. 8, p. 885, 1979.
- 115. F. Pintchovski et al., "Thermal Characteristics of the H<sub>2</sub>SO<sub>4</sub>-H<sub>2</sub>O<sub>2</sub> Silicon Wafer Cleaning Solution," J. Electrochem. Soc. 126, pp. 1428– 1430, 1979.
- 116. R.C. Henderson, "Silicon Cleaning with Hydrogen Peroxide Solutions: A High Energy Electron Diffraction and Auger Electron Spectroscopy Study," J. Electrochem. Soc. 119, pp. 772–775, 1972.
- 117. Gary Grant et al., recent news paper presented at Electrochemical Society Meeting, New Orleans, October 1984.
- 118. J. Nulman, J.P. Krusius, and A. Gat, "Rapid Thermal Processing of Thin Gate Dielectrics— Oxidation of Silicon," *Electron Dev. Lett. EDL*-6, pp. 205–207, 1985.
- Yoshiyuki Sato and Kazuhide Kiuchi. "Oxidation of Silicon Using Lamp Light Radiation," J. Electrochem. Soc. 133, pp. 652–654, 1986.
- Y. Ota and S.R. Butler, "Reexamination of Some Aspects of Thermal Oxidation of Silicon," J. Electrochem. Soc. 121, pp. 1107–1111, 1974.
- W. Dietze et al., "The Preparation and Properties of CVDSilicon Tubes and Boats for Semiconductor Device Technology," J. Electrochem. Soc, 121, pp. 1112–1115, 1974.
- 122, Santos Mayo and William H. Evens, "Thermodynamic Considerations in the Use of Polysilicon Oxidation Tubes for Clean SiO, Preparation," J. Electrochem, Soc. 125, pp. 106–110, 1978.

- 123. Thomas G, Wolfe, Ray Roberge, and George A. Brown, "Effect of Gaseous Growth Ambient Impurities on the Electrical Properties of Gate Oxides," *Electrochem. Soc. Ext. Abst.* 85–2. abst. no. 200, 1985.
- L.P. Trombetta et al., "Electrical Properties of Silicon Dioxide Films Fabricated at 700°C III: High Pressure Thermal Oxidation," J. Electrochem. Soc. 132, pp. 2706–2713, 1985.
- 125. P.T. Panousis and M. Schneider, "High Pressure Steam Apparatus for the Accelerated Oxidation of Silicon," *Electrochem. Soc. Ext. Abst.* 73–1, abst. no. 53, 1973.
- 126. Robert Champagne and Monte Toole, "High-Pressure Pyrogenic Oxidation in the Production Environment," Solid State Technology, pp. 61– 63. December 1977.
- 127. R.J. Zeto et al., "Pressure Oxidation of Silicon: An Emerging Technology," Solid State Technology, pp. 62–69, July 1979.
- E. Bussmann, "High-Pressure Oxidation in n-Channel MOS Technology," Semiconductor International, pp. 162–165, April 1983.
- Kenneth E. Bean et al., "Semiconductor Processing Facility for Providing Enhanced Oxidation Rate," U.S. Patent 4,599,247, July 8, 1986.
- 130. E. Kooi and J.A. Appels, "Selective Oxidation of Silicon and Its Device Applications," in Howard R. Huff and Ronald R. Burgess, eds., Semiconductor Silicon/73, Electrochemical Society, Princeton, N.J., 1973. (See also the earlier references contained therein.)
- 131. R.J. Powell et al., "Selective Oxidation of Silicon in Low-Temperature High-Pressure Steam," *IEEE Trans. on Electron Dev. Ed-21*, pp. 636– 640, 1974.
- 132. H. Miyoshi et al., "Selective Oxidation of Silicon in High Pressure Steam," J. Electrochem. Sor, 125, pp. 1824–1829, 1978.
- E. Bassous et al., "Topology of Silicon Structures with Recessed SiO<sub>5</sub>," J. Electrochem. Soc. 123, pp. 1729–1737, 1976.
- 134. Kenji Shibata and Kenji Taniguchi, "Generation Mechanism of Dislocations in Local Oxidation of Silicon," J. Electrochem. Soc. 127, pp. 1383– 1387, 1980.
- 135, 1. Magdo and A. Bohg, "Framed Recessed Oxide Scheme for Dislocation-Free Planar Si Structures," J. Electrochem, Soc. 125, pp. 932– 936, 1978.

- Y. Tamaki et al., "Evaluation of Dislocation Generation on Silicon Substrates by Selective Oxidation," J. Electrochem. Soc. 130, pp. 2266– 2270, 1983.
- 137. D. Kahng et al., "A Method for Area Saving Planar Isolation Oxides Using Oxidation Protected Sidewalls." J. Electrochem. Soc. 127, pp. 2468–2471, 1980.
- 138. M. Inuishi et al., "Defect-Free Process of a Bird's Beak Reduced LOCOS," *Electrochem. Soc. Ext. Abst.* 86–2, abst. no. 273, 1986.
- 139. R.C. Y. Fang et al., "Defect Characteristics and Generation Mechanism in a Bird Beak Free Structure by Sidewall Masked Technique," J. Electrochem. Soc. 130, pp. 190–196, 1983.
- 140. E. Kooi et al., "Formation of Silicon Nitride at a Si–SiO, Interface During Local Oxidation of Silicon and during Heat Treatment of Oxidized Silicon in NH, Gas," J. Electrochem. Soc. 123, pp. 1117–1120, 1976.
- 141. Lynn O. Wilson, "Numerical Simulation of Gate Oxide Thinning in MOS Devices." J. Electrochem. Soc. 129, pp. 831–837, 1982.
- 142. J.C.H. Hui and W.G. Oldham, "Two-Dimensional Oxidation Mechanisms in Local Oxidation Process," *Electrochem. Soc. Ext. Abst. 83– 1*, abst. no. 384, 1983.
- 143. Lynn O. Wilson and R.B. Marcus, "Oxidation of Silicon: Cylinders and Spheres," *Electrochem. Soc. Ext. Abst.*, 86–2, abst. no. 547, 1986.
- 144. K. Yambe et al., "Thickness Dependence of Dielectric Breakdown of Thermal SiO, Films," *Electrochem. Soc. Ext. Abst. 83–1*, abst. no. 309, 1983.
- 145. R.L. Meek and R.H. Braun, "Devitrification of Steam Grown Silicon Dioxide Films," J. Electrochem. Soc. 119, pp. 1538–1544, 1972.
- 146. F.I. Alessandrini and D.R. Campbell, "Catalyzed Crystallization in SiO, Thin Films," J. Electrochem. Soc. 121, pp. 1115–1118, 1974.
- 147. J. Monkowski et al., "The Structure and Composition of Silicon Oxides Grown in HCI/O. Ambients," J. Electrochem. Soc. 125, pp. 1867– 1873, 1978.
- 148. D.W. Hess and R.C. McDonald, "Investigation of Silicon Etching and Silicon Dioxide Bubble Formation during Silicon Oxidation in HCI-Oxygen Atmospheres," *Thin Solid Films* 42, pp. 127–131, 1977.
- 149. Kouichirou Hondo et al., "Breakdown in Silicon

Oxides (II)—Correlation with Fe Precipitates," *Appl. Phys. Lett.* 46, pp. 582–584 and references therein, 1985.

- D. Pomerantz, "A Cause and Cure of Stacking Faults in Silicon Epitaxial Layers," J. Appl. Phys. 38, pp. 5020–5026, 1967.
- 151. C.W. Pearce and V.C. Kannan, "Saucer Pit Defects in Silicon," *Electrochem. Soc. Ext. Abst.* 83-1, abst. no. 307, 1983.
- 152. C.W. Pearce and R.G. McMahon, "Role of Metallic Contamination in the Formation of 'Saucer' Pit Defects in Epitaxial Silicon," J. Vac. Sci. Technol. 14, pp. 40–43, 1977.
- 153. W.T. Stacy et al., "The Role of Metallic Impurities in the Formation of Haze Defects," in Howard R. Huff and Rudolph J. Kriegler, eds.,

Semiconductor Silicon/81, Electrochemical Society, Pennington, N.J., 1981.

- 154. S.P. Murarka et al., "A Study of Stacking Faults during CMOS Processing: Origin, Elimination, and Contribution to Leakage," J. Electrochem. Soc. 127, pp. 716–724, 1980.
- 155. Makoto Hirayama et al., "High-Pressure Oxidation for Thin Gate Insulator Process," *IEEE Trans. Electron Dev. ED*-29, pp. 503-507, 1982.
- 156. L.E. Katz and L.C. Kimerling, "Defect Formation during High Pressure, Low Temperature Steam Oxidation of Silicon," J. Electrochem. Soc. 125, pp. 1680–1683, 1978.
- 157. S.P. Murarka and G. Quintana, "Oxidation Induced Stacking Faults in N- and P-Type (100) Silicon," J. Appl. Phys. 48, pp. 46–51, 1977.



# Deposition of Inorganic Thin Films

# 4.1 INTRODUCTION

As was briefly discussed in Chapter 2, much of an integrated circuit is composed of thin layers of either conducting or insulating films deposited over the semiconductor wafer surface. In general, the conducting films are metal, but they may also be metal silicides or polycrystalline silicon (poly, or polysilicon). Low-temperature polysilicon deposition for interconnects and MOS gates will be discussed in this chapter, but the metals and silicides will be considered in Chapter 10. Amorphous silicon, used in some solar cells, requires somewhat different deposition conditions and will not be discussed. Thermally grown silicon dioxide is universally used on silicon ICs. but other thin nonmetallic films are used for diffusion masks, diffusion sources, surface passivation (GaAs), capping<sup>1</sup> (GaAs), insulation between metal leads, capacitance dielectrics, and protective overcoats. In the case of gallium arsenide, since thermal oxides do not grow well and do not possess the required characteristics, all insulating films used in GaAs ICs must be deposited rather than grown in situ from the wafer surface itself as in silicon processing. Examples of peripheral applications involving other materials are the use of silicon carbide for X-ray projection printing masks, silicon carbide for wear-resistance coatings to thermal printheads, and ferroelectrics for polarizable MOS gates.

The common ways of depositing thin layers of insulating materials are vacuum evaporation and sputtering (often referred to as physical vapor deposition), coating the wafer with a liquid film that will form an inorganic insulating material when heated,<sup>2</sup> and chem-

Capping is the adding of an impervious layer to the GaAs wafer to prevent evaporation of arsenic from the surface during subsequent high-temperature operations.

Sometimes called spin-on glass, so named for the way it is applied. (See Chapter 5 for a discussion of methods of applying spin-on materials to wafers.)

ical vapor deposition (CVD).<sup>3</sup> Less often used is the conversion of thin-deposited metal films to their oxide by anodization and the coating of wafers with a frit slurry of a low-melting-point glass that will fuse and form a continuous layer when heated.

The primary materials used for protective overcoats are silicon nitride and doped silicon dioxide. Silicon monoxide is used occasionally in specialized applications such as overcoating thin-film resistors. Thin-film diffusion sources can be either doped CVD SiO<sub>2</sub> or a spin-on material that will thermally decompose into volatiles and a doped SiO<sub>2</sub> film. When thermal silicon dioxide cannot be used as a diffusion mask, either CVD SiO<sub>2</sub> or silicon nitride is usually satisfactory. Insulation between metal leads<sup>4</sup> is usually provided by doped CVD SiO<sub>2</sub>. GaAs surface passivation is also usually done with CVD SiO<sub>2</sub>. In some capacitor applications, a dielectric with a dielectric constant higher than that of SiO<sub>2</sub> is desirable. Currently, no preferred alternative exists, but TiO<sub>2</sub> is an example of a material with a substantially higher dielectric constant.

While a multitude of film properties must be considered, many properties depend on the choice of film material, and some depend on deposition conditions. Of these properties, the most important ones are listed in Table 4.1. The etching characteristics of most of the films are discussed in Chapter 6. The electrical properties, such as resistivity, interface charge density, and dielectric strength, for various thin-film dielectrics have been extensively studied with the goal of providing a usable substitute for silicon thermal oxide passivation. Thus far, they have not proven satisfactory for most such applications, but they are widely used for protective coatings against both mobile ions<sup>5</sup> and mechanical damage and as isolation between levels of interconnects.

For ordinary IC applications, the thermal expansivity is of interest only as it affects the mechanical stress in the film. The breaking strength of most films is in the 10<sup>10</sup> dynes/cm<sup>2</sup> range, and for films to remain crack free, the film stress should be less than about 10<sup>9</sup>

122

<sup>&#</sup>x27;CVD in this discussion is considered to include all chemical forms of deposition from the gas phase. Later in the chapter, a distinction will be made between various CVD techniques.

<sup>&</sup>lt;sup>4</sup>The terms *leads* and *interconnects* are used interchangeably in this and the remaining chapters to mean thin-film electrical interconnections between the various elecuit components comprising an integrated circuit.

<sup>&#</sup>x27;The effect of mobile ions is discussed in Chapters 3 and 11.

#### TABLE 4.1

Important Properties of Films

| Property                                 | Comment                                                                                                                                                  |  |  |  |  |
|------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| Etch rate                                | Rate must be high enough to etch film in a reasonable time, but not so high as to be uncontrollable.                                                     |  |  |  |  |
| Ease of etching                          | A readily available and relatively nontoxic<br>solvent is desirable.                                                                                     |  |  |  |  |
| Electrical resistivity                   | For most nonconductive applications, any<br>insulating material is satisfactory. (The<br>criteria for conductive layers are<br>discussed in Chapter 10.) |  |  |  |  |
| Interface charge density                 | This property is of importance when<br>material is deposited directly on a pn<br>junction or is to be used as a MOS gate<br>dielectric.                  |  |  |  |  |
| Dielectric strength                      | This property determines minimum<br>thickness for a given breakdown<br>voltage.                                                                          |  |  |  |  |
| Thermal expansivity                      | Difference in expansivity of wafer and file<br>contributes to internal stress.                                                                           |  |  |  |  |
| Thermal conductivity                     | This property is of concern only in special<br>applications.                                                                                             |  |  |  |  |
| Hardness                                 | This property determines scratch resistance.                                                                                                             |  |  |  |  |
| Mechanical stress                        | Excessive internal stress can crack film or<br>wafer.                                                                                                    |  |  |  |  |
| Effectiveness as a mobile ion barrier    | This property is very important for<br>protective coatings.                                                                                              |  |  |  |  |
| High-temperature flow<br>characteristics | Some films are reflowed after window<br>etching to make step coverage easier.<br>Thus, low-temperature flow is<br>necessary.                             |  |  |  |  |

dynes/cm<sup>2</sup> over the whole range of temperatures to which the films are subjected. The total stress<sup>6</sup>  $\sigma_f$  in the film is given by

$$\sigma_f = \frac{(\alpha_f - \alpha_s)E_f\Delta T}{1 - \nu_f} + \sigma_{if} \qquad 4.1$$

where  $\alpha_s$  is the wafer (substrate) expansivity,  $\alpha_f$  is the film expansivity,  $\Delta T$  is the temperature excursion,  $E_f$  is Young's modulus of the film,  $v_f$  is Poisson's ratio for the film, and  $\sigma_{if}$  is the internal stress of

<sup>&</sup>lt;sup>6</sup>A positive stress is tensile; a negative stress is compressive.

the film.  $\sigma_{if}$  is process dependent and can be highly variable. (A method of experimentally determining  $\sigma_f$  was discussed earlier in section 3.3.2.)

Sputtering as a method of depositing films has been known at least since 1852<sup>7</sup> and thus ranks as comparable in age to the evaporation method. However, probably because it does not require as high a vacuum as evaporation, sputtering seems to have been used much more extensively than evaporation prior to 1900. But, since the high pressures (typically 10–100 mtorr) allowed substantial contamination to occur, not until recently, when the procedure of an initial hard-vacuum pumpdown followed by introduction of a very-highpurity sputter gas was introduced, did sputtering become a widely accepted tool for semiconductor fabrication. Its main advantage is that deposition is independent of the temperature of the substrate, and films can therefore be deposited at room temperature. Unfortunately, however, lack of adhesion and/or film cracking may prevent the use of such films unless the temperature of the substrate is raised to about 200°C.

Fig. 4.1 is a schematic of a sputter deposition chamber. As

#### FIGURE 4.1

Schematic of a sputter deposition chamber. (The chamber may be very large so that many wafers can be placed on the wafer holder, or it may be just large enough for one slice to be sputtered at a time.)



7W.R. Grove, Phil. Trans., p.1, 1852.

# 4.2

### SPUTTER DEPOSITION

shown, a plasma is formed between the target (material to be deposited) and a metallic platen holding the wafers. Positive ions from the plasma are accelerated to the target, where they dislodge (sputter) molecules from it. Some of these molecules then strike and condense on the wafer surface. The ions used for sputtering are positive, and the target is negatively biased. This negative bias can be done directly by a DC power supply or accomplished by a charge buildup on the target when it is fed by a capacitance-coupled RF voltage. (For a further discussion of this voltage, see Chapter 6.) The use of an RF system also makes possible the sputtering of insulating materials such as silica.

The ions used for sputtering come from the residual gas in the chamber and, in principle, could come from any gas. Usually, an inert gas is preferred so that it does not react with the target material. Further, some ions are much more efficient than others at sputtering. In most cases, the efficiency increases in the order of He-N-Ne-N<sub>2</sub>-Ar-Kr-Xe, and the difference between He and Ar is as much as 10 times (1). This sequence is in the same order as the increases in the atomic or molecular weights, which is 4-14-20-28-40-84-131.

In sputtering two-component materials such as SiO<sub>2</sub> or Si<sub>1</sub>N<sub>4</sub>, the target can be the same as the material to be sputtered, which is sometimes a problem to construct, or the target may be made of one component while the sputter gas contains the other. Thus, sputtering a silicon target in a nitrogen ambient can give silicon nitride. This process is called reactive sputtering. The compound formation can occur at the target surface before sputtering so that it is the compound that is sputtered, or it can occur by a reaction at the substrate surface. Ordinarily, reactions would not be expected in the plasma because of the low density of reactants. When some materials are subjected to a plasma containing oxygen, oxidation occurs, and it appears that, in the case of the reactive sputtering of titanium to form TiO<sub>2</sub>, the oxide forms on the titanium surface and then is sputtered away (2). The electrical and physical properties of reactively sputtered SiO2 and Si1N4 have been extensively studied (3, 4), but the mechanism of their formation does not seem to have been considered. Even when conventional sputtering is used, the addition of an appropriate gas such as oxygen during oxide sputtering or nitrogen during nitride sputtering appears to help ensure stoichiometry and will often improve the electrical properties of the film (5, 6).

The kind of sputtering equipment configuration depends on the electrode arrangement. The simplest type is diode sputtering, either DC or RF, as was shown in Fig. 4.1. Diode sputtering has the disadvantage of having a relatively slow deposition (sputtering) rate because of the limited number of ions produced in the plasma. By installing a filament to supply additional electrons to the plasma (triode sputtering), the number of ions and thus the sputtering rate can be substantially increased (7). Another way of increasing the ion density is by using a magnetic field to spiral the electrons toward the cathode.<sup>8</sup> The electron path length is then increased, and thus the ion generation efficiency. This method is referred to as magnetron sputtering and is commonly used (8, 9).

## 4.3

### **EVAPORATION\***

Evaporating material in a hard vacuum (typically  $<10^{-5}$  torr) and allowing it to hit a substrate in the chamber, as shown schematically in Fig. 4.2, is a coating method that has been known for about as long as sputtering. However, it did not become widely used until much later, probably because of the lack of good vacuum pumps and good ways of heating the evaporate. The wrapping of wire of the desired evaporate around a twisted tungsten filament (introduced after 1900) provided a simple evaporation source for most metals and greatly increased the ease of evaporation deposition. Thus, by the time aluminum metallization was applied to semiconductors, evap-

#### FIGURE 4.2

Schematic of a simple evaporation system.



<sup>\*</sup>In electrical engineering parlance, the cathode is the most negative electrode. However, some plasma process engineers and some literature define the electrode coupled to an RF power supply as the cathode regardless of whether it is positive or negative with respect to the other electrode. (See the plasma section of Chapter 6 for more discussion of electrode voltages.)

'See reference 10.

oration was a well-established technique and, indeed, was much favored over sputtering.

The high vacuum is necessary so that the evaporant collides with a negligible number of gas molecules on its way to the substrate (wafers in this case). The rate of evaporation is dependent on the vapor pressure of the source (evaporant) material, so for a reasonable rate, the source temperature must be at least a few hundred degrees C. Thus, in most cases, a high vacuum is also required to minimize chemical reactions between the material being evaporated and the residual gases. For a minimal number of collisions, the mean free path L of the molecules in the chamber must be considerably greater than the distance from source to substrate. Several expressions have been developed for the mean free path, but the most common one is a modified Clausius derivation that gives

$$L = \frac{T/273}{\xi^2 \psi \pi \sqrt{2}} \tag{4.2}$$

where T is the temperature in K,  $\psi$  is the number of gas molecules/ cc, and  $\xi$  is the diameter of the gas molecules in cm.  $\xi$  can be estimated from Arnold's rule (see Chapter 2 in reference 10):  $\xi$  in Å =  $(V_m)^{1/3}$ , where  $V_m$  is the molar volume in cm<sup>3</sup> at the boiling point but is generally in the 2–5 Å range. At 0°C,  $\psi = 2.3 \leftarrow 2.7 \times 10^{19}P$ , where P is in atmospheres.

The deposition rate r will be given by the difference between the rate of arrival from the source and the rate of evaporation from the wafer surface. Unless the wafer is very hot, evaporation from its surface will be negligible, and the deposition rate will be proportional to the source evaporation rate. The source evaporation rate  $r_s$  is proportional to the equilibrium partial pressure of the source material at the evaporation temperature and is usually expressed in terms of grams/s. The substrate deposition rate is related to  $r_s$  through the deposition efficiency  $\eta$ , which is the ratio of the solid angle subtended by the substrate to the emission solid angle of the source. Thus, for evaporation from a small (point) source in which material is emitted in all directions (a  $4\pi$  solid angle),

$$\eta = \frac{A}{4\pi Z^2} \tag{4.3}$$

where Z is the distance from source to substrate and A is the substrate area projected onto a plane perpendicular to Z. If the source is a small crucible, then material is emitted only into a hemisphere, and Eq. 4.3 becomes

n

$$=\frac{A}{2\pi Z^2}$$
4.4

Elm Exhibit 2159, Page 146

Since the different components of multicomponent materials usually have different vapor pressures, it is difficult to deposit such materials by evaporation. Nevertheless, through the use of molten sources of an appropriate composition or by flash evaporation of a continuous stream of feed material, the composition can be reasonably well controlled. However, when aluminum interconnect leads doped with a small percentage of materials such as copper or silicon became necessary, sputtering proved to be a more practical method of deposition. (Metal deposition is covered in more detail in Chapter 10.) It is also difficult to vaporize high-melting-point materials such as fused silica, but by using an electron beam impinging on the surface of the material to be evaporated, satisfactory rates can be obtained. When evaporated quartz (fused silica) was first used as a protective overcoat for ICs, it was generally electron beam (E-beam) evaporated. However, RF sputtering is now the most common method.

Ion beam deposition uses a plasma to generate ions that can then be accelerated as desired toward the sputtering target as shown in Fig. 4.3 (11). By separating the ion generation, which must be done in low vacuum, from the main chamber, the wafers may be kept in a hard vacuum, thereby minimizing film contamination by the internal gas ambient. Further, a separate ion generator allows high power to be applied to it without the possibility of the wafers' overheating or suffering lattice damage from incident electrons. It is also possible to use the ion beam itself as the deposition material. By deflecting a focused beam, patterned depositions can be made directly on a wafer. (See also Sec. 5.14.)



Schematic of an ion beam deposition system. (The pressure in the vacuum chamber may be quite low—for example, 10<sup>-5</sup> torr.)



# 4.4

ION BEAM DEPOSITION

128

### 4.5 CHEMICAL VAPOR DEPOSITION

Chemical vapor deposition (CVD) is done by choosing either a compound that will decompose (pyrolyze) under the influence of heat to give the desired film or a set of compounds that will react to give the film. In both cases, the reaction must take place only at the wafer surface. If the reaction takes place partially or totally in the gas stream above the wafer, "fluff" and other forms of particulates will form. Such conditions generally give films with low density and many pinholes. In almost every case, CVD reactions that are useful in IC manufacturing can be characterized as surface catalyzed. That is, even though the reactants may be present in the gas ambient above the wafers and at the same temperature as the wafers, the reaction proceeds on the hot wafer surface and on any other hot surfaces, but not in the gas stream. However, some processes used in other industries do deliberately form particles in a high-velocity gas stream and then direct the stream to the surface to be coated. There, a portion of the particles stick, coalesce, and form a useful coating.

Specific reactions will be discussed in the following sections. However, an example of decomposition is the heating of silane (SiH<sub>4</sub>) to temperatures above about 400°C, at which point the silane decomposes to give silicon and hydrogen (12). An example of two components that react is the use of silane plus oxygen to give SiO<sub>2</sub>. In this particular case, a stream of silane in air will ignite spontaneously, and under some circumstances, silane–air mixtures will violently explode. However, when properly diluted, the mixture is well behaved and is widely used to produce silica films at temperatures as low as 400°C.

CVD techniques can be used to deposit amorphous, polycrystalline, or single-crystal layers. Single-crystal deposition is referred to as epitaxy and is discussed in detail in Chapter 7. This chapter is devoted solely to amorphous and polycrystalline depositions.

The CVD process has several distinct subdivisions. One, done at atmospheric pressure, is referred to as CVD or APCVD. When lower pressures are used, where diffusion effects are minimized, the operation is LPCVD (low-pressure CVD). If a plasma is used to generate ions or radicals that recombine to give the desired film, the process is PECVD (plasma-enhanced CVD). In some cases, photon energy is used instead of or as an adjunct to the thermal energy usually required for the reaction. These processes are referred to as photon-enhanced CVD. Silica and silicon nitride are often deposited by CVD. Silicon nitride is also widely deposited by PECVD. Polycrystalline silicon layers are generally deposited by LPCVD. Photon-enhanced CVD is currently still largely experimental. The choice of which process to use depends on factors such as the maximum temperature that can be used and the amount of stress and the

#### 130 CHAPTER 4 DEPOSITION OF INORGANIC THIN FILMS

#### FIGURE 4.4

Typical operating temperatures for depositions of various kinds of silicon dioxide and nitride.



number of pinholes that can be tolerated. The maximum temperature, in turn, depends on where in the process the layer is to be deposited. Fig. 4.4 shows typical temperature ranges for the deposition of various oxides and nitrides. If the goal is only to minimize diffusion, any method other than thermal oxidation will probably be satisfactory. If the layer is to be deposited over refractory contacts, the upper end of the CVD oxide temperature range will probably suffice. But, if deposition is to be over aluminum metallization, either the low end of the CVD oxide range or else PECVD will be required. If a coating is to be put over a unit with bonding wires attached, a very low temperature (perhaps photon-enhanced deposition) may be required.

4.5.1 Atmospheric Pressure and Low-Pressure Depositions

The deposition rate will depend on the rate of reactants arriving at the surface and the rate at which the surface reaction proceeds. Depending on which of these two factors is limiting, the observed rate is said to be, respectively, mass-transport controlled or kinetically controlled. Mass-transport limitations can occur either because of a limit on the concentration at the gas inlet or because the reactants can get to the surface only by diffusing through some sort of relatively stagnant gaseous layer (diffusion-limited). The kinetic region's behavior is largely dominated by effects at the growing interface and generally shows an Arrhenius behavior: rate  $= r = ae^{-E/RT}$ , where *a* is a constant, *E* is the activation energy in kcal/mol, *R* is the universal gas constant, and *T* is absolute temperature.<sup>9</sup>

EXAMPLE If a process has an activation energy of 1 eV,<sup>10</sup> what is the activation energy in kcal/mol?

From Appendix B, 1 eV = 3.832 × 10 <sup>20</sup> cal and 1 molecule =  $1 \text{ mol}/6.023 \times 10^{23}$ . Thus,  $1 \text{ eV/molecule} = 3.832 \times 10^{-30} \text{ cal/}$  $1 \text{ mol}/6.023 \times 10^{23} = 23.08 \times 10^3 \text{ cal/mol} = 23.08 \text{ kcal/mol}.$ 

For APCVD, the temperature dependence of the deposition rate for the various processes is generally characterized by a kineticcontrolled region at lower temperatures and a mass-transportlimited region at higher temperatures. Thus, a rate versus 1/T plot has the characteristics shown in Fig. 4.5a. In the kinetic-controlled region, if the temperature can be adequately controlled, the deposition rate across the wafer(s) can be quite uniform, as shown in Fig. 4.5b. However, in the mass-transport-limited region, where the rate depends on the flux of reactants, a substantial change of deposition rate can occur along the flow direction, as is also shown in Fig. 4.5b. The actual shape of the curve can vary widely. Some deposition conditions give a linear decrease with distance, and some give an



Effect of deposition temperature on deposition rate and uniformity.



"The more common expression found in solid-state physics literature for these kinds of relations is  $r = ae^{-E/kT}$ , where E is in eV per molecule (or per atom or electron, depending on the situation) and k is Boltzmann's constant in eV/K. In discussions of chemical processes, however, R instead of k is normally used, and the activation energy is expressed in kcal/mol instead of eV.

"Ordinarily, when eV is used in this context, the phrase "per molecule," while being implicit, is not written.

exponential decrease. By reactor gas flow or hardware changes, the profiles can be substantially flattened and, in some cases, may even be caused to tilt up at the trailing edge.

The gas flow in APCVD and LPCVD reactors can be broken into two general categories of flow: parallel to the wafer surface, as shown in Fig. 4.6a, and perpendicular to the surface of a series of wafers, as shown in Fig. 4.6b. The parallel flow, as actually embodied in CVD reactors, takes many forms. One of the oldest places wafers along a horizontal tube as shown in Fig. 4.7a. However, pancake and barrel configurations as shown in Figs. 4.7b and 4.7c, respectively, have been widely used in epitaxial CVD. Also used are single-wafer reactors as shown in Fig. 4.7d, which are most practical for wafers with diameters above about 100 mm. Parallel flow is used almost exclusively for APCVD, and perpendicular flow is normally found only in low-pressure systems. Modeling of deposition rates, which requires information on both the flow characteristics and the chemical reaction path, has been done for both kinds of systems. In the case of parallel flow, most of the work has been done for the specific case of silicon epitaxial growth. Modeling of flow as in Fig. 4.6b has been done for polysilicon and silicon nitride depositions.

In order to gain insight into the specific details of flow, several studies using either smoke patterns or holography have been made. In the case of parallel flow down a long, rectangular or circular tube, if the susceptor is the hottest part of the system, spiraling of the gas as it moves along (13), as well as one or more circulating gas cells,





Examples of reactor configurations in which gas flows across wafer surface.



(a) Horizontal tube configuration



(c) Barrel configuration

(d) Single-wafer configuration

#### FIGURE 4.8

Cross section of flow pattern in a tubular horizontal reactor. (Source: Adapted from R. Takahashi et al., J. Electrochem. Soc. 119, p. 1406, 1972.)



which in cross section appear as shown in Fig. 4.8 (14), can occur. The circulating cells, sometimes referred to as conduction rolls, are predicted mathematically and can lead to deposition nonuniformities (15). Often, a layer just above the susceptor/wafers will occur that appears to be laminar, and turbulent flow will fill the rest of the chamber (16, 17). The value of the Reynolds number" Re for a particular set of circumstances can be used to judge whether or not laminar flow will occur. In general, for a value greater than about 2000, turbulence will occur. However, under the influence of steep temperature gradients, as would be found in the gas above a heated substrate, turbulence is more likely, and it has been suggested that



Wafer

<sup>&</sup>quot;A series of dimensionless ratios are regularly used in fluid flow analysis, one of which is the Reynolds number Re. It is defined as  $Re = \rho V \ell/\mu$ , where  $\rho$  is the gas density, V is its velocity,  $\mu$  is its absolute viscosity, and  $\ell$  for this case is the free distance above the substrate/wafers.

the ratio of the Grashof number<sup>12</sup> Gr to  $Re^2$  is then a good criterion. For a given gas velocity, the Reynolds number for argon and nitrogen is nearly two orders of magnitude greater than it is for helium and hydrogen, and it has been observed in some studies that there is much more turbulence when argon and nitrogen are used (16). Observations of flow patterns in the geometry of Fig. 4.6b show that with increasing flow, eddies begin to form at the edges of the wafers but that by decreasing wafer spacing, higher velocities can be used before the eddies begin (18).

When gas flows over a flat plate (the substrate and wafers of parallel flow reactors), it is normally assumed that the gas velocity V is zero at the surface of the plate. The velocity increases normal to the plate until at some distance  $\delta$  it equals the velocity in the main stream,  $\delta$  increases with distance along the plate in the direction of flow and is approximately given by

$$\delta = \left(\frac{x\mu}{\rho V}\right)^{1/2} = \left(\frac{x\ell}{Re}\right)^{1/2}$$
 4.5

where x is the distance from the inlet along the gas stream,  $\rho$  is the gas density, and  $\mu$  is the absolute viscosity. The region between the plate and  $\delta$  is called the boundary layer and, when thin, is often assumed to be stagnant. If flow is between two plates or is, for example, in a tube, and when the length of the flow path is enough, the upper and lower boundary layers will meet, making the flow "fully developed." The boundary layer then cannot be considered stagnant; rather it has a velocity that ranges from a maximum in the center to zero at the surfaces.

Note that the values of gas properties such as  $\rho$  and  $\mu$  to be used in Eq. 4.5 and similar calculations are the ones appropriate for the reactor chamber operating temperature. Hence, ordinary roomtemperature handbook values will not suffice. Some high-temperature values are readily calculable, but more often they are available in the form of polynomial expressions that have been fitted to experimental points. A useful example is the viscosity of hydrogen, given in micropoise by (19)

$$\ln\left(\frac{\mu}{S}\right) = A \ln T + \frac{B}{T} + \frac{C}{T^2} + D \qquad 4.6$$

134

<sup>&</sup>lt;sup>17</sup>The Grashof number is defined as  $Gr = p^2 g \ell^2 \beta \Delta T/\mu^2$ , where g is the gravitational constant and  $\beta$  is the gas expansion coefficient. For an ideal gas, the expansion coefficient  $\beta$  is just 1/T, so in the ratio of Gr to  $Re^2$ , given by  $Gr/Re^2 = g(\Delta T/TV^2)$ , no gas property is involved and the kind of gas used should not affect the onset of thermal-gradient-induced turbulence. Experimentally, turbulence for ratios greater than 0.5 (when expressed in cgs units) have been reported (13).

where S = 88.0, A = 0.68720, B = -0.61732, C = -111.49, D = -3.9001, and T is the temperature in K.

EXAMPLE

If the inlet flow of hydrogen into a 100 mm diameter tube furnace operating at atmospheric pressure and 1100°C is 100 liters/minute, what are the density and the velocity of the gas in the tube if the boundary layer thickness is considered to be negligible?

Since the volume of a gas at constant pressure varies as  $V_i/T_1$ =  $V_2/T_2$ , the density will vary inversely with temperature; that is,  $\rho = (T_i/T_2)(0.090)$  gram/liter, or  $\rho = (273/1373)(0.090) = 0.018$ gram/liter. The change in flow rate due to the increased temperature in the tube is the same as the change in gas volume. Thus, the new flow rate is  $100(T_2/T_1) = 500$  liters/minute. The velocity equals flow rate divided by cross-sectional area, or  $5 \times 10^5$  cm<sup>3</sup>/minute/ 78.5 cm<sup>2</sup> = 6369 cm/minute = 106 cm/s.

Even though reactants are carried along by the flow, they must diffuse through the stagnant layer to reach the wafer surface. When the temperature is high enough for no kinetic limitations, the choice of assumptions made about the nature of the layer gives rise to various expressions for deposition rate. If no stagnant layer is considered, a linear drop in rate along the gas path is predicted because of reactant gas depletion (20). If a uniformly thick stagnant layer is considered, the falloff in rate is exponential (21). There are also solutions assuming the boundary layer thickness increases as in Eq. 4.2 (22), where it increases but is not assumed to be stagnant (23) and where it is assumed that the flow is fully developed before the first wafer (24). Models for the barrel-type reactor of Fig. 4.7c are also available but are considerably more complex than those for the simple tubular geometry (19, 25, 26) of Fig. 4.7a.

In any of these cases, since it is assumed that there is no reaction rate limitation, the flux J of reacting species moving from the gas stream to the surface will determine the deposition rate. Travel through the boundary layer will be by diffusion, and the flux in a simple one-dimensional case will be given by

$$J = \frac{-D\partial C}{\partial X}$$
 4.7

where D is the diffusion constant (coefficient) and C is the concentration of reactant. (Diffusion behavior is discussed in detail in Chapter 8.)

The gaseous diffusion constants are very large compared to those encountered in solid-state diffusion and more often are calculated instead of being experimentally measured. As an example of the values to be expected, D for SiCl<sub>4</sub> in H<sub>2</sub> at 1500 K is 5.7 cm<sup>2</sup>/s. Several equations have been developed, although most of

Elm Exhibit 2159, Page 154

them give comparable results (25). Simple theory predicts that D should vary as  $T^{h}/P$  where b = 1.5 and P is the gas pressure. Experiment and more elaborate theory substantiate the 1/P dependency but yield a b value of between 1.75 and 2.

The perpendicular wafer geometry of Fig. 4.6b, desirable because of the relatively inexpensive equipment and the large number of slices that can be processed per run, behaves differently from the other geometries in that no flow occurs between wafers. Not only must reactants move from the main gas stream to the edges of the wafers, but also they must then diffuse down between the wafers and produce a uniform deposit. Even if the system were operating in the kinetically controlled region, a problem of radial reactant depletion over large-diameter wafers would still exist unless the spacing between wafers were inordinately large. However, by operating at reduced pressure, diffusion can be substantially increased, and the geometry becomes practical (27). For an LPCVD system operating at 0.5 torr (atmospheric pressure = 760 torr), the diffusion constant, and consequently the flux of reactant, is approximately 1500 times that at atmospheric pressure. Note, however, that if the surface reaction generates more moles of gaseous products than are consumed, their outward flow will impede the inward diffusion flow of fresh reactant. For example, in the decomposition of tetraethylorthosilicate (TEOS) to produce SiO<sub>3</sub>, 1 mol TEOS diffusing to the wafer surface produces 1 mol solid SiO2 on the surface and liberates 4 mol ethylene and 2 mol water. This effect is sometimes called convection via chemical reaction and can cause severe radial deposition nonuniformity unless a large percentage of the reactor gas is the inert carrier (28).

Like the barrel configuration, the modeling of the tubular LPCVD system is considerably more involved than is the modeling for the simple flat-plate horizontal reactor. The problem is further complicated because, in many cases, tubular LPCVD is used to deposit compound layers such as silicon nitride, and the equations to be solved must then include many more reactions and reactants. Also often included is consideration of the fact that while the final deposition reaction is surface catalyzed, there may be a partial decomposition of reactants in the gas stream (29–31).

Several equipment configurations have been used to minimize the effects of reactant depletion along the direction of flow. One, used in tube deposition chambers where the wafers lie flat,<sup>13</sup> is to have wafers on a stand tilted with respect to the gas flow so that

136

<sup>&</sup>quot;This geometry is largely obsolete.

those downstream project higher into the stream. This geometry increases the velocity of gas since the cross-sectional area is decreased. The increased velocity then decreases the thickness of the boundary layer so that more reactant can reach the wafer surface (20). The temperature of the wafers can be increased along the path of gas flow so that the increased deposition rate due to the higher temperature can at least partially compensate for a decrease due to depletion. Tubular LPCVD systems usually use this approach (32). Another method, applicable to pancake geometries, and used in some plasma processing equipment, provides an inward radial flow so that the depleting reactants are concentrated as they approach an exhaust port in the center of the wafer holder (33).

In the kinetically controlled region, the deposition rate is limited, not by material transport, but by the rate of some chemical reaction. During deposition, a whole series of reactions must occur, including adsorption and desorption of one or more species on the deposition surface. If the limiting reaction is one that occurs in the gas phase (in other words, the production of an intermediary through  $A + B \rightarrow C$ ) the deposition rate r of species C will be of the form

$$r = K[A][B] \tag{4.8}$$

where [A] and [B] denote the concentration of species A and B and K is the experimentally determined rate constant. When a surface adsorption step is limiting, as appears to be the case in most CVD processes, the rate equation is much more complex and will vary depending on the specific adsorption details. In this case, some variation of the Langmuir–Hinshelwood model can often be used for rate predictions.<sup>14</sup>

When a gas is excited by a high enough electric field—for example, in the reaction chamber of a plasma deposition reactor such as the one shown in Fig. 4.9—a glow discharge (plasma) is formed. In the plasma, high-energy electrons exist that can impart enough energy to reaction gases for reactions that normally take place only at high temperature to proceed near room temperature. The reactor looks superficially like the sputtering equipment shown in Fig. 4.1, but there are some substantial differences. In plasma-enhanced CVD (PECVD), the inlet gas contains the reactants for deposition, and the anode instead of being sputtered away remains unaffected. The voltage applied to sputtering electrodes may be either RF or DC,

4.5.2 Plasma-Enhanced CVD

<sup>&</sup>lt;sup>14</sup>For more details on reaction rates predicted for the more complex reactions, refer to a standard chemistry text or to Don W. Shaw, Chapter 1, in C.H.L. Goodman, ed., Crystal Growth-Theory and Techniques, Plenum Press, London, 1971.



depending on the specific mode of operation, but plasma deposition requires RF voltage.

The glowing (plasma) region will contain, in addition to the free electrons, normal neutral gas molecules, gas molecules that have become ionized, ionized fragments of broken-up gas molecules, and free radicals.<sup>15</sup> Deposition occurs when the molecules of incoming gases are broken up in the plasma and then the appropriate ions are recombined at the surface to give the desired film. Thus, molecule *AB* could be broken into ions *A* and *B*, and molecule *CD* into ions *C* and *D*. *A* and *D* could then react at the wafer surface to give film *AD*. The actual reactions that occur are thought to be considerably more complicated than those encountered in ordinary CVD, and modeling of the deposition process is not very extensive (35).

The fraction of the gas molecules ionized will generally be from 0.1% to 1%; the range of electron energies, from 1 eV to 20 eV; and the density of ions, in the  $10^9-10^{12}/\text{cm}^3$  range (34). The gas plasma is separated from the electrodes and the confining chamber by a dark region (ion sheath). All electrons, ions, neutral molecules, neutral

<sup>&</sup>lt;sup>15</sup>Free radicals, usually designated with an asterisk after their chemical symbol (e.g., F<sup>\*</sup>), are neutral species whose electrons have absorbed extra energy and hence are quite active chemically. Since the electrons rather quickly lose their energy, the life of a radical is short.

fragments, and free radicals may leave the plasma, either by diffusion or because of additional velocity imparted to them either by impact with other particles or by the applied RF field. Since the gas is excited by an RF field, the electrons with their light mass will reach a much higher velocity during a cycle than will the ions. Electrons will thus have very high effective temperatures, while the ions have effective temperatures not far above room temperature. The plasma can be used as an adjunct or alternative to high temperatures for providing the energy necessary for a reaction to proceed. Therefore, PECVD ordinarily will have lower operating temperatures than will APCVD or LPCVD and will be used when lower temperatures are needed.

The low temperatures of the gas and the substrate reduce the mobility of the reactants impinging on the wafer deposition surface. Thus, plasma-deposited layers are usually amorphous and often contain higher percentages of extraneous reaction components than would be expected of high-temperature CVD depositions. A negative voltage of a few volts exists between the plasma and the wafer, and if any positive ions are present in the plasma, they will have gained substantial energy by the time they strike the wafer. This energy may help in producing a more defect-free film, but it can also cause damage to the substrate. In addition, even though the wafer is negative with respect to the plasma, some of the high-energy electrons in the plasma escape and strike the wafer, causing radiation damage. Usually, some of the subsequent high-temperature processing steps will anneal out the damage. However, by using an alternative reactor geometry in which the plasma is located away from the wafers, plasma-produced neutral molecule fragments and free radicals can still reach the wafers, while the electrons and ions are prevented from reaching them (36). Such reactors are often referred to as "afterglow" and usually use a microwave cavity to produce the plasma. Because of the limited size of the cavity, the plasma products must be dispersed over a wide area and large volume in order to cover the area of typical production wafers (37).

4.5.3 Photon-Enhanced Energy for the or CVD absorption of ph ways in which th of energy by the

Energy for the deposition reaction can sometimes be obtained by absorption of photons rather than from high temperatures. The two ways in which this absorption can occur are by the direct absorption of energy by the reactants and by a "photosensitizer" that absorbs the light energy and then transfers energy to the reactants by collision. In the case of SiH<sub>4</sub> and NH<sub>3</sub> reacting to form Si<sub>3</sub>N<sub>4</sub>, both silane and ammonia will directly absorb photon energy when the wavelength is less than about 2200 Å. If mercury vapor is also present, it will very strongly absorb at 2537 Å and transfer enough energy to the reactants for the reaction to proceed. The rate varies linearly with UV intensity, and it appears that the mercury is adsorbed on the surface where it absorbs energy from the photons and then transfers it to adsorbed reactants. In other cases, it is likely that mercury vapor absorbs energy and then transfers it to some gaseous species to produce a precursor that then further reacts on the surface. Processes for SiO<sub>2</sub>, Si<sub>3</sub>N<sub>4</sub>, and amorphous silicon have been reported (38–41).

The impetus for much of the early work on CVD silicon dioxide seems to have been the desire to produce an oxide that could be used as a capacitor dielectric. However, even before 1946, the burning of methyl silicate was used to produce an in situ SiO<sub>2</sub> optical coating on glass (42). In 1961, SiO<sub>2</sub> produced by the pyrolytic decomposition of tetraethoxysilane [Si(OC<sub>2</sub>H<sub>3</sub>)<sub>4</sub>], ethyltriethoxysilane [C<sub>2</sub>H<sub>5</sub>Si(OC<sub>2</sub>H<sub>5</sub>)<sub>3</sub>), and other alkoxysilanes was proposed as a diffusion mask for germanium<sup>16</sup> (43). Since then, several other reactions have been studied, including the most commonly used reactions such as the pyrolytic decomposition of tetraethoxysilane (TEOS, tetraethylorthosilicate, ethyl silicate) and the SiH<sub>4</sub>-oxygen reaction.

For TEOS deposition, a carrier gas such as nitrogen is bubbled through the TEOS to provide a vapor, which is then fed into a reactor containing the wafers. The wafers as well as the gas must be kept at an elevated temperature. If the vapor is heated to the cracking temperature and the products are allowed to condense on a cool substrate, hard polymers are formed that have many of the characteristics of silica. However, only when the wafers are also heated is pure SiO, deposited (44). The reactor may be tubular with the slices placed normal to the tube axis (45) as shown in Fig. 4.6b, or it may have a pancake configuration with the wafers lying flat on the heater. Temperatures for deposition can range from about 400°C to at least 1100°C. However, at low temperatures, the deposition rate is low, and the deposit is soft and porous. At the higher temperature ranges, a dark film is sometimes obtained, probably because of elemental carbon. Temperatures near 800°C provide a reasonable deposition rate and good-quality films. In this, and indeed in any of the other deposition processes as well, if the wafer surface is not scrupulously clean, excessive pinholes will occur in the film. Thus, having a good cleaning procedure is very important. Further, since deposition on all of the hot surfaces will occur, if the reactor is not periodically

### 4.6

DEPOSITION OF SPECIFIC MATERIALS 4.6.1 CVD Silicon Dioxide

<sup>&</sup>quot;Germanium, like gallium arsenide, lacks a native oxide suitable for surface passivation, diffusion masking, and so on.

cleaned, particulates will flake off and get on the wafers while they are in the reactor. To minimize this effect, wafers can be placed vertically or even held upside down in the reactor.<sup>17</sup>

The reaction of compounds such as silicon tetrachloride (SiCl<sub>4</sub>) or silicon tetrabromide (SiBr<sub>4</sub>) with oxygen will also, in principle, produce silica. However, in order to maintain better control of the film properties, most processes using the halides do not use pure oxygen but rather use some compound such as H<sub>2</sub>O, CO<sub>2</sub>, or NO<sub>2</sub> to supply the oxygen (46, 47). By the use of an oxygen source such as CO<sub>2</sub>, which is in itself nonoxidizing, a silica film can be grown directly after an epitaxial deposition without an extensive purging of the epitaxial reaction by-products. With an oxidizer and no purging, various products other than pure SiO<sub>2</sub> are formed. The overall CO<sub>2</sub> reaction can be written as

$$SiCl_4 + 2CO_7 + 2H_7 \rightarrow SiO_7 + 2CO + 4HCl$$
 4.9

However, the actual reaction must proceed with some intermediate steps since without excess hydrogen, no silica is formed (46). The use of SiBr<sub>4</sub> instead of SiCl<sub>4</sub> allows the use of a lower deposition temperature and was of interest for germanium before the introduction of silane-produced silica as described next.

The use of silane, which allows  $SiO_2$  to be deposited at a lower temperature, was first reported in 1967 (48). With  $O_2$  used, the reaction often assumed is

$$SiH_4 + 2O_2 \rightarrow SiO_2 + 2H_2O \qquad 4.10$$

However, when the temperature is below about 500°C and the silane concentration is low, no water has been detected (49), thus leading to the reaction of Eq. 4.11 as more probable in that temperature range:

$$SiH_4 + O_2 \rightarrow SiO_2 + 2H_2$$
 4.11

Above about 600°C, water vapor has been observed (50).

As in the SiCl<sub>4</sub> reaction, oxygen can be supplied by other oxygen sources, such as  $CO_2$  (51). The actual reaction that occurs is not clear since, while the production of water in an intermediate step is anticipated, the experimental data indicate otherwise (52). An advantage of using  $CO_2$  instead of oxygen is that, as before, if a deposited oxide is to be added immediately after epitaxy, there is no

<sup>&</sup>quot;An "upside down" reactor heated by infrared heat lamps was used for silane oxide deposition as early as 1967.

oxygen to react with residual gases from the epitaxial deposition and cause particulates (51). By adding HCl—that is,

$$SiH_4 + CO_2 + HCI \rightarrow SiO_2 + \cdots$$
 4.12

a deposited oxide on silicon is produced that has interface properties close to those of thermal oxide (53).

Silicon dioxide films were plasma deposited as early as 1965 (54), but it was not until 1987 that machines specifically designed for depositing plasma oxide were marketed. Plasma films generally have a lower intrinsic stress than CVD films do and thus are less likely to crack when they are deposited in thick layers. The stress of plasma silica can be changed from compressive to tensile by changing the operating conditions, as is shown in Fig. 4.10 for one machine configuration (55). Step coverage will generally be better than is obtained with atmospheric pressure CVD (56). As is common with PECVD materials, stoichiometry can deviate somewhat from SiO<sub>2</sub>, and a substantial amount of hydrogen may be incorporated in it (57). The usual deposition temperature range is from 200°C to 400°C, and for material deposited over this range, the etch rate decreases as the temperature increases.

Reactions that can be used include the following (58):

$$\mathrm{SiH}_4 + \mathrm{O}_2 \rightarrow \mathrm{SiO}_2 + 2\mathrm{H}_2 \tag{4.13}$$

$$SiH_4 + 2N_2O \rightarrow SiO_2 + 2H_2 + 2N_2 \qquad 4.14$$

$$SiH_4 + CO_2 \rightarrow SiO_2 + CH_4$$
 4.15

$$Si(OC_2H_5)_4 \rightarrow SiO_2 + \cdots$$
 4.16

 $\begin{array}{c} 2 \\ 1.5 \\ 1.5 \\ 1.6 \\ 0.5 \\ 0.5 \\ 0.5 \\ -0.5 \\ -1.5 \\ -2 \\ 200 \end{array}$ 

#### 4.6.2 Plasma Silicon Dioxide

FIGURE 4.10

Film stress versus deposition pressure for PECVD silicon oxide deposition. (Source: Adapted from E.P. van de Ven et al., Proc. 4th IEEE V-MIC Conference, June 1987.) When pure oxygen is used (reaction in Eq. 4.13), excessive particulates and pinholes as well as general nonreproducibility are observed (54, 59). The reaction in Eq. 4.9 is the one generally used and, because of the nitrogen, may leave a small percentage of nitrogen in the film (57).

Doped oxides are used for diffusion sources (see Chapter 8), ion migration barriers (see Chapter 3), isolation between conductor layers (see Chapter 10), and as a planarizer to fill in, for example, the trenches of trench capacitors. The doping for diffusion sources will generally be either antimony, arsenic, phosphorus, or boron. For ion migration protection, a small percentage of phosphorus is required. Isolation between conductive layers (interlevel oxide) usually will have phosphorus to act as an ion migration barrier, but the prime reason for doping is to decrease the reflow temperature.<sup>18</sup> As will be discussed in Chapter 10, a major difficulty in interlevel connections is the running of leads over abrupt steps such as those that occur in contact holes in the interlevel oxide. If the softening point of the interlevel oxide can be reduced enough, then it can be reflowed after the windows are cut in order to round off the abrupt edges. Similarly, after an oxide is deposited over a series of leads, the abrupt shoulders can be smoothed by reflowing.

Doped oxide films can be made by introducing dopant compounds such as phosphine (PH<sub>3</sub>) or diborane (B<sub>3</sub>H<sub>4</sub>) along with SiH<sub>4</sub> and an oxygen source. The dopants then react to give an oxide (P<sub>2</sub>O<sub>3</sub> or B<sub>2</sub>O<sub>3</sub> in this case) that will co-deposit with the SiO<sub>2</sub> to form the appropriate mixed silicate. An extensive list of dopants that have been used in conjunction with TEOS depositions is summarized in Table 4.2 (60). PH, and B2H6 are commonly used when silane is the source of silicon. The reactions of dopant and the silica source are not independent in that the introduction of a dopant will often substantially change the oxide deposition rate. The deposition rate of silica from undoped TEOS is negligible at 600°C, but the addition of phosphine (PH<sub>4</sub>) leads to a usable rate (60). Further, the introduction of a second dopant may materially affect both of their deposition rates. For example, when PH<sub>3</sub> or B<sub>2</sub>H<sub>6</sub> is used singly in conjunction with SiH, in a plasma reactor, the concentration of boron or phosphorus in the silica increases with an increase in the B<sub>2</sub>H<sub>2</sub>/SiH<sub>4</sub> or

4.6.3 Doped Silicon Oxide

<sup>&</sup>lt;sup>18</sup>The term *reflow temperature* is somewhat subjective but is the temperature at which the viscosity of the glass is low enough for surface tension to appreciably smooth sharp corners and edges in the original glass surface. This expression does not correspond to any of the terms *annealing point*, *softening point*, or *deformation point*, which are widely used in the glass industry and which are defined in terms of specific viscosities.

| TABL | E 4.2 |
|------|-------|
|------|-------|

| Materials Used with TEOS to |
|-----------------------------|
| Produce Doped CVD Oxide     |

| Dopant                    | Dopant Source*                      | Deposition<br>Pressure | Deposition<br>Temperature (°C) | Film Application                    |
|---------------------------|-------------------------------------|------------------------|--------------------------------|-------------------------------------|
| B                         | Process                             | AP                     | 650-730                        | Diffusion source                    |
| В                         | $B(OCH_3)_3$<br>B(OCH_3)_3 (+O_2)   | LP                     | 680, 750                       | Trench filling, diffusion<br>source |
|                           | B(OC <sub>3</sub> H <sub>7</sub> ), | LP                     | 750                            | Diffusion source                    |
|                           | $B(OC_3H_7)_3(+O_3)$                | AP                     | 475, 800                       | Diffusion source                    |
| P                         | $PH_3 + O_2$                        | LP                     | 650                            | Diffusion source                    |
|                           | $POCl_3 + O_7$                      | LP                     | 725                            | Flow glass                          |
|                           | PO(OCH <sub>1</sub> ) <sub>1</sub>  | AP                     | 740, 800                       | Diffusion source                    |
|                           | PO(OCH <sub>3</sub> ) <sub>3</sub>  | LP                     | 650-800                        | Flow glass                          |
|                           | P(OCH <sub>1</sub> ) <sub>3</sub>   | LP                     | 750                            | Diffusion source                    |
|                           | $P(OC,H_s)$                         | LP                     | 750                            | Diffusion source                    |
| $\mathbf{B} + \mathbf{P}$ | $B(OCH_3)_3 + PH_3/O_2$             | LP                     | 620-700                        | Trench filling, flow glass          |
| ~                         | $B(OCH_1)_1 + PO(OCH_1)_1$          | LP                     | 680                            | Flow glass                          |
|                           | $B(OCH_1)_3 + P(OCH_1)_3$           | LP                     | 650-725                        | Flow glass                          |
| As                        | $AsCl_1 + O_2$                      | AP                     | 500-700                        | Diffusion source                    |
|                           | $As(OC_2H_3)_3(+O_2)$               | LP                     | 700-730                        | Trench doping                       |
|                           | $AsO(OC_2H_3)_1 + O_2$              | LP                     | 700-730                        | Trench doping                       |
| Sb                        | $SbCl_{2} + O_{2}$                  | AP                     | 500                            | Diffusion source                    |
|                           | $Sb(C_2H_5)_1 + O_2$                | AP                     | 250-500                        | Dielectric                          |
| Sn                        | $Sn(C_2H_3)_4$                      | AP                     | 700                            | Diffusion source for GaA            |
| 1. m. 1                   | Sn(CH <sub>3</sub> ) <sub>4</sub>   | AP                     | 500                            | Diffusion source for GaA            |
| Zn                        | $Zn(C_2H_3)_2$                      | AP                     | 700                            | Diffusion source for GaA            |

\*See the source of this table for references to the individual processes.

Source: F.S. Becker and S. Röhl, J. Electrochem. Soc. 134, p. 2923, 1987. Reprinted with the permission of the publisher, The Electrochemical Society, Inc.

 $PH_3/SiH_4$  ratio. However, when they are used together, an interaction occurs, and an increase in the diborane ratio causes a reduction in the amount of phosphorus deposited (61). Also, the rate-temperature dependence will generally be different for the various dopants and the oxide so that as temperature is changed, for fixed inputs, the oxide composition will change.

Typically, with an SiO<sub>2</sub> deposition having a small percentage of boron and a small percentage of phosphorus (BPSG for borophososilicate glass), satisfactory reflow can be achieved at temperatures of between 800°C and 900°C. Fig. 4.11 shows a cross section of reflowed BPSG glass. In this case, a 5500 Å film of BPSG (4% B, 4% P) was deposited over 5000 Å thick polysilicon leads, reflowed at 800°C, and densified at 900°C in wet nitrogen. There is a limit to how much the reflow temperature can be reduced since phosphorus contents above 7%–8% cause corrosion problems with aluminum leads and boron contents above about 4% give a glass unstable in high humidity (62). In some cases, the as-deposited film may be unstable and require additional high-temperature heat treatment to stabilize the composition. For example, in using PH<sub>3</sub> +  $B_2H_6$  in conjunction with SiH<sub>4</sub> to form a glass with over about 3% boron and phosphorus, the boron may precipitate as boric oxide (63). For some compositions, annealing in a particular temperature range may cause a phase separation. Alternative compositions using arsenic or germanium instead of boron have been examined (62), but the ease of preparing BPSG makes it the current frontrunner for interlevel applications.

Amorphous CVD silicon nitride films are deposited in the 800°C-900°C temperature range and are generally referred to as "high-temperature nitride." Such nitride can be quite close in composition to Si<sub>3</sub>N<sub>4</sub>, although silicon-rich films can be produced. The other "nitride" is made by plasma deposition at temperatures in the 200°C-300°C range and is referred to as "plasma nitride." Its composition is not stoichiometric, but its properties are close enough to those of Si<sub>3</sub>N<sub>4</sub> to be very useful.

It was established very early on that, unlike SiO<sub>2</sub>, little sodium transport occurred in silicon nitride films. Thus, if they could be used instead of or in conjunction with SiO<sub>2</sub>, considerable silicon device protection against sodium migration could be achieved. However, because of the Si-Si<sub>3</sub>N<sub>4</sub> interface properties, it was found that where  $Q_f$  or  $N_a$  were important, the nitride could seldom be used in direct contact with silicon. Silicon nitride is relatively impermeable to oxygen; however, it will oxidize, although considerably slower than will silicon. A comparison is shown in Fig. 4.12, from which it can be seen that the differential is enough that a nitride covering of modest thickness can be used to mask areas of silicon that are to remain oxide free. Unlike SiO2, since silicon nitride provides an efficient barrier against gallium diffusion, it can be used as a diffusion mask when gallium diffusions are made. Of more significance is the fact that it can be used to cap GaAs and prevent gallium from escaping from the GaAs arsenide surface at high temperatures.

The common reaction used for preparing high-temperature nitride is (64, 65)

$$3SiH_4 + 4NH_3 \rightarrow Si_3N_4 + 12H, \qquad 4.17$$

4.6.4 CVD Silicon Nitride

#### FIGURE 4.11

Example of how reflowing smooths out sharp corners left after a CVD layer is applied over leads. (*Source:* Photograph courtesy of Novellus Systems, Inc., San Jose, California.)



At 700°C, the deposition rate is only a few Å per minute, but by 900°C, the rate can, with appropriate flow rates, be a few thousand Å per minute (66). Temperatures above 1000°C–1100°C are not used because the higher temperatures favor localized crystallite formation in the amorphous film. In order to ensure that good stoichiometry is achieved, it is customary to use a substantial excess of ammonia—for example, a silane-to-ammonia ratio of 1:15 to 1:20. Hydrogen is commonly used as a carrier gas, but nitrogen or an inert gas such as helium has also been used (67). Various properties of silicon nitride are listed in Table 4.3. Also included for comparison are properties of nitride prepared by the PECVD process discussed in the next section.

Other silicon-bearing species such as  $SiCl_4$  (68, 69) and  $SiH_2Cl_2$  (27) can also be combined with ammonia to give silicon nitride. The overall reaction for  $SiCl_4$  is

$$3SiCl_4 + 4NH_3 \rightarrow Si_3N_4 + 12HCl$$
 4.18

As is true in most of the surface reactions involved in CVD processing, the intermediate steps are not totally clear, but  $SiCl_4$  and NH<sub>3</sub> begin reacting at room temperature or below, perhaps first by

$$SiCl_4 + 8NH_3 \rightarrow Si(NH_2)_4 + 4NH_4Cl$$
 4.19a

or by

$$SiCl_4 + 6NH_3 \rightarrow SiN_2H_2 + 4NH_4Cl$$
 4.19b



FIGURE 4.12

Oxidation rate of silicon and high-temperature silicon nitride in a wet oxygen atmosphere. (Source (a): Data from Bruce Deal, J. Electrochem. Soc. 125, p. 576, 1978. Source (b): Data from I. Fränz and W. Langheinrich, Solid-State Electronics 14, p. 499, 1971.)

| TA | BL | E 4 | .3 |
|----|----|-----|----|
| 10 |    |     |    |

Properties of Silicon Nitride

| Property                                         | High-Temperature<br>CVD | PECVD*     | Reference |
|--------------------------------------------------|-------------------------|------------|-----------|
| Density (grams/cm <sup>3</sup> )                 | 2.8-3.1                 | 2.4-2.8    | (1, 2)    |
| Hardness (Knoop)                                 | 1000-3500               |            | (3)       |
| (Moh)+                                           | 9-                      |            | (3)       |
| Stress (10 <sup>9</sup> dynes/cm <sup>2</sup> )‡ | 12-18                   | -2 to $+5$ | (1, 2)    |
| Expansion coefficient (10 %C)                    | 4                       | 4-7        | (1)       |
| Refractive index                                 | 2.0\$                   | 2.0-2.2    | (3, 2)    |

\*PECVD properties can change substantially with deposition conditions. The values given here are typical ranges. †Decreases to near 7 as deposition temperature drops from 900°C to 800°C.

 $\ddagger + = tension; - = compression.$ 

\$As film becomes silicon rich, the index will increase, reaching the value for silicon in limit of no nitrogen.

References: (1) J.R. Hollahan and R.S. Rosler, p. 350, in John L. Vossen and Werner Kern, eds., Thin Film Processes, Academic Press, New York, 1978. (2) A.C. Adams, Solid State Technology, p. 135, April 1983. (3) Kenneth E. Bean et al., J. Electrochem, Soc. 114, p. 733, 1967.

Then, as the temperature increases, these reaction products break down and rearrange to eventually give  $Si_3N_4$  (68). Film deposition over the 550°C–1200°C temperature range has been reported, but temperatures of around 900°C are typical. The substantial amount of HCl produced by the reaction in Eq. 4.18 has been reported to help clean the deposition equipment and the wafer surface and to minimize ion contamination (70).

While all of the CVD nitride processes just described were originally done at atmospheric pressure, commercial low-pressure deposition equipment (LPCVD) using SiH<sub>2</sub>Cl<sub>2</sub>-NH<sub>3</sub> and SiH<sub>4</sub>-NH<sub>3</sub> feed gases can also be used (27). By using ammonia and either SiCl<sub>4</sub> or SiF<sub>4</sub> in an LPCVD system operating at temperatures of from 1400°C to 1500°C, deposits of substantially pure  $\alpha$ -Si<sub>3</sub>N<sub>4</sub> can be obtained (71). Such deposits are, however, not applicable to Si or GaAs IC manufacturing because of the high temperatures involved (and perhaps because of the properties of crystalline silicon nitride, which have not been extensively studied).

4.6.5 Plasma Silicon Nitride

Silicon nitride deposition from a plasma was first reported in 1965 and used silane and ammonia in an RF-induced plasma (54). In addition, silane-nitrogen (72),  $N_2$ -SiBr<sub>4</sub> in a DC discharge plasma (54), and  $N_2$ -Sil<sub>4</sub> in a microwave afterglow (73) have also been reported. The common reactants now used are silane and ammonia, with argon or helium as a carrier/diluent. The first deposition equipment (reactor) was either for single-slice processing or of tubular design that held only a few slices and thus was not very amenable to volume production. However, a parallel-plate RF plasma deposition machine described in 1974 allowed production quantities to be economically processed and opened the way for the widespread application of plasma deposition processing<sup>19</sup> in the semiconductor industry (33).

Silicon nitride film production using the parallel-plate concept was apparently the earliest serious application of PECVD to the semiconductor industry (33, 74, 75). The main advantage is that deposition is at a much lower temperature than is straight CVD so that it can be used to cover, for example, metallized wafers. As in the case of the plasma oxide, however, many of the properties are considerably different from those of high-temperature CVD. Some of them were listed in Table 4.3, where they can be compared with the high-temperature values. Also, as in PECVD oxide, substantial hydrogen will be incorporated in the film when silane and/or ammonia are used.

Applications of polysilicon to IC construction include its use for leads and MOS gate electrodes (see Chapter 10 for more details), as a fuse material in making ROMs (read-only memories), as load resistors in some MOS circuits, and as a diffusion source for emitter or sources/drains. In addition, active devices will occasionally be made in thin-film polysilicon (76, 77). All of these applications require layers that are generally less than a micron thick so that relatively slow deposition rates (generally associated with lower temperatures) are quite acceptable.

Polycrystalline silicon (polysilicon) occurs on occasion when the growing of single-crystal epitaxial silicon is attempted, and it will generally have rather large grain size. These occurrences are sporadic and undesirable. When polysilicon films are to be used in their own right, a uniform and generally small grain size is required, which usually implies a considerably lower deposition temperature than that used for epitaxy. Further, the applications themselves normally require relatively low-temperature depositions in order to minimize diffusion and/or wafer damage. In principle, any of the silicon epitaxy reactors discussed in Chapter 7 can be used, along with any of the reactions listed. Typically, however, for such thin applica-

# 4.6.6 APCVD and LPCVD Polysilicon

148

<sup>19</sup> Tubular plasma etch equipment for photoresist removal was already widely used.

tions, low-pressure tube depositions with silane as the silicon source are now used. An LPCVD reactor provides for a very economical deposition (78), and silane decomposition proceeds at temperatures as low as 400°C. When much thicker polycrystalline layers are required—for example, in dielectrically isolated wafers (79)—speed of deposition, as well as an inexpensive feedstock is desirable, and the process used is much like that of high-temperature epitaxy. Typically, the hydrogen reduction of SiCl<sub>4</sub> or SiHCl<sub>3</sub> at temperatures as high as 1200°C in pancake-type reactors is used.

The deposition rate for LPCVD polysilicon depends on temperature, silane partial pressure, and whether or not a doping gas such as phosphine, arsine, or diborane is present. Fig. 4.13 shows growth rates r versus deposition temperature that have been reported. These rates can be characterized by

$$r = a e^{-E/RT}$$
 4.20

where E is typically about 35 kcal/mol and which, as discussed earlier, is indicative of operation in the kinetically controlled region. Below about 580°C, the deposits are amorphous; above this temperature, the texture, grain size, and orientation depend on deposition conditions, with the grains generally increasing in size with deposition temperature (80–82). Fig. 4.14 shows the range over which var-



FIGURE 4.13

LPCVD polysilicon growth rate versus 1/T. (Source (a): Data from Richard S. Rosler, Solid State Technology 63, pp. 63-70, April 1977. Source (b): Data from G. Harbeke et al., J. Electrochem. Soc. 131, p. 675, 1984. Source (c): Data from P. Joubert et al., ibid., 134, p. 2541, 1987.)

#### FIGURE 4.14

Morphology versus temperature of silicon deposited on SiO<sub>2</sub> from silane. (*Source:* P. Joubert et al., *J. Electrochem. Soc. 134*, p. 2541, 1987. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



ious crystallographic features have been reported to exist versus deposition temperature and silane partial pressure. By using molecular beam epitaxial techniques (evaporation of silicon in a very hard vacuum), polycrystalline silicon depositions down to 400°C have been observed (83). The CVD amorphous phase is unstable at higher temperatures and, if deposited, is generally converted to polysilicon during subsequent processing. One hour in nitrogen at 800°C significantly reduces the amount of amorphous material (80), while 30 minutes at 900°C is reported to give very stable, fine-grained material (81).

A problem with depositing polycrystalline silicon on amorphous substrates such as  $SiO_2$  that was recognized early on is that of obtaining uniform nucleation and coverage (84). The details of the beginning of nucleation have since been studied for silicon on both  $SiO_2$  and silicon nitride (85). This problem is quite pronounced with high-temperature  $SiCl_4$  depositions but is hardly noticeable with low-temperature silane depositions. In fact, when high-temperature SiCl<sub>4</sub> depositions are desired for their higher rates, beginning with a short silane pyrolysis is sometimes helpful (86).

The addition of the doping gases phosphine or arsine to silane reduces the deposition rate, usually to impractical levels, as well as produces greater variation in layer thickness, Thus, n-type doping is done after deposition, either by diffusion or by ion implantation (see Chapters 8 and 9 for details of these processes) (87–89). The rate decrease appears to be caused by the dopant atoms that compete for sites where the silane reduction would normally take place. Diborane causes an increase in deposition rate (87) but not enough to cause any process problems, so p-doping can be done directly in LPCVD reactors.

Besides the problem of introducing the doping impurity into the polysilicon, the problem of the behavior of the dopant and carriers at the grain boundaries must also be addressed. The resistivity of a poly layer is never as low as that of a single-crystal layer of the same thickness even when the total impurity concentration is the same (90–92). One possible reason is that doping impurities that segregate at grain boundaries are electrically inactive. Another reason is that grain boundaries trap carriers, which cause potential barriers at the boundaries and, in turn, reduces the carrier mobility in polycrystal-line material. The contribution of each of these effects is still in question, but, in general, the larger the grains, which implies fewer boundaries, the lower the resistivity.

Polysilicon has been plasma deposited using both dichlorosilane  $(SiH_2Cl_2)$  (93) and silane (94) as feed material. The amorphous-poly temperature regions appear comparable to those of LPCVD poly. The stress in PECVD films, both poly and low-temperature amorphous, is usually compressive (93, 95). In principle, the rate should be less sensitive to temperature, and doping should be easier. However, despite the wide use of plasmas to produce hydrogenated amorphous silicon films. LPCVD continues to be the prime technique for IC poly applications.

4.7

OTHER CVD FILM MATERIALS The materials already described include those most commonly used in IC fabrication, but by no means include all materials that can be deposited chemically from the vapor. Table 4.4 lists other less commonly deposited materials that have been reported. Some materials, such as tantalum pentoxide, have been examined for their dielectric properties. Others, such as Ge<sub>3</sub>N<sub>4</sub>, have been of interest as possible passivating layers for germanium or gallium arsenide pn junctions.

4.6,7 Plasma Polysilicon

Elm Exhibit 2159, Page 170

| Material           | Reactants                                       | Method     | Reference |
|--------------------|-------------------------------------------------|------------|-----------|
| Aluminum oxide     | $Al_3(CH_3)_6^* + O_7$                          | CVD        | (1)       |
|                    | Al(OC <sub>1</sub> H <sub>2</sub> ),† pyrolysis | CVD        | (2)       |
|                    | See reference 2 for other choices.              |            |           |
| Tantalum pentoxide | $TDDAA\ddagger + O_2$                           | CVD        | (3)       |
| the second second  | $TaCl_{s} + H_{s} + CO_{s}$                     | CVD        | (4)       |
| Aluminum nitride   | AICI, - NH, pyrolysis                           | CVD        | (5)       |
| Boron nitride      | $B_{1}H_{n} + NH_{1}$                           | CVD        | (6)       |
|                    | $B_1H_6 + NH_1$                                 | Plasma CVD | (7)       |
| Germanium nitride  | $GeCl_4 + NH_3$                                 | CVD        | (8, 9)    |
|                    | $GeH_4 + NH_1$                                  | Plasma CVD | (10)      |
| Silicon carbide    | $SiCl_4 + C_3H_8$                               | CVD        | (11, 12)  |
| Trimethyl aluminum |                                                 |            |           |

#### TABLE 4.4

Less Commonly

"Trimethyl aluminum.

\*Aluminum isopropoxide.

\$Tantalum dichloro-diethyoxy-acetylacetonate, TaCl2 (OC2H,)2C3H7O2.

References: (1) M.T. Duffy and Werner Kern, RCA Review 31, p. 754, 1970. (2) Werner Kern and Vladimir S. Ban. Chap. 111-2, in John L. Vossen and Werner Kern, eds., Thin Film Processes, Academic Press, New York, 1978.
(3) E. Kaplan et al., J. Electrochem. Soc. 123, p. 1570, 1976. (4) W.H. Knausenberger and R.N. Tauber, J. Electrochem. Soc. 120, p. 927, 1973, (5) T.L. Chu and R.W. Kelm, Jr., J. Electrochem. Soc. 122, p. 995, 1975. (6) M.J. Rand and J.F. Roberts, J. Electrochem. Soc. 115, p. 423, 1968. (7) S.B. Hyder and T.O. Yep, J. Electrochem. Soc. 123, p. 1721, 1976. (8) H. Nagai and T. Niimi, J. Electrochem. Soc. 115, p. 671, 1968. (9) Takehisa Yashiro, J. Electrochem. Soc. 119, p. 780, 1972. (10) D.B. Alford and L.G. Meiners, J. Electrochem. Soc. 134, p. 979, 1987. (11) K.E. Bean and P.S. Gleim, J. Electrochem. Soc. 114, p. 1158, 1967. (12) K. Furukawa et al., p. 231, in Ext. Abst. of Conference on Solid State Devices and Materials, Tokyo, 1987.

4.8

SPIN-ON GLASSES

Spin-on glasses, often called organic silicates, can be an alkoxysilane, such as tetraethoxysilane, in a solvent, an acyloxysilane in a solvent, or a silicon polymer such as a polysiloxane in a solvent. In each case, the liquid may be applied as a thin film to the wafer in the same manner as photoresist (see Chapter 5) and then heated to convert it to a silica film. When tetraethoxysilane (TEOS) based solutions are used, a catalytic agent (initiator) along with heat is ordinarily used to convert the TEOS to silica. When a doped film is made, the doping agent itself—for example, POCl<sub>3</sub>—may suffice as the initiator (96). By reacting TEOS with acetic anhydride (acetyl oxide) in a suitable solvent, an acyloxysilane results. These materials can be decomposed into silica in the 200°C temperature range without the use of an initiator (97). The use of polysiloxanes increases the shelf life of a spin-on over that provided by acyloxysilanes and provides for thicker films (97). However, bake temperatures of from 600°C to 900°C are required (98).

These materials were originally used as diffusion dopant sources (see Chapter 8) and thus had to also have a doping impurity added to them. They have also been used to mask diffusions and for leads protection. An acyloxysilane rather than a silicon polymer is more appropriate as a leads protector since heating to about 200°C will drive off the solvent and most water. (Complete densification does not occur, however, until it is heated to about 800°C.) More recently, spin-ons have been examined for use as an interlevel insulator. In any of these applications, they must remain crack free over the operating temperature range and not produce enough stress in the wafer to introduce defects. The primary advantage of spin-on glasses is the simplicity of applying a layer to a wafer, but when they are used as an interlevel insulator, they have the additional advantage of topography planarization.

While most of the spin-on glasses are silicon based, formulations exist that use other materials. One, for example, decomposes into  $TiO_2$  instead of  $SiO_2$  upon heating (99) and is useful when a dielectric constant higher than that of  $SiO_2$  is desired. For boron doping applications, a boron-nitrogen based polymer that reacts with moisture and oxygen to give  $B_2O_3$  has been developed (100).

By using a glass frit applied by squeegee or electrophoretic deposition and then fired, a wider glass composition range than can be obtained from any of the other methods described is available. Frit also allows thick layers to be applied easily and holes and grooves in the wafer to be filled. However, the firing temperature must be high enough for the glass to soften and flow and hence severely restricts its use. The most common application, and one not related to ICs, is the covering of exposed junctions of high-voltage discrete devices.20 Thus, the composition is chosen not only on the basis of expansion coefficient and softening point but also on passivating characteristics. Typically, a zinc-borosilicate glass is used. The general procedure is to formulate the glass, crush it, ball-mill it to a 5-10 µm powder, and then either mix it with a binder and squeegee it over the surface or disperse it along with suitable ions in a dielectric liquid such as isopropyl alcohol and electrophoretically deposit it (101-103).

4.9 GLASS FRITTING

<sup>&</sup>lt;sup>20</sup>Because of low breakdown voltages associated with the curvature of planar pn junctions, high-voltage junction devices are usually mesa structures and would have exposed junctions were it not for a coating of silicone or deposited glass.

SAFETY 4

Many of the reactants used in chemical vapor deposition, as well as some of the reaction products, are very poisonous and/or pyrophoric and/or explosive. Thus, care must be taken to ensure that leaks do not exist in the piping or deposition chamber and that the CVD reaction products are safely burned or vented. When new processes are being developed, all reactants should be carefully studied with respect to health hazards. In particular, metallorganics should be viewed with great caution. CVD equipment should, in general, be housed in rooms with high-capacity emergency exhaust facilities (scram rooms), and gas cabinet enclosures should be continuously exhausted.

Silane and hydrogen are both very explosive. If a small leak occurs in a silane line, the issuing silane will generally spontaneously ignite and burn, leaving a white silica powder telltale mark. However, if there is not enough air for the silane to burn as, for example, in a small enclosure, probably nothing will happen until the enclosure door is opened, at which time a violent explosion will occur. At least one death has occurred due to an explosion following the raising of the door of a fume hood with the blower turned off in which silane had been flowing all night. To minimize the likelihood of such occurrences, silane is usually supplied diluted with an inert gas such as argon to the point where an explosion is unlikely.

Hydrogen is sometimes used as a carrier gas and is explosive when it is mixed with air in concentrations by volume of from 4% to 75% hydrogen (104). It will not spontaneously ignite at room temperature but has an auto-ignition temperature of about 585°C. Being lighter than air, hydrogen tends to collect in stagnant volumes near room ceilings. Vents and good air circulation will minimize the risk of explosion in the event that a leak does occur. Hydrogen monitors should be installed wherever hydrogen is likely to escape into confined rooms. While wafer-fab facilities will generally have piped-in hydrogen, laboratories often use individual cylinders. (For precautions in the handling of hydrogen cylinders, the reader should consult reference 104.)

Caution should also be exercised in the design of the piping system so that no possibility of the feeding of a gas under high pressure back into a cylinder of low-pressure gas exists. To be absolutely safe, there should be no direct pipe connections. As an example of the seriousness of such episodes, a cylinder of silane became contaminated with nitrous oxide, presumably from cross-connected piping, and three people were killed as they tried to bleed off the mixture from the silane cylinder (105). It has sometimes been the practice to depend on pressure regulators to reduce the pressure to

Elm Exhibit 2159, Page 173

the point where no cross contamination should occur. However, if the regulators fail, as they do on occasion, then contamination can still occur. Even feeding the various gases separately into an atmospheric pressure reaction chamber is no absolute assurance of safety. If the exhaust vent clogs, pressure can build up in the chamber and, unless the chamber explodes first, feed gases back into lowpressure lines.

High-pressure gas cylinders and cylinders containing explosive or poisonous gases should be handled so that the likelihood of the cylinder falling and breaking off the valve does not exist. If cylinder venting should be necessary and there is any possibility of an explosion, then a remote system to open the valve, coupled with sandbags and personnel flack clothing, should be used.

While not normally considered a problem, fine silicon dust such as might be produced by gas phase reactions is explosive when it is mixed with air. The likelihood of such an explosion is minimal for particle sizes above perhaps 10  $\mu$ m (106).

Halide compounds such as SiCl<sub>4</sub>, SiHCl<sub>3</sub>, and SiH<sub>2</sub>Cl<sub>2</sub> are more noxious than poisonous in small quantities, although in the presence of moisture from the air or body membranes, HCl (or HBr or HF) will be formed and will cause respiratory tract or lung damage. SiHCl<sub>3</sub> is flammable, and SiH<sub>2</sub>Cl<sub>2</sub> may self-ignite if a bottle containing it is broken. Once it is ignited, there is no good way of fighting it (107). The sometimes recommended use of a coarse water sprinkle does not necessarily contain it and produces large amounts of HCl. Phosphine, diborane, arsine, and stibine, all occasionally used as an oxide dopant, are very toxic and, because of the toxicity, are generally supplied diluted with an inert carrier gas. Their reported safe limits in air being breathed are given in Table 4.5. Also shown is the limit for ammonia, which is about three orders of magnitude higher.

TABLE 4.5

Safe Limits for Some Gases Used in CVD

| Gas       | PEL* (ppm) | IDLH† (ppm) |
|-----------|------------|-------------|
| Arsine    | 0.05       | 6           |
| Diborane  | 0.1        | 40          |
| Phosphine | 0.3        | 200         |
| Stibine   | 0.1        | 40          |
| Ammonia   | 100        |             |

\*PEL = Permissible exposure limit (maximum breathing air concentration for an 8 hour day).

**†IDLH** = Immediately dangerous to life or health (maximum limit to which one can be exposed for 30 minutes and survive with no impairment).

Source: NIOSH/OSHA Pocket Guide to Chemical Hazards.



- Sputtering or evaporation are best suited for metal depositions.
- Chemical vapor deposition is most commonly used for dielectric depositions.
- In a few specialized applications, either glass frit or spin-on organic silicates are used instead of CVD.
- Low-pressure CVD is used to minimize gaseous diffusion effects and allow closer

# PROBLEMS 4

- 1. Based on Eq. 4.1, describe an experiment that will allow the separation of intrinsic stress and stress introduced by differential thermal expansion. Comment: Beware of changing experimental conditions such as deposition temperature or substrate material that may, in turn, change  $\sigma_{ij}$ in an unpredictable fashion.
- Calculate the mean free path in an evaporation chamber at a chamber pressure of 10 \* torr. Consider the gas molecule diameter to be 4 Å.
- 3. Which of the following materials would you suggest be deposited by sputtering and which by evaporation?

Aluminum Platinum Aluminum oxide Silicon Silicon nitride Explain your choices.

- For a process that is kinetically controlled, plot rate versus temperature if the rate is 1 μm/minute at 1100°C and the activation energy is 35 kcal/mol.
- 5. Plot the following CVD temperature and rate

spacing of wafers inside a deposition chamber.

- Plasma-deposited materials are generally not stoichiometric and may contain a high percentage of reactant products.
- Sputtering and evaporation can deposit material on cold wafers. CVD and plasma CVD generally can deposit only on heated wafers.

data and mark the region that is diffusion controlled:

| T (°C) | # (µm/min.) |  |
|--------|-------------|--|
| 1250   | 2.2         |  |
| 1200   | 2.0         |  |
| 1150   | 2.1         |  |
| 1100   | 1.8         |  |
| 1050   | 1.3         |  |
| 1000   | 0.8         |  |
| 950    | 0.6         |  |

- 6. Since in a low-pressure deposition system the gaseous diffusion constant is much larger than it is in an atmospheric system, would deposition in the diffusion-controlled region be expected to also be much higher? Explain your answer.
- Some CVD reactor designs in which the chamber is a long tube with the exhaust at one end have several gas inlets positioned along the length of the tube. What would be a possible advantage of such an inlet system?
- Sketch the gas piping for a high-temperature silicon nitride deposition system. (Be sure to read the safety section.)

# REFERENCES 4

- J.L. Vossen and J.J. Cuomo. Chap. II-1. "Glow Discharge Sputter Deposition." in John L. Vossen and Werner Kern, eds., *Thin Film Processes*, Academic Press, New York, 1978.
- K.G. Geraghty and L.F. Donaghey, "Kinetics of the Reactive Sputter Deposition of Titanium Oxides," J. Electrochem. Soc. 123, pp. 1201–1207, 1976.
- J. Kortlandt and L. Oosting, "Deposition and Properties of RF Reactively Sputtered SiO<sub>2</sub> Layers," Solid State Technology, pp. 153–159, October 1982.
- C.J. Mogab et al., "Effect of Reactant Nitrogen Pressure on the Microstructure and Properties of Reactively Sputtered Silicon Nitride Films," J. Electrochem. Soc. 122, pp. 815–822, 1975.
- S. Suyama et al., "The Effect of Oxygen-Argon Mixing on Properties of Sputtered Silicon Dioxide Films," J. Electrochem. Soc. 134, pp. 2260– 2264, 1987.
- G.J. Kominiak, "Silicon Nitride by Direct RF Sputter Deposition," J. Electrochem. Soc. 122, pp. 1271–1273, 1975.
- T.C. Tisone, "Low Voltage Triode Sputtering with a Confined Plasma," Solid State Technology, pp. 34–40 and references therein, December 1975.
- Vance Hoffman, "High Rate Magnetron Sputtering for Metallizing Semiconductor Devices," Solid State Technology, pp. 57-61 and references therein, December 1976.
- Ted Van Vorous, "Planar Magnetron Sputtering: A New Industrial Coating Technique," Solid State Technology, pp. 62–66, December 1976.
- Various chapters, in Carroll F. Powell, Joseph H. Oxley, and John M. Blocher, Jr., eds., Vapor Deposition, John Wiley & Sons, New York, 1966.
- George R. Thompson, Jr., "Ion Beam Coating: A New Deposition Method," Solid State Technology, pp. 73–77, December 1978.
- E.A.V. Ebsworth, Volatile Silicon Compounds, Macmillan, New York, 1963.
- 13. Vladimir S. Ban, "Transport Phenomena Mea-

surements in Epitaxial Reactors," J. Electrochem. Soc. 125, pp. 317-320, 1978.

- R. Takahashi et al., "Gas Flow Pattern and Mass Transfer Analysis in a Horizontal Flow Reactor for Chemical Vapor Deposition," J. Electrochem. Soc. 119, pp. 1406–1412, 1972.
- S. Rhee et al., "On Three-Dimensional Transport Phenomena in CVD Processes," J. Electrochem. Soc. 134, pp. 2552–2559, 1987.
- L.J. Giling, "Gas Flow Patterns in Horizontal Epitaxial Reactor Cells Observed by Interference Holography," J. Electrochem. Soc. 129, pp. 634–644, 1982.
- Vladimir S. Ban and Stephen L. Gilbert, "The Chemistry and Transport Phenomena of Chemical Vapor Deposition of Silicon from SiCl<sub>4</sub>," J. Crystal Growth 31, pp. 284–289, 1975.
- J. Monkowski and J. Stach, "System Characterization of Planar Source Diffusion," Solid State Technology, pp. 38–43, November 1976.
- C.W. Manke and L.F. Donaghey, "Analysis of Transport Processes in Vertical Cylinder Epitaxy Reactors," J. Electrochem. Soc. 124, pp. 561-569, 1977.
- P.C. Rundle, "The Epitaxial Growth of Silicon in Horizontal Reactors," Int. J. Electronics 24, pp. 405–413, 1968.
- F.C. Eversteyn et al., "A Stagnant Layer Model for the Epitaxial Growth of Silicon from Silane in a Horizontal Reactor," J. Electrochem. Soc. 117, pp. 925–931, 1970.
- S.E. Bradshaw, "The Kinetics of Epitaxial Silicon Deposition by the Hydrogen Reduction of Chlorosilanes," Int. J. Electronics 21, pp. 205– 227, 1966.
- J.C. Gillis et al., "Fluid Mechanical Model for CVD in a Horizontal RF Reactor," pp. 21–29, in CVD '84 (Vol. 84-6), Electrochemical Society, Pennington, N.J., 1984.
- W.H. Shepherd, "Vapor Phase Deposition and Etching of Silicon," J. Electrochem. Soc. 112, pp. 988–994, 1965.
- E. Fujii et al., "A Quantitative Calculation of the Growth Rate of Epitaxial Silicon from SiCl<sub>4</sub> in a

Barrel Reactor," J. Electrochem. Soc. 119, pp. 1106–1113, 1972.

- H.A. Lord, "Convective Transport in Silicon Epitaxial Deposition in a Barrel Reactor," J. Electrochem. Soc. 134, pp. 1227–1235, 1987.
- Richard S. Rosler, "Low Pressure CVD Production Processes for Poly, Nitride, and Oxide," *Solid State Technology*, pp. 63–70, April 1970.
- William Velander and Daniel White, Jr., "Induced Convective Effects on Intrawafer Uniformity in LPCVD," J. Electrochem, Soc. 134, pp. 951–956, 1987.
- K.F. Jensen and D.B. Graves, "Modeling and Analysis of Low Pressure CVD Reactors," J. Electrochem. Soc. 130, pp. 1950–1957, 1983.
- Karl F. Roenigk and Klavs F. Jensen. "Low Pressure CVD of Silicon Nitride," J. Electrochem. Soc. 134, pp. 1777–1785, 1987.
- Makarand G. Joshi, "Modeling of LPCVD Reactors," J. Electrochem. Soc. 134, pp. 3118– 3122, 1987.
- M.L. Hammond, "Introduction to Chemical Vapor Deposition," *Solid State Technology*, pp. 61–65, December 1965.
- A. Reinberg, "RF Plasma Deposition of Inorganic Films for Semiconductor Applications," J. Electrochem. Soc. 121, p. 85C, abst. no. 6, 1974, or Electrochem. Soc. Ext. Abst. 74-1, abst. no. 6, 1974.
- Russ A. Morgan, Plasma Etching in Semiconductor Fabrication, Elsevier, New York, 1985.
- S. Rhee and J. Szekely, "The Analysis of Plasma-Enhanced Chemical Vapor Deposition of Silicon Films," J. Electrochem. Soc. 133, pp. 2194– 2201, 1986.
- Stephen Dzioba et al., "Downstream Plasma Induced Deposition of SiN, on Si, InP, and In-GaAs," J. Electrochem. Soc. 134, pp. 2599– 2603, 1987.
- John E. Spencer et al., "New Directions in Dry Processing Using the Flowing Afterglow of a Microwave Discharge," *Electrochem. Soc. Ext. Abst.* 86–2, abst. no. 285, 1986.
- P.D. Garner et al., "Characteristics of the Low-Temperature-Deposited SiO<sub>2</sub> Ga<sub>0.47</sub>In<sub>0.53</sub> as Metal/Insulator/Semiconductor Interface," *Thin Solid Films* 117, pp. 173–190, 1984.
- C.H.J. v.d. Brekel and P.J. Severin, "Control of the Deposition of Silicon Nitride by 2573 Å Ra-

diation," J. Electrochem. Soc. 119, pp. 372-376, 1972.

- J.W. Peters et al., "Low Temperature Photo-CVD Silicon Nitride: Properties and Applications," *Solid State Technology*, pp. 121–126, September 1980.
- "LPCVD System Accommodates Different Sizes and Types of Substrates," Semiconductor International, p. 198, April 1988.
- H.A. Tanner and L.B. Lockart, Jr., "Germane Reflection Reducing Coatings for Glass," J. Opt. Soc. Am. 36, pp. 701–706, 1946.
- E.L. Jordon, "A Diffusion Mask for Germanium," J. Electrochem. Soc. 108, pp. 478–481, 1961.
- Julius Klerer, "On the Mechanism of the Deposition of Silica by Pyrolytic Decomposition of Silanes," J. Electrochem. Soc. 112. pp. 503–506. 1965.
- V.D. Wohlheiter and R.A. Whitner, "High Production System for the Deposition of Silicon Dioxide," *Electrochem. Soc. Ext. Abst.* 75-1, abst. no. 180, pp. 424-425, 1975.
- W. Steinmaier and J. Bloem, "Successive Growth of Si and SiO<sub>2</sub> in Epitaxial Apparatus," J. Electrochem. Soc. 111, pp. 206–209, 1964.
- Myron J. Rand and John L. Ashworth, "Deposition of Silica Films on Germanium by the Carbon Dioxide Process," *J. Electrochem. Soc.* 113, pp. 48–50, 1966.
- N. Goldsmith and W. Kern, "The Deposition of Vitreous Silicon Dioxide Films from Silane," RCA Review 28, pp. 153–165, 1967.
- K. Strater and A. Mayer, "The Oxidation of Silane, Phosphine, and Diborane during Deposition of Doped-Oxide Diffusion Sources," pp. 469–480, in Rolf R. Haberecht and Edward L, Kern, eds., Semiconductor Silicon, Electrochemical Society, New York, 1969.
- T.L. Chu et al., "Silica Films by the Oxidation of Silane," Trans. Metall. Soc. AIME 242, pp. 532– 538, 1968.
- William J. Kroll, Jr, et al., "Formation of Silica Films on Silicon Using Silane and Carbon Dioxide," J. Electrochem. Soc. 122, pp. 573–578, 1975.
- A.K. Gaind et al., "Preparation and Properties of SiO<sub>2</sub> Films from SiH<sub>4</sub>-CO<sub>2</sub>-H<sub>2</sub>," J. Electrochem. Soc. 123, pp. 111–117, 1976.

- 53 A.K. Gaind et al., "Preparation and Properties of CVD Oxides with Low Charge Levels from SiH<sub>4</sub>-CO<sub>2</sub>-HCl-H<sub>2</sub> Systems," J. Electrochem. Soc. 123, pp. 238-246, 1976.
- H.F. Sterling and R.G.G. Swann, "Chemical Vapor Deposition Promoted by R.F. Discharge," *Solid-State Electronics* 8, pp. 653–654, 1965.
- E.P. van de Ven et al., "High Rate PECVD to Reduce Hillock Growth in Aluminum Interconnects," in Proc. 4th IEEE V-MIC Conference, 1987.
- Akira Takamatsu et al., "Plasma-Activated Deposition and Properties of Phosphosilicate Glass Film," J. Electrochem, Soc. 131, pp. 1865–1870, 1984.
- A.C. Adams et al., "Characterization of Plasma-Deposited Silicon Dioxide," J. Electrochem. Soc. 128, pp. 1545–1551, 1981.
- John R. Hollahan, "Deposition of Plasma Silicon Oxide Thin Films in a Production Planar Reactor," J. Electrochem. Soc. 126, pp. 930–934, 1979.
- 59. L. G. Meiners, "Electrical Properties of SiO<sub>2</sub> and Si<sub>1</sub>N<sub>4</sub> Dielectric Layers on InP," J. Vac. Sci. Technol. 19, pp. 373–379, 1981.
- 60. F.S. Becker and S. Röhl, "Low Pressure Deposition of Doped SiO<sub>2</sub> by Pyrolysis of Tetraethylorthosilicate (TEOS)," J. Electrochem. Soc. 134, pp. 2923–2931, 1987, and extensive references therein.
- Jerry E. Tong et al., "Process and Film Characterization of PECVD Borophosphosilicate Films for VLSI Applications," *Solid State Technol*ogy, pp. 161–170, January 1984.
- K. Nassau et al., "Modified Phosphosilicate Glasses for VLSI Applications," J. Electrochem. Soc. 132, pp. 409–415, 1985.
- W. Kern et al., "Optimized Chemical Vapor Deposition of Borophosphosilicate Glass Films," RCA Review 46, pp. 117–152, 1985.
- V.Y. Doo et al., "Preparation and Properties of Pyrolytic Silicon Nitride," J. Electrochem, Soc. 113, pp. 1279–1281, 1966.
- Tetsuya Arizumi et al., "Thermodynamical Analyses and Experiments for the Preparation of Silicon Nitride," Jap. J. Appl. Phys. 7, pp. 1021– 1027, 1968.
- 66. Kenneth E. Bean et al., "Some Properties of Va-

por Deposited Silicon Nitride Films Using the SiH<sub>4</sub>-NH<sub>3</sub>-H, System," J. Electrochem. Soc. 114, pp. 733-737, 1968.

- V.Y. Doo et al., "Property Changes in Pyrolytic Silicon Nitride with Reactant Composition Changes," J. Electrochem. Soc. 115, pp. 61–64, 1968.
- M.J. Grieco et al., "Silicon Nitride Films from SiCl<sub>4</sub> Plus NH<sub>3</sub>: Preparation and Properties," J. Electrochem. Soc. 115, pp. 525–531, 1968.
- V.D. Wohlheiter and R.A. Whitner. "A High Production System for the Deposition of Silicon Nitride." J. Electrochem. Soc. 119, pp. 945–948, 1972.
- E.A. MacKenna and P. Kodama, "The Suppression of Ionic Contamination during Silicon Nitride Deposition," J. Electrochem. Soc. 119, pp. 1094–1099, 1972.
- J.J. Gebhardt et al., "Chemical Vapor Deposition of Silicon Nitride," J. Electrochem. Soc. 123, pp. 1578–1582, 1975.
- R. Gereth and W. Scherber, "Properties of Ammonia-Free Nitrogen-Si<sub>1</sub>N<sub>4</sub> Films," J. Electrochem, Soc. 119, pp. 1248–1254, 1972.
- M. Shiloh et al., "Preparation of Nitrides by Active Nitrogen," J. Electrochem. Soc. 124, pp. 295–300, 1977.
- Richard S. Rosler et al., "A Production Reactor for Low Temperature Plasma-Enhanced Silicon Nitride Deposition," *Solid State Technology*, pp. 45-50, June 1976.
- A.K. Sinha et al., "Reactive Plasma Deposited Si-N Films for MOS-LSI Passivation," J. Electrochem. Soc. 125, pp. 601–608, 1975.
- M. Dutoit and F. Sollberger, "Lateral Polysilicon p-n Diodes," J. Electrochem, Soc. 125, pp. 1648–1651 and references therein, 1978.
- R. Pennell et al., "Fabrication of High Voltage Polysilicon TFTs on an Insulator," J. Electrochem. Soc. 133, pp. 2358-2361, 1986.
- Richard S. Rosler, "Low Pressure CVD Production Processes for Poly, Nitride, and Oxide," Solid State Technology, pp. 63–70, April 1970.
- K.E. Bean and W.R. Runyan, "Dielectric Isolation: Comprehensive, Current and Future," J. Electrochem. Soc. 124, pp. 5C-12C, 1977.
- T.I. Kamins, "Structure and Stability of Low Pressure Chemically Vapor-Deposited Silicon

Films," J. Electrochem. Soc. 125, pp. 927-932, 1978.

- G. Harbeke et al., "Growth and Physical Properties of LPCVD Polycrystalline Silicon Films," J. Electrochem. Soc. 131, pp. 675–682, 1984.
- P. Joubert et al., "The Effect of Low Pressure on the Structure of LPCVD Polycrystalline Silicon Films," J. Electrochem. Soc. 134, pp. 2541– 2545, 1987.
- Makoto Matsui et al., "Low Temperature Formation of Polycrystalline Silicon Films by Molecular Beam Deposition," J. Appl. Phys. 52, pp. 995–998, 1982.
- Earl G. Alexander and W.R. Runyan, "A Study of Factors Affecting Silicon Growth on Amorphous SiO, Surfaces," *Trans. Metall. Soc. AIME* 236, pp. 284–290, 1966.
- W.A.P. Claassen and J. Bloem, "The Nucleation of CVD Silicon on SiO<sub>2</sub> and Si<sub>3</sub>N<sub>4</sub> Substrates Part III," J. Electrochem. Soc. 128, pp. 1353– 1359, 1981, and Parts I and II referenced therein.
- V.J. Silvestri, "Growth Kinematics of a Polycrystalline Trench Refill Process," J. Electrochem. Soc. 133, pp. 2374–2376, 1986.
- F.C. Eversteyn and P.H. Put, "Influence of AsH<sub>3</sub>, PH<sub>4</sub>, and B<sub>3</sub>H<sub>6</sub> on the Growth Rate and Resistivity of Polycrystalline Silicon Films Deposited from a SiH<sub>4</sub>-H<sub>2</sub> Mixture," *J. Electrochem. Soc. (20, pp. 106–110, 1973)*
- B.S. Meyerson and W. Olbricht, "Phosphorus-Doped Polycrystalline Silicon Via LPCVD," J. Electrochem. Soc. 132, pp. 2361–2365, 1984. (See also the following article beginning on p. 2366.)
- Andrew Yeckel and Stanley Middleman. "A Model of Growth Rate Nonuniformity in the Simultaneous Deposition and Doping of a Polycrystalline Silicon Film by LPCVD," J. Electrochem. Soc. 134, pp. 1275–1281, 1987.
- M.M. Mandurah et al., "Phosphorus Doping of Low Pressure Chemically Vapor-Deposited Silicon Films," J. Electrochem. Soc. 126, pp. 1019–1023, 1979.
- J.R. Monkowski et al., "Comparison of Dopant Incorporation into Polycrystalline and Monocrystalline Silicon," *Appl. Phys. Lett.* 35, pp. 410–412, 1979.
- 92. Junichi Murota and Takashi Sawai, "Electrical Characteristics of Heavily Arsenic and Phos-

phorus Doped Polycrystalline Silicon," J, Appl. Phys. 53, pp. 3702-3708, 1982.

- T.I. Kamins and K.L. Chiang, "Properties of Plasma-Enhanced CVD Silicon Films," J. Elevtrochem. Soc. 129, pp. 2326–2331, 1982.
- W.R. Burger et al., in J. Bloem et al., eds., Proc. 4th European Conference on Chemical Vapor Deposition, The Netherlands, 1984.
- J.P. Harbison et al., "Effect of Silane Dilution on Intrinsic Stress in Glow Discharge Hydrogenated Amorphous Silicon Films," J. Appl. Phys. 55, pp. 946–951, 1984.
- K.M. Mar, "Diffusion in Silicon from a Spin-On p-Doped Silicon Oxide Film," J. Electrochem. Soc. 126, pp. 1252–1257, 1979.
- K.D. Beyer, "A New Paint-On Diffusion Source," J. Electrochem. Soc. 123, pp. 1556– 1560, 1976.
- Pei-Lin Pai et al., "Material Characteristics of Spin-On Glasses for Interlevel Dielectric Applications," J. Electrochem. Soc. 134, pp. 2829– 2834, 1987.
- Bulent E. Yoldas, "Diffusion of Dopants from Optical Coatings and Single Step Formation of Antireflective Coatings and p-n Junctions in Photovoltaic Cells," J. Electrochem. Soc. 127, pp. 2478-2481, 1980.
- B.H. Justice et al., "A Novel Spin-On Dopant," Solid State Technology, pp. 1553–1559, October 1984.
- A.H. Berman, "Glass Passivation Improves High Voltage Transistors," *Solid State Technol*ogy, pp. 29–32, March 1976.
- 102. Yutaka Misawa, "Properties of ZnO-B<sub>2</sub>O<sub>1</sub>-SiO<sub>2</sub> Glasses for Surface Passivation," J. Electrochem. Soc. 131, pp. 1862–1865, 1984.
- Masaru Shimbo et al., "Electrophoretic Deposition of Glass Powder for Passivation of High Voltage Transistors," J. Electrochem. Soc, 132, pp. 393–398, 1985.
- Matheson Gas Data Book, The Matheson Company, 1980.
- 105. Electronic News, p. 30, September 5, 1988.
- R.K. Eckhoff et al., "Ignitability and Explosibility of Silicon Dust Clouds," J. Electrochem. Soc. 133, pp. 2631–2637, 1986.
- David Ranier, "Dichlorosilane—Leak Spill and Fire Control," *Electrochem. Soc. Ext. Abst. 85–* 2, abst. no. 299, 1985.



# Lithography

### 5.1 INTRODUCTION

5.2

PHOTOMASKS 5.2.1 Masks for Optical Printing The steps required to go from a circuit function existing only in a designer's mind to a pattern etched in oxide or metal are shown in Fig. 5.1. First, the circuit must be designed, and then the size of the various circuit elements, from transistor bases to lead widths, must be determined and their exact positions on the chip established. All of this information is then converted into computer data files that can be used by the pattern generator (optical or E-beam) to determine which positions on the reticle to expose. A reticle is generally printed at  $5 \times$  or  $10 \times$  the size of the finished chip. Its image is then reduced to the correct size during printing onto the mask to be used for wafer printing. When an E-beam wafer writer is used to write the image on the wafer directly from the digitized information, as Fig. 5.1 shows, pattern generation and stepping operations are by-passed. When steppers are used for printing, a separate reduction step is ordinarily not needed.

Masks are not produced in the wafer-fab area but rather in a separate mask shop. Accordingly, only a general discussion of their fabrication will be given. However, since the mask quality drastically affects the wafer-fab yields, it is very important that methods of evaluating quality be considered in some detail.

Making masks for optical printing starts with square glass plates somewhat larger than the slice to be printed. The plates are first coated with a material opaque in the wavelength region used to expose resist. Chromium is most commonly used. The coating process is very critical since if any pinholes exist in the metal, they will remain as part of the finished mask. The metal is then coated with resist,<sup>1</sup> and the resist is exposed to the appropriate image, which

<sup>&#</sup>x27;Called resist because it "resists" the chemicals used to etch into the unexposed areas.

#### FIGURE 5.1

Path from circuit design to a patterned wafer. (This chapter considers only the operations within the dashed box.)



may be by either an optical or an electron beam (E-beam) pattern generator. The optical generator builds up the image by exposing a short line segment, moving a small distance (stepping), exposing another short segment, and continuing until the whole mask pattern has been exposed. The light used is very intense so that the individual exposures can be quite short. E-beam printing requires a resist that is sensitive to an electron beam rather than to light. Since masks are several inches square, and since electron beams can be deflected over only a very small angle without distortion, some combination of small-area E-beam scan combined with stage travel is used to cover the complete mask. The combination of low beam energy and relatively insensitive resist makes E-beam writing very slow. However, because diffraction effects are much less than they are in optical pattern generators, E-beam writing is indispensable for highresolution masks.

The first plate made by either generator is usually referred to as a reticle and will include only one chip pattern if a mask is to be made. If the reticle is to be used directly in a stepper, it will include as many whole chip patterns as will fit into the stepper field of view. This aspect is discussed in some detail in section 5.11. The reticle geometry will generally be drawn at either  $5\times$  or  $10\times$  actual size. with 5× as the preferred size because of better manufacturing cost effectiveness. A mask made by printing enough 1× patterns from the reticle to cover a full wafer is referred to as a master. Multiple copies of the master can be made by contact printing and are much less expensive than masters but have many more defects. Reticles are used directly in step and repeat-projection printers to print chip by chip on the wafer. The full-size mask is used with contact printers, with proximity printers, or by 1:1 scanning projection printers. Projection printers will generally use masters, and contact printers will use copies.

Soda lime glass blanks, which have a rather large thermal expansion coefficient, were used as mask substrates until registration demands required a lower expansivity. Borosilicate glass was the next step and was used extensively for several years. Fused silica is now widely used for new designs. If an old product is to be switched to fused silica, a full set of new masks must be made; otherwise, the set will not thermally track. Thus, replacement masks for products that have not been converted will still be of borosilicate glass. Bright chromium is the normal metallization, but to reduce reflections from the mask, chromium with an antireflecting front surface (AR chrome) is sometimes used. When visual level-to-level alignment is used and masks for the different levels are not accurately sized to one another (or if the alignment markers are not properly placed), slight adjustments may be made by the operator in an effort to get the "best alignment." In such cases, it is helpful to have masks that are partially transparent (see-through) in the visible region, while remaining completely dark in the near UV regions where the resist is sensitive. This feature is most useful in the case of positive resist masks, which usually are mostly opaque. CVD iron oxide is a material that works satisfactorily for this application (1). Fortunately, however, sizing and placement control has now improved, and such operator judgment is seldom needed. Indeed, much alignment is now done completely automatically. This better control has been

achieved not only through improved mask making techniques but by better wafer-fab practices as well.

# 5.2.2 X-Ray Masks

The making of X-ray masks is complicated by the fact that no thick, rigid, mask blank material that is transparent to the X rays used in X-ray lithography is available. Very thin membranes must be used, and they have problems of both durability and dimensional stability. Membrane materials that have been used include polyimide, parylene, titanium, silicon, silicon carbide, silicon nitride, and boron nitride (2, 3). The inorganic films are generally preferred because of their greater strength and dimensional stability. A gold layer about 1  $\mu$ m thick is ordinarily used as the X-ray opaque material. The steepness of the gold edges will affect printing sharpness, and selective electroplating of thick gold onto a thin, patterned gold layer is sometimes used instead of initially depositing a thick gold layer and then etching the pattern into it.

The steps in making an X-ray mask using CVD membrane material are shown in Fig. 5.2. Transparent, stress-free silicon carbide layers on silicon can be obtained by depositing stoichiometric SiC from a mixture of 0.89% SiCl<sub>4</sub>, 0.37%  $C_3H_8$ , and 98.74% hydrogen



at 1200°C. The slice and layer are then attached to a pyrex ring for support, and the silicon is etched away, leaving only a peripheral ring for support of the film. The gold, or other efficient absorber such as tungsten, can be deposited before or after the silicon is removed, but to prevent subsequent distortion, patterning should be done after silicon removal.

5.2.3 Mask Protection Masks used in contact printing often adhere to the resist so that when the wafer and mask are separated, small areas of resist remain on the mask, thus leaving holes (defects) in the wafer resist. If the resist is not removed, it may produce a defect on the next wafer exposed. To minimize such occurrences, release coatings applied over the resist or to the mask surface have been used with reported defect reductions of from 10% to 40%. One example of a release agent applied to the resist is polyvinyl alcohol with a surfactant such as Triton X-100 and a lubricant such as calcium stearate spun onto the wafer just after resist soft-bake (4). Release agents applied to the mask have included parylene (5) and fluoropolymers (6).

Masks used for projection printing do not touch other surfaces but nevertheless will show a gradual increase in flaws. In this case, the flaws are dust particles settling on the mask surface. The particles can originate from the operators handling the masks, from the storage containers, or from the interior of the printing machinery itself. Ordinarily, particles on the back of the mask cause little trouble since they will not be in focus. Furthermore, steppers usually use  $5 \times$  or  $10 \times$  size masks so that a particle that does alight will be reduced in size and may thus be smaller than the limit of resolution of the lens and not print. Where such particles are a problem, a thinmembrane pellicle can be placed above the front surface of the mask so that particles will be caught by it and held out of focus. An approximate expression for the pellicle-mask separation  $d_p$  required for a 10% or less intensity variation is (7)

Printer Light Source f-Numbers Printer f-Number Perkin-Elmer //4\* 200 (1X) Ultratech 900 //4 (1X)

TABLE 5.1

GCA 4800

(10X) \*Aperture 4. \*Effective *f*-number after lens reduction.

1/25+

 $d_p = \frac{nfP}{280}$ 

where *n* is the index of refraction of the material between the mask and pellicle (1 for air and approximately 1.5 for glass), *f* is the condenser *f*-number, and *P* is the diameter of the particle on the pellicle. Some representative light source *f*-numbers are shown in Table 5.1. Depending on whether or not deliberate overexposure is being used, even a 10% reduction in intensity could cause a thinning of the resist. While the thin spot might not be enough to cause the mask to fail during an etch operation, it could fail to give protection during an ion implant step. The effectiveness of the pellicle or the image reduction in minimizing the printability of particles depends inversely on how well optimized the system is for high resolution. Thus, unless the maximum lens resolution is required for the geometries being printed, best overall performance may occur when the total system is adjusted for less than maximum resolution.

Since pellicles represent more reflecting surfaces in the optical path and more opportunities for light scattering and attenuation, the pellicle composition and thickness must be carefully chosen. Nitrocellulose is a common pellicle material. The thickness will ordinarily be only a few microns and may be adjusted to give minimum reflection at the printing wavelength. Alternatively, antireflection coatings, either organic or inorganic, can be used to ensure minimum reflection loss.

Mask defects can be broadly described as follows:

Pinholes Notches Bridges<sup>2</sup> Missing geometry Geometry of wrong size Runout/runin Particulates Scratches

Some of these defects are shown schematically in Fig. 5.3. Pinholes usually stem from holes in the chromium film when it was originally deposited on the mask blank. Mask chipping and scratching can occur during transfer and other handling and produce defects ranging from small pinholes to long tears cutting across whole chip patterns.



Mask defects.



'Holes and notches are often called clear defects. Bridges are opaque defects.

# 5.3 MASK DEFECTS

Elm Exhibit 2159, Page 185

The harder the glass used for the mask blank, the less likely it is to chip. Thus, borosilicate glass is more resistant than soda lime glass. Bridging is a mask printing or developing problem. Missing geometry occurs because of missing data fed to the pattern generator or because of poor chromium-glass adhesion, which allows some of the chromium to be washed away during cleaning. Geometry of the wrong size could occur on a new mask because of errors in describing the size, but the usual cause is a resist exposure problem or a metal etch problem. Runout/runin is discussed in more detail in a later section. Particulates are unique in the list of defects in that they are the only defects that can accumulate and then be easily removed. They arise from mask storage containers and the printing equipment, from chipping of glass off the mask edges, and from the ambient air in which the mask is handled. While every effort must be made to keep particles from getting on the mask, efficient mask cleaning on a routine basis is also required.

Inspection for the various defects just discussed must be done not only as masks leave the mask print shop but also regularly in the wafer-fab area. To simplify these inspections, several specialized sets of measuring equipment have evolved. Even the standard optical microscope, if used for examining masks with attached pellicles. must be equipped with long working length objectives. Otherwise, the pellicle spacing will prevent all except the lowest power objectives from being brought close enough to the mask surface to focus.

The simplest line-width measuring equipment consists of a microscope and either a filar or image-shearing evepiece. The width of the image is measured, and accuracy depends on being able to recognize the actual edge of the object from its image. Experiments have shown that a human observer using a filar evepiece will undersize a clear line and oversize an opaque line. When the observer uses an image-shearing eyepiece, the reverse occurs (8). Better results are obtained if a photoelectronic scan of the image is used. In this case, edge-point detection can be preset to occur at a given fraction of the intensity change observed when going from dark to light. For zero transmission of the "dark" region, the fraction is calculated to be 25% (8). If there is some transmission, the number is higher. The most accurate measurements use either a laser or an electron beam to directly detect the line edges as the mask is moved along. and a laser interferometer to measure the stage motion between the two detected edges (9, 10).

Pinholes, bridges, scratches, particulates, and missing geometry can all, in principle, be detected by a thorough microscopic examination. However, to do a thorough job means that each piece of

# 5.3.1 Mask Inspection

geometry on the whole mask must be inspected.3 The time consumed by such an inspection is overwhelming, and the possibility of missing a defect is quite large. One early inspection machine used complementary colors of light for simultaneously viewing chip patterns in adjacent rows. Portions of the pattern that were alike appeared as either black or white. Differences between the two patterns were in one of the illuminating colors and easy for an operator to see (11). Chips having differences were then further visually inspected to determine which one was flawed. Another approach, which looked at individual chip patterns, made use of the fact that masks are generally composed of rectangular segments. Spatial optical filtering was used to remove rectangular components of the Fourier spectrum and leave only that from curved objects (most defects would appear to be curved). This filtering enhanced the image of the defects relative to that of the proper geometries and made the defects easy to see (12). Such machines are difficult to automate, however, and sometimes give spurious differences. The most successful approach has been to optically scan each of two adjacent chip patterns and compare the results (13). Unlike the color machine, precise optical alignment of the images is not necessary since the two patterns can be digitized and computer overlayed for best fit. A search for differences can then be made without alignment interference. None of the methods just described will detect missing geometries caused by faulty inputs to the pattern generator since all of the chip patterns will have the same defect. It is possible, however, with some machines, to compare the digitized scan data from a single chip pattern with the original digitized layout pattern data and detect missing patterns.

A completely different approach can be used to look for particulates on pellicles, on the back of masks, on mask blanks, and on unpatterned wafers. With this approach, an intense light is directed to the surface under inspection, and the light scattered from any surface irregularities is detected and mapped. Two variations of the approach have proven quite successful. One covers the whole surface with light and detects light scattered from the surface with a

When visual inspection is performed with an optical microscope, care must be exercised to ensure that no contaminants are introduced during inspection. Contaminants can come from such things as sneezes and flecks of dandruff, skin, or cosmetics, and even from the breath of a person who has recently smoked. Inspection stations using robots or other forms of mechanical wafer transport can produce particles from the wearing of linkages and should themselves be inspected regularly. (See, for example, Brian Hardegen and Andrew Lane, "Testing Particle Generation in a Wafer Handling Robot," *Solid State Technology*, pp. 189–195. March (985.)

very sensitive television camera. The other scans the surface with a laser light beam and detects scattered light with an integrating sphere and light detector (14).

Overlay inaccuracies can arise from a random misplacement of a feature within a die, from a random misplacement of a die within the whole mask array, or from a systematic change in die-to-die spacing as the mask is traversed (runout/runin). Even if a set of masks is perfectly matched as manufactured, the various levels printed on a given wafer may not match when contact or full-wafer scanning projection printing is used. This mismatch can arise for several reasons. One is due to thermal expansion of the mask and/or wafer. Printing of all levels may not be at the same temperature because of drifts in cleanroom or equipment temperature. Thus, since the expansion coefficients of mask and wafer are generally different, level-to-level matching will suffer. Also, a temperature difference may exist between wafer and mask, which may increase from wafer to wafer during a run due to the high-intensity light source used during exposure. Differences in chromium coverage (which affects reflectivity) from one mask to another can affect the temperature rise during exposure. Hence, data taken for one mask geometry may not be appropriate for another (15). Data indicate that if precautions are not taken, up to a 4°C rise in mask temperature can occur during a printing sequence. Thus, the amount of this runout will vary from wafer to wafer within a run. Temperature gradients may also develop across the mask during exposure due to an uneven distribution of cooling air.

If the mask and wafer features are overlayed at the center of the wafer, the runout R for a change in temperature of the mask and/or wafer between two levels of printing is given by

$$R = r(\Delta T_m \alpha_m - \Delta T_w \alpha_{\rm Si})$$
 5.1a

where r is the wafer radius,  $\Delta T_m$  is the temperature change of the mask between exposures,  $\Delta T_w$  is the temperature change of the wafer between exposures,  $\alpha_m$  is the expansion coefficient of the mask, and  $\alpha_{si}$  is the expansion coefficient of silicon. If the wafer temperature remains constant, but the temperature of different levels of masks varies by  $\Delta t$ , Eq. 5.1a reduces to

 $R = r\Delta t \alpha_m \qquad 5.1b$ 

The expansion coefficients of various mask glasses and the amount of runout across a 150 mm wafer that might be expected are shown in Table 5.2. For a 1°C temperature differential across the mask, the runout for a 150 mm slice is given by line 2 of Table 5.2.

# 5.3.2 Mask–Wafer Overlay Accuracy

### TABLE 5.2

Thermal Expansion Effects

| 1                                             | Material |           |              |              |
|-----------------------------------------------|----------|-----------|--------------|--------------|
| Effect                                        | Silicon  | Soda Lime | Borosilicate | Fused Silica |
| Expansion Coefficient<br>(10 <sup>-</sup> %C) | 2.3      | 9.4       | 4.5          | 0.5          |
| Runout µm<br>(1°C across mask)                |          | 0.7       | 0.34         | 0.037        |
| Runout µm<br>(4°C change; see text)           |          | 2.1       | 0.7          | -0.55        |

The runout for a 4°C excursion of both mask and wafer is given in line 3 of Table 5.2. (A  $\pm$ 2°C temperature control of a cleanroom is reasonable.) Note that the use of fused silica with its low expansion coefficient does not eliminate runout in the case just discussed. However, its use does make temperature control during mask making, as well as temperature control of the mask during printing, of less concern.

Bending of the mask can also cause a small amount of runout, Most mask blanks will not be perfectly flat when freely supported, but when the image is being printed onto them, whether by E-beam, optical pattern generator, or direct contact, the plate will be pulled flat against a carefully ground chuck. However, when the mask is being used (except for contact printing), it will again be supported only at its edges. To estimate the runout that can occur, assume that the mask has spherical curvature and that the maximum deflection is much less than the thickness of the mask. Also suppose that over the area of a wafer of diameter D, the maximum mask deflection (at the center of mask and wafer) is  $\delta$ . It can then be shown that the runout, which is half the difference  $\Delta D$  between the distance Dalong the curved mask and the projected distance D' on a flat surface, is given by

$$R = \frac{\delta^2}{D}$$
 5.2

Bowing of the wafer can cause misregistration problems similar to those caused by the mask. Furthermore, a dirt particle or burr on the chuck surface can cause severe local distortion and consequent local misregistration. Thus, care should be given to maintaining very flat and clean chuck surfaces. By knowing the vacuum pressure used to pull the wafer against the chuck and the maximum wafer deflection, which will be the diameter of the trapped particle, an estimate of the diameter of the distorted area can be made. Fig. 5.4 is a plot of the calculated distorted area diameter and surface elongation versus particle size for a single wafer size.

## FIGURE 5.4

Wafer distortion due to a particle between the chuck and wafer. (*Source:* Jere D. Buckley, Director of Advanced Studies, The Perkin-Elmer Corporation. From material presented at a Wafer Flatness Workshop at TI on October 4, 1984.)



| TABLE 5.3                    |           | Typical Feature | Minimum Feature |                   |  |
|------------------------------|-----------|-----------------|-----------------|-------------------|--|
| <b>Required Registration</b> | Device    | Size (µm)       | Size (µm)       | Registration (µm) |  |
|                              | 64K DRAM  | 2.5             | 1.8             | ±0.5              |  |
|                              | 256K DRAM | 1.5             | 1.1             | 0.3               |  |
|                              | 1M DRAM   | 1.3             | 1.0             | 0.2               |  |
|                              | 4M DRAM   | 1.0             | 0.8             | 0.2               |  |

Table 5.3 gives the approximate level-to-level registration required in order for no yield loss to occur because of registration. Thus, for no yield loss on a wafer, the total runout must be less than the numbers shown in the table.

5.3.3 Overlay Accuracy Measurements Alignment accuracy is generally considered to include only the accuracy with which a marker from one level matches that of another. For contact, proximity, or full-wafer projection printing, there are only two such markers per level per wafer, and it is easy to measure deviations. Even if the markers do match, geometries in the various chips in the wafer may not because of local distortions in the mask or wafer and, if printing is by optical projection, because of the various distortions in the optical system. In addition, as mentioned in the previous section, the alignment markers may be misplaced with respect to the actual chip patterns. Overlay (registration) accuracy considers how accurately the geometric patterns of one level match their corresponding patterns on other levels. To achieve the necessary measurement precision, special test patterns are used, and to speed up the measurement process, many of the test patterns are designed so that misregistration can be determined by automatic testing. In some cases, these patterns are incorporated into the chip or put in the scribe lines. In other cases, special wafers covered with test patterns are run from time to time to evaluate a printing machine.

Fig. 5.5 shows four patterns that have been used. The first, a set of verniers, shown in Fig. 5.5a, can be put on each chip, with the two halves of the verniers going on two consecutive levels. They are quite sensitive and are directly and individually read visually (16). The next pattern set, shown in Fig. 5.5b, is run on test wafers and requires individual measurement with optical line-width measuring equipment (17). The last two use separate test wafers and require a diffusion or ion implant step, but the degree of misalignment can be read on automatic test equipment in terms of electrical resistance. With these two patterns, x, y, and  $\Theta$  can be determined for each pattern, and misregistration over the whole wafer can be quickly mapped out (18-19). When the patterns of Fig. 5.5c are used, the corner-contacted square resistor, along with all of the contact arms 1 through 8, are defined by the first printing operation. The second one positions the voltage-sensing contacts that connect arms 1, 3, 5, and 7 to the square resistor. How far they are from being perfectly centered can be determined by electrical measurements and is a measure of misregistration. The resistor and all of the contacts except  $V_1$  and  $V_2$  of the pattern of Fig. 5.5d are defined during one printing step. The V1 and V2 contacts are added during a later printing step, and by determining R and  $R_1$ - $R_4$ , misregistration can be calculated.

The first resist used in semiconductor fabrication produced a negative image and was thus called negative resist. As shown in Fig. 5.6a, areas where the light strikes become polymerized and more difficult to remove so that when the resist is "developed" (subjected to a solvent), the polymerized region remains. Later, positive resist, based on a different chemistry was sometimes used instead. As shown in Fig. 5.6b, exposure to light changes the material so that in that region the resist is soluble in a different solvent and can be more easily removed. Since the exposed regions are removed, the resist is referred to as positive.

5.4 PHOTORESISTS



## 174 CHAPTER 5 LITHOGRAPHY



In general, resists have several components. First is the resin, which has the property of withstanding the etch solution. The resin may or may not be photosensitive but is, in any event, not very sensitive. Thus, a sensitizer is added, which is itself very light-sensitive and, in addition, in the case of negative resist, transfers energy to the basic resin during exposure. Since the exposure process requires changes in molecular bonding, substantial photon energy is required, which, in turn, generally dictates the use of ultraviolet light. Since neither of the basic components may adhere well to the surface being coated, an adhesion promoter may also be needed. The result can be a very viscous material that is difficult to spread uniformly. The viscosity must then be reduced with a thinner. The major resist properties that a user must consider are the following.

- Sensitivity: How much light energy is required for a good exposure.
- Spectral response: Sensitivity versus wavelength of the exposing light.
- Contrast: The difference in amount of light to produce a good image and the threshold amount of light where an image just begins to form.
- 4. Resolution: Size feature that can be reproduced in the resist.
- Adhesion: How tenaciously the resist sticks to the substrate.
- Etch resistance: How well the resist will protect the substrate from the etchants being used.
- Ease of processing: How difficult it is to apply, develop, and strip the resist.
- Pinhole density: How many holes in the resist occur because of its inherent properties.
- Toxicity of resist and related processing chemicals: The allowable level of fumes in air breathed by operators.

These properties will be discussed in more detail throughout the remainder of this chapter. The myriad of resists that have been and continue to be developed are in response to these requirements, and the fact that work continues on new resists attests to the fact that resists meeting all of the requirements have yet to be found.

Polyvinyl cinnamate was used as the basic resin in most of the resists of the 1960s. However, most negative resists now are based on cyclized polyisoprene polymers. A wide range of sensitizers may be used, including quinones, azido compounds, and nitro compounds for polyvinyl cinnamate and azides for cyclized polyisoprene. A small quantity of novolac resin can be added to improve adhesion and reduce swelling during developing. Solvents/developers include nitrobenzene, acetic acid, and furfural for polyvinyl cinnamate and xylene and benzene for polyisoprene (cyclized rubber).

Regardless of the resin or additives, negative resists suffer from swelling during development. Even though the developer does not dissolve the exposed resist, it is absorbed and causes swelling. During subsequent rinse operations, the developer is removed, and the resist then shrinks. If the resist lines are close together (as required for high resolution), the swelling can cause them to touch, and during the subsequent shrinkage, they may still remain stuck together. Long, narrow lines well-anchored at the ends can become wavy during swelling and, if surface adherence is good, will retain their waviness after shrinkage. Alternatively, the lines may be pulled loose from the substrate,

Positive resists function very differently from negative resists. The sensitizer and the main body resin do not interact, so the change in solubility is all due to the sensitizer. The sensitizer breaks down under the influence of light and increases the solubility rate in alkaline solutions by a factor of about a thousand. Since the presence of the sensitizer inhibits dissolution, it is often referred to as an inhibitor. While negative resists use only 2% or 3% sensitizer, in positive resists, 20% may be added. The most common resin is phenol formaldehyde novolac, and the most common sensitizer is napthoquinone diazide. Ethylene glycol monomethyl is a possible solvent; diluents can be butyl acetate, xylene, cellosolve acetate, or some combination thereof.

Positive resists have a broader optical sensitivity than negative resists and can utilize more of a conventional UV lamp's output.

See references 20-22. See references 20, 23.

# 5.4.1 UV Negative Resist Formulation\*

5.4.2 UV Positive Resist Formulation<sup>+</sup> mally present in the atmosphere or in the film is required for an alternate reaction to take place that does produce the normal alkaline solubility. By exposing the resist to the desired pattern in a vacuum and then flood-exposing the entire surface with UV, the previously unexposed resist becomes soluble and that exposed in vacuum remains insoluble. Thus, a positive resist can behave as a negative resist without swelling problems, although the main reason for image reversal processing is to obtain higher resolution than is otherwise obtainable with most positive resists. In practice, positive resist image reversal processing generally involves the addition of an extra photosensitive material to the resist before flood-exposing (24).

They do not show the oxygen effect<sup>4</sup> of negative resists, but if they are exposed in a vacuum, the breakdown of the inhibitor forms structures that are not soluble in alkaline solutions. Moisture nor-

Because the usual UV resists are too insensitive for practical X-ray and E-beam use, new formulations are required. The task of combining sensitivity, etch resistance, and thermal stability into one resist has proven to be much more difficult, and completely satisfactory resists are not yet available. The earliest E-beam resist (the same resists are generally used for E-beam, X-ray, and ion beam) was poly-(methyl methacrylate), usually referred to as PMMA. It is a vinyl polymer positive resist that degrades when subjected to radiation. Its resolution is good, but its sensitivity and thermal stability are poor. Another group of positive resists is represented by poly-(butene-1 sulfone), or PBS. It is much more sensitive but has poor plasma etch resistance. Suitable developers for either of these resists are methyl ethyl ketone or methyl isobutyl ketone. Various negative resists are available, and several are based on poly-(glycidyl methacrylate), or PGMA-for example, poly-(glycidyl methacrylate-co-ethyl acrylate), or COP. Sensitivity and etch resistance for these resists are generally good, but like other negative resists. since they suffer from swelling during development, line spacings must be greater than for positive resists. When these resists are used for ion beam lithography, sensitivity is two to three orders of magnitude higher than it is for E-beams. However, the ion beam density is much less, so the time for exposure is comparable. However, it does not appear feasible to increase ion beam resist sensitivity. So few ions per unit area are now required for exposure that increased sensitivity could lead to large statistical variations in exposure.

"See section 5.4.8 on reciprocity." See reference 25.

5.4.3 E-Beam, X-Ray, and Ion Beam Resists\*



5.4.4 Inorganic Resists

The resists discussed thus far have all been organic-based. It is possible, however, to develop patterns in some inorganic materials (26). In many cases, less normalized energy differential is required between "light" and "dark" for exposure than is required for their organic counterparts. Thus, for a given quality optical imaging system, smaller features can be printed when inorganic resist is used. A considerable increase in processing complexity occurs, however, As currently implemented (see Fig. 5.7), an inorganic resist requires two separate layers. The bottom layer is a chalcogenide glass such as arsenic trisulfide or germanium selenide deposited to a thickness of about 200 nm. On top of the glass is a very thin (100-200 Å) layer of silver or a silver-bearing compound such as silver sulfide or silver selenide. Exposure to light at room temperature causes the silver to diffuse vertically downward into the glass (photodoping). The addition of silver to the top layer of the glass makes it resistant to alkaline etches so that it will remain while the unexposed regions are etched away. Undercutting is not a problem, apparently because the columnar structure of the glass makes it etch much more slowly in a horizontal direction than in a vertical direction. Since these chalcogenides have high optical absorptivity, no standing waves can form when there are reflective surfaces. They are useful over the wavelength range from UV to X ray.

5.4.5 Plasma-Developable Resists

In the continuing effort to provide an all-dry lithographic process, both dry application and dry developing have been considered. Dry application could, in principle, be CVD, plasma-assisted CVD, sputtering, or evaporation. Dry developing implies a resist with a differential plasma or reactive ion etch rate between exposed and unexposed regions. Thus far, no success has been achieved with dry resist application, and while there has been limited success with development, no production-worthy plasma-developable resists currently exist.

The first reports of a plasma develop process used a resist with a volatile monomer in a polymer base. Exposure to light caused some linking of the monomer and polymer. After exposure, heating drove off much of the monomer in the unexposed region and thus left a thinner layer. Subsequent plasma etching completely removed the thinner unexposed region and left a thin layer in the exposed region (27, 28). Another approach, directed more toward X-ray applications, used a resist made by mixing an organometallic monomer with a host resist polymer. Exposure caused the metal ions to form bonds that later, when exposed to an oxygen plasma, allowed the metal to oxidize and form a protective surface (29). A related approach has been to expose optical and E-beam resist in the normal fashion and then treat it with an inorganic compound that reacts with either the exposed or the unexposed resist and renders that part more difficult to plasma etch (30). In the case of ion beam printing, the metal ions needed for such reactions are implanted in the resist during exposure.

# 5.4.6 Resist Contrast

Resist contrast is a measure of the sharpness of the transition from exposure to nonexposure. Fig. 5.8 is an idealized plot of the relative thickness of resist remaining after development as a function of the exposure energy.  $E_i$  is the amount of energy required for gelling to



(The scales are different for positive and negative resist.)

## FIGURE 5.9

Normalized AZ1350J thickness remaining after development versus exposure energy (with postexposure-bake). (*Source:* Adapted from W.G. Oldham and A.R. Neureuther, *Solid State Technology*, p. 106, May 1981.)



just begin for negative resist or for positive resist to just begin to break down.  $E_f$  is the minimum energy at which full thickness of negative resist remains after development or where positive resist is completely removed. Fig. 5.9 is an example of the behavior of an actual resist.<sup>5</sup> In this case, well-defined points  $E_i$  and  $E_f$  do not exist, but a tangent to the curve can be drawn and  $E_i$  and  $E_f$  determined as shown. The swing between intensities  $E_i$  and  $E_f$  as determined in this way does not move from zero to full-thickness resist but will usually be greater than half the full thickness, which is considered adequate for most purposes. The resist contrast  $\gamma$  is a measure of the steepness of the curve in transition from  $E_i$  to  $E_f$ . A commonly used definition is as follows:

$$\gamma = \frac{1}{\log_{10} E_f - \log_{10} E_i} = \frac{1}{\log_{10} (E_f/E_i)}$$
 5.3

A plot of this equation is given in Fig. 5.10. For positive resists,  $\gamma$ 's from 2 to 4 are typical.

The intensity I of the exposing light is related to E through E = It, where t is the time. Thus, the light intensity must swing from  $I_f$  in the light portions of the image down to at least  $I_i$  in the dark portions for satisfactory resist images to be formed. Fig. 5.11a shows

<sup>&</sup>lt;sup>3</sup>The curve in Fig. 5.9 is still an approximation in that even with no exposure, some resist will be dissolved by the developer solution.





the case of the intensity varying from  $I_f = E_f/t$  to zero. The case of the light intensity varying from  $I_f$  on down to an amplitude that will just barely prevent the resist from responding  $(I_i = E_i/t)$  is shown in Fig. 5.11b for the case of resist with a contrast ratio of 3. This square wave of period L can be expressed as

$$I = I_o + I_m \Sigma f \sin \frac{2\pi(2n-1)x}{L}$$
 5.4

 $I_{f} = 1$ 

where  $I_n$  is the average intensity,  $I_m$  is the amplitude of the modulating light, and f is given by  $4/\pi(2n - 1)$ , where n goes from 1 to infinity.<sup>6</sup> The wave modulation  $M_i$  of the image is defined as

$$M_i = \frac{I_m}{I_n}$$
5.5

Fig. 5.11b shows that when the lines will just print,  $I_i = I_o - I_m$  and  $I_f = I_o + I_m$ . By using these values and Eq. 5.5, the expression for  $M_w$ , the critical modulation below which printing will not be acceptable, becomes

$$M_{i_{\ell}} = \frac{I_{f} - I_{i}}{I_{\ell} + I_{i}}$$
 5.6

By using Eq. 5.3,  $M_{ic}$  can also be expressed in terms of the resist contrast y as

$$M_{iv} = \frac{10^{1/\gamma} - 1}{10^{1/\gamma} + 1}$$
 5.7

This equation would predict that for a resist contrast of 3. an  $M_i$  of 0.3 would be acceptable. However, experimentally, it is found that for positive resist, an  $M_i$  of 0.6 is desirable, although 0.4 can sometimes be tolerated. Fig. 5.12 is a plot of Eq. 5.7 over the contrast range from 1 to 5.

With good chrome masks, the source modulation  $M_s$  is 1—that is, the "black" areas are black, and the "clear" areas are clear.<sup>7</sup> Thus, the modulation transfer function (MTF) of the optics of the printing system necessary to print the desired image is equal to  $M_{pc}$ . MTF will be discussed in more detail in a later section, where it will be shown that as the feature size gets smaller than a few microns, MTF begins to decrease and will limit the ability to print small features. If the contrast of the resist were improved, then printing could be done with a lower  $M_i$  (and hence lower printer MTF). It does continue to improve, but in addition, the effective contrast of a given resist can sometimes be improved by adding a layer of photobleachable material on top of the resist (31–33).

A photobleachable material is one whose optical transmission decreases as the amount of incident light increases, as, for example,

<sup>&</sup>quot;When the higher-frequency components are removed, the initially square wave becomes rounded, until finally, for n = 1, only a sine wave remains.

<sup>&#</sup>x27;This was not true of the older emulsion masks. They had a noticeable transmissivity in the dark regions. Too high an intensity light source would cause "burnthrough" and resist exposure in the dark regions.



contrast.



the various diarylnitrones that are converted to oxaziridine by absorbed light. A layer of such material on top of the resist will first bleach in the regions of highest light intensity and begin exposing the resist before the darker areas begin to transmit. Thus, the darks are made darker and the lights lighter.

Sensitivity is the measure of how much energy is required to allow the resist to develop properly. However, how to define sensitivity properly is not always straightforward. It is commonly defined as an energy value equal to, or closely related to, the  $E_i$  discussed in the previous section. In the case of negative resist, it might be the point where 50% or 70% of the resist remains after development. A practical definition is that it is the energy required to give a "good" pattern. As the exposure energy increases, the development (resist dissolution) rate will first be very low and will then increase very rapidly. Sensitivity also can be defined as the energy required to raise the ratio of the exposed dissolution rate to the initial rate to some arbitrary value. In either case, higher sensitivity means a lower energy to expose, and it is generally desirable since for a given light source, throughput improves with higher sensitivity resist. Also in either case, the sensitivity is dependent on the develop conditions and on the wavelength range of the source included in the energy measurement. In the case of E-beams and ion beams, sensitivity is usually expressed either as the number of coulombs (C) of charge required or as the number of particles required. In either case, the sensitivity depends on the particle energy. Table 5.4 lists representative sensitivities of various resists. As mentioned earlier, the exact

5.4.7 Sensitivity

## TABLE 5.4

**Resist Sensitivity** 

|                             | Sensitivity       |                                  |                                |                                |                                  |           |
|-----------------------------|-------------------|----------------------------------|--------------------------------|--------------------------------|----------------------------------|-----------|
| Resist                      | UV<br>$(mJ/cm^2)$ | Deep UV<br>(mJ/cm <sup>2</sup> ) | X-Ray<br>(mJ/cm <sup>2</sup> ) | E-Beam<br>(C/cm <sup>2</sup> ) | ton Beam<br>(C/cm <sup>2</sup> ) | Reference |
| PMMA* (positive)            |                   | 500-1500                         | 500-1000                       | $2 \times 10^{-4}$             | $4 \times 10^{-6}$               | (1.2)     |
| PBS† (positive)             |                   |                                  | 200                            | 1.6 × 10 °                     |                                  | (1)       |
| COP <sup>‡</sup> (negative) |                   |                                  | 175                            | $4 \times 10^{-7}$             | 6 × 10 *                         | (1)       |
| PGMA (negative)             |                   |                                  |                                | 10-6                           |                                  | (1)       |
| Ge,Se, (negative)           |                   | 50 (220 nm)                      |                                |                                |                                  | (2)       |
| Hunt HNR (negative)         | 228               | a general state                  |                                |                                |                                  | (3)       |
| Kodak 732 (negative)        | 20§               |                                  |                                |                                |                                  | (4)       |
| Hunt HPR (positive)         | 50§               |                                  |                                |                                |                                  | (3)       |
| Kodak 809 (positive)        | 150§              |                                  |                                |                                |                                  | (4)       |

Mo X-ray target, 20 kV electrons.

#Pd X-ray target, 20 kV electrons, 100 kV protons.

\$Based on the multiline spectrum of an Hg light source.

References: (1) M.J. Bowden, p. 39, in ACS Symposium Series 266, Materials for Microlithography, American Chemical Society, Washington, D.C., 1984. (2) E. Ong et al., p. 71, in D.A. Doane and A. Heller, eds., Inorganic Resist Systems, Electrochemical Society, Pennington, N.J., 1982. (3) Philip A. Hunt Chemical Corporation product literature. (4) Kodak product literature.

> value will depend on specific conditions. In general, negative resist is much more sensitive than positive resist, and X-ray resists are much slower (less sensitive) than UV resists.

# 5.4.8 Reciprocity

Over the exposure region where the product of exposure time and light intensity for proper exposure remains constant, a photosensitive material is defined as exhibiting reciprocity (the Bunsen and Roscoe reciprocity law). With the range of exposure times normally used, there is no evidence of reciprocity failure in positive resist. However, most negative resists, during exposure, are sensitive to oxygen in the resist. If oxygen is available from the ambient to diffuse into the resist during exposure, the photo-initiated products of the sensitizer react with the oxygen rather than forming cross-links in the polymer (34). The result is that as exposure time is increased, there is a reduction in the number of cross-links for a given amount of incident energy and thus a reduction in sensitivity. In contact printing, the tight contact between mask and wafer protects the resist from ambient oxygen, and there is no "oxygen effect." However, if negative resist is used with a projection printer with the wafer in air, a significant reduction in sensitivity will occur. A nitrogen blanket can be used to exclude the air and allow normal exposure. Also, the higher the exposure intensity, the shorter the exposure time and the less the effect.

When very high intensities are used—for example, when a series of short, high-intensity pulses are used instead of a continuous lower-intensity exposure—reciprocity failure can occur for other reasons in both negative and positive resist. Examples of pulse exposure are eximer-laser deep UV sources and laser-initiated X-ray sources.

As more vigorous etching environments develop, as in plasma etching, it becomes increasingly difficult for the resist to provide the necessary protection during etching. In general, the more resistant the resist, the more difficult to develop. Consequently, to be able to harden the resist after development is useful. Higher-temperature postbakes help, as does a flooding with high-intensity UV, which increases the amount of cross-linking.

The processing flow is essentially the same for all standard organic resists, positive, negative, optical, E-beam, or X-ray. The steps of this basic flow are listed in Table 5.5. The general aspects of these steps are discussed in the following sections, but substantial differences in the behavior of different resists may exist. Therefore, the manufacturer's recommended procedures for the specific resist being used should always be consulted.

In order to prevent unwanted resist exposure, all operations from application through developing must be done in light with the ultraviolet and blue removed. Thus, yellow light is left and is obtained from specially coated fluorescent bulbs—hence the name "yellow room" for that portion of the wafer-fab facility used for lithography operations. The total cycle time from resist application to stripping can be less than an hour but, because of queuing, is often much longer. Ordinarily, the longer the time from application to etch, the poorer the adhesion. Long-term storage of resist-coated wafers in ordinary yellow light may produce a film over the whole surface that will not develop away. For negative resist, nitrogen storage may aggravate the problem since no oxygen is present to reduce sensitivity.

Perfect adhesion, which minimizes undercutting, is generally the goal, but a gentle lifting (separation of the resist from the surface) during etching gives tapered sidewalls, which are sometimes desirable. However, such processes are difficult to control and are sel-

5.4.9 Etch Resistance

5.5

RESIST PROCESSING FLOW

5.5.1 Adhesion Promoters and Adhesion

| TAB | LE | 5.5 |  |
|-----|----|-----|--|
|     |    |     |  |

**Resist Processing Flow** 

| Step                          | Comments                                                                                        |  |  |
|-------------------------------|-------------------------------------------------------------------------------------------------|--|--|
| Adhesion promoter application | Sometimes used.                                                                                 |  |  |
| Resist application            | Done by dispensing resist onto static<br>or slowly rotating wafer and then<br>rapidly spinning. |  |  |
| Resist dry                    | To remove solvents.                                                                             |  |  |
| Resist prebake (soft-bake)    | To slightly harden resist and improve<br>adhesion.                                              |  |  |
| Expose                        | Defines image in resist.                                                                        |  |  |
| Postexpose-bake               | Sometimes used to suppress effect<br>of standing waves.                                         |  |  |
| Resist developing             | Removes unwanted resist.                                                                        |  |  |
| Rinse                         | Displaces developer in resist.                                                                  |  |  |
| Resist postbake (hard-bake)   | To improve adhesion and etch<br>resistance done at a higher<br>temperature than prebake.        |  |  |
| Etch oxide*                   | Resist must not react with etch or<br>peel from surface.                                        |  |  |
| Resist removal                | Done with oxygen plasma or hot acid.                                                            |  |  |

\*Sometimes, another insulator or metal is used. Sometimes, photoresist is used as an ion implant mask rather than as an etch mask.

dom used. Clean, dry surfaces of undoped oxide, silicon nitride, polysilicon, and most metals will give satisfactory adhesion. Adherence to heavily phosphorus-doped oxide and to gold is generally rather poor, and various promoters (primers) are sometimes used.

A wafer prebake (drying) cycle before resist application is recommended for good adhesion,<sup>8</sup> and if a promoter is also desired, a 1% solution of hexamethyldisilizane (HMDS) in xylene spun onto the wafer and allowed to dry a few seconds before resist application can be used for negative resist. For positive resist, a solvent such as EGMEA must be used.<sup>9</sup> As an alternative method of application, the wafer can be exposed to HMDS vapor just before resist spinning, and the possibility of solvent-induced problems can thus be avoided. Phosphorus, carbonyl, or sulfur-containing additives to the basic promoter are suggested to improve adhesion to gold metalli-

<sup>&</sup>quot;For example, 2 hours at 200°C for negative resist and 300°C for positive resist is suggested by Philip A. Hunt Chemical Corporation.

<sup>&</sup>quot;Xylene and some other negative resist solvents will cause gelatinous clusters to form in positive resist—thus, the use of ethyl glycol monoethyl ether acetate (EGMEA).

zation (35). Higher soft-bake temperatures for positive resist and higher hard-bake temperatures for negative resist will help adhesion. Higher soft-bake temperatures reduce optical sensitivity. Higher hard-bake temperatures can cause unwanted flow and pattern distortions and also make it more difficult to remove the resist during the stripping operation.

The main requirements for resist application are that the resist be pinhole-free and of uniform and reproducible thickness. Integrated circuit applications in general require tighter thickness tolerance and a thinner layer than do printed circuit board applications. Hence, roller application, dipping, spraying, or the use of sheets of dry resist is not appropriate. The use of thin layers also requires that much more attention be given to the removal of particles and gelatinous masses from the resist before it is dispensed. Resist is carefully filtered by the manufacturer, but because subsequent contamination may occur and long storage times may produce more gel, filtration is also often done just before application.

The standard application method now in use puts a metered amount of resist onto the center of the wafer and then spins the wafer to fling off the excess resist and produce a uniform covering. This operation is referred to as spinning, although the literature of 1970 and before often called it "whirling." It is a seemingly simple process but is very difficult to accurately model. Table 5.6 lists many of the variables that affect the thickness.

Theoretical modeling predicts the thickness t to vary as Eq. 5.8, which gives

$$t = KS\left(\frac{v}{\omega^2 R^2}\right)^{1/2}$$
 5.8

where K is a constant, S is the fraction of solids in the resist, v is the kinematic viscosity of the resist,  $\omega$  is the angular velocity of the spinner, and R is the wafer radius (36). Experimentally, S and v are intertwined since the viscosity is ordinarily adjusted by adding solvent to the resist, which, in turn, lowers the percentage of solids in the resist. Fig. 5.13 gives experimental data for dynamic viscosity versus solids content for a particular resist. Curves such as this one can be used to determine the amount of solvent necessary to lower resist viscosity to some desired lower value. One equation for a particular resist combines S and v (40) so that

5.5.2 Resist Application\*

<sup>&#</sup>x27;See references 36-39.



Dynamic viscosity versus solids content for a negative resist.



$$t = \frac{K(\ln \nu + 0.2258)^2}{\omega^{1/2}}$$
 5.9

The dynamic viscosity is defined as the shearing stress divided by the rate of shearing strain and has the units of poise. Kinematic viscosity is dynamic viscosity divided by the density of the fluid and is expressed in stokes. Note that

> 1 poise = 1 dyne $\cdot$ s/cm<sup>2</sup> = 1 gram/cm $\cdot$ s 1 centipoise = 0.1 poise 1 stoke = 1 cm<sup>2</sup>/s

Since the density is relatively insensitive to solids content, the ratio of kinematic to dynamic viscosity remains reasonably constant. Viscosity is measured by the drag on a rotating or oscillating drum, by the time for a spherical ball to drop a given distance in the liquid, or by the time required for a given volume to flow through an orifice. Both flow through an orifice (Canon–Fenske viscometer) (41) and drag on a drum (Brookfield viscometer) have been used to characterize resist.

Eq. 5.8 predicts that if the resist behaves in a Newtonian fashion—that is, viscosity independent of shear rate—then  $t \approx \omega^{-2/3}$ . Experimentally observed variations of the thickness *t* with the spin velocity  $\omega$  have ranged from *t* inversely proportional to the 0.4 power of  $\omega$  up to 0.7 (36–38). Data for some resists show that the exponent is a function of the viscosity (36), with a value near -0.66for low viscosities and near -0.5 for high viscosities. Modeling is complicated by the fact that the resist properties change during spinning because of solvent evaporation.

The diameter of the wafer appears to have less effect on thickness than the range of thickness variations normally observed during processing. The same is true of the ramp rate (spin spindle acceleration). The method of dispensing the resist, the amount dispensed, the speed with which it is dispensed, and the speed of wafer rotation at the time of dispense are all important factors in providing a reproducible process, but the interactions are so complex that only some general statements can be made. The resist should be dispensed while the wafer is either stationary or just barely rotating. A substantial amount of excess resist should be dispensed. Despite the safety requirements for rapid exhaust of solvents, removing the solvents too rapidly can cause the resist to prematurely dry and give poor coverage. If the gas flow above the wafer produces eddies, resist drops leaving the wafer edge may be transported back to the middle of the slice. If the dispense nozzle is too far from the wafer, it can cause splattering. If the wafer is irregularly shaped, as is some GaAs, the resist will usually have a nonuniform thickness over the wafer. The resist formulation contributes to resist striations,10 with some varieties showing almost none and others being heavily striated. Edge beading is a buildup of resist along the edge of the wafer and may be up to 50 µm wide and 2 µm high. The height is reduced as the spin speed and acceleration are increased. Some resist dispense equipment has resist solvent nozzles located so that solvent can be applied to the edge of the wafer and can dissolve the bead.

<sup>&</sup>lt;sup>10</sup>Resist striations are small thickness variations that arise during resist application. (See, for example, P. Frasch and K.H. Saremski, *IBM J. Res. Develop.* 26, p. 561, 1982.)

5.5.3 Soft-, Postexpose-, and Hard-Bake

The soft and hard resist bakes are done respectively just before exposure and just after development. Soft-bake (prebake) temperatures are in the 70°C–90°C range, and depending on whether a conventional convection furnace or infrared or microwave heating is used, the time may range from 30 minutes down to 4 or 5 minutes. Prebakes are necessary to drive solvent out of the resist so that it can be properly exposed. The subsequent behavior of the resist is dependent on the prebake cycle in that some chemical changes take place at the elevated temperatures. As mentioned in a previous section, prolonged baking will reduce the resist sensitivity (speed) but will also improve adhesion. If negative resist baking is done in an atmosphere high in NO<sub>2</sub> or ozone—that is, with a high smog content—an insoluble scum may form, which will interfere with developing and the subsequent etching step (42).

When interference effects (discussed in a later section) cause the resist to be exposed in layers, a postexposure-bake will cause the exposed resist reaction products to diffuse from layer to layer and thus allow for more uniform development of the entire resist film.

The hard-bake (postbake) is done after development in order to increase both adherence and etch resistance. Hard-bake temperatures will vary with the resist, but they are generally higher than the prebake temperatures. Positive resists must be postbaked at lower temperatures than negative resists—for example, 90°C-120°C versus 130°C-140°C. If the temperature is too high, the resist will be difficult to strip and also may flow and deform the pattern.

Negative resist can be developed by a solvent such as xylene, but proprietary mixtures formulated for the various resists are often used. Immediately after development, there is usually a rinse in n-butyl acetate to displace the developer in the remaining resist and reduce swelling and to wash away any remaining reaction products. Positive resist requires an alkaline developer, which can be NaOH or KOH. Concern over metallic ion contamination has prompted the use of organic bases, mostly of proprietary nature. Buffering is often used, which makes the developer less sensitive to CO<sub>2</sub> in the air reacting with it. Rinsing is also necessary with positive resist. In this case, since the reaction products are water soluble, deionized water is used for the rinse.

# 5.5.5 Resist Removal

5.5.4 Developing

When only silicon and silicon oxide or nitride are present, very vigorous stripping reactions can be used, but when resist is being removed from metal, much more care must be exercised. As in the case of developers, several proprietary stripping solutions are available. Plasma etching in an oxygen atmosphere is often used and provides a dry process. Burning of the resist at elevated temperatures can be used if the resist is free of ash-forming material. Hot sulfuric acid is very effective. A variety of organic solvents can also be used. Some materials that will remove positive resist are methyl ethyl ketone, acetone, and cellosolve. Phenol- or cresol-based solutions are used for both positive and negative resists. Stripping procedures may vary somewhat from level to level since some processing steps make the resist difficult to remove. Examples are resist that has been used as an ion implant mask and resist used during some plasma metal etching processes.

High-pressure mercury arc lights are the normal source of radiation for optical printing systems (43, 44). The pressure during operation is 35-40 atm, and since the envelope must be fused silica in order to transmit the UV, the size must of necessity be rather small so that the envelope does not explode. The arc size is approximately 2 mm in diameter by 5 mm long and radiates light in a toroidal pattern as shown in Fig. 5.14a. This light must be collected and spread uniformly over the area of the mask, which may be as large as 200 mm by 200 mm. Two approaches, both shown in Fig. 5.14b, have been used (45). One approach surrounds the source with a parabolic mirror and then uses either additional mirrors and/or lenses to transform the beam to the proper size and divergence. The other approach surrounds the bulb with a series of lenses, each of which collects a portion of the output. Early steppers used only one collection lens, but that number was soon increased in order to collect more of the total light. Methods of combining the outputs of the several collection lenses include the use of fiber optics (46) and the use of mirrors and prisms.

FIGURE 5.14

Light source optics.



44.1

5.6

LIGHT SOURCES 5.6.1 Arc Sources

The output spectrum varies somewhat with construction details and length of use, but a typical spectrum is shown in Fig. 5.15. With age, the intensity decreases, primarily because the interior of the bulb becomes coated with a thin layer of electrode material. Further, since the metallic coating absorbs more at the shorter wavelengths, a shift in the spectrum also occurs. Because of the high photon energy required to expose resist, only the 436 nm and shorter wavelengths are of use in lithography. For contact print systems, lens systems with low resolution, or reflective optics systems, the energy from all lines can be used at once. For high resolution, the lenses are corrected for just one of the wavelengths, and filters are used to remove the rest. The single frequency, while making lens design easier, also increases exposure time and makes standing waves between various reflecting surfaces in the optical path more pronounced. To combat interference, at the expense of optical resolution, lenses are sometimes corrected to both 436 nm and 405 nm. In order to minimize heating of either the mask or the wafer during exposure, a filter is used to remove the longer wavelength part of the spectrum. To minimize the energy absorbed by the filter, it is usually designed as a beam-splitting mirror (cold mirror) so that the desired wavelengths are reflected and the long-wavelength IR is transmitted,

The extension of optical printing to shorter and shorter wavelengths in order to provide high pattern resolution has made it increasingly difficult to find satisfactory light sources. Reference to Fig. 5.15 shows that with a conventional high-pressure mercury arc,  $0.3 \,\mu m$  (300 nm) is about the lower limit. By adding xenon, a usable output is obtained down to 250 nm. Deuterium lamps produce a continuum in the 200–350 nm region, but their power output is even lower. Pulsed sources can be used to momentarily increase the power output, and even in the 400 nm range, the bulbs are often idled at a lower power than that used when exposures are being made.





#### FIGURE 5.15

(a) High-pressure mercury emission spectrum in the visible and UV regions. (b) Additional lines from the introduction of xenon into the mercury arc.

| TABLE 5,7               | Material                   | Wavelength (nm) |
|-------------------------|----------------------------|-----------------|
| Excimer Laser Materials | Xenon fluoride (XeF)       | 351             |
|                         | Xenon chloride (XeCl)      | 308             |
|                         | Krypton fluoride (KrF)     | 249             |
|                         | Argon fluoride (ArF)       | 193             |
|                         | Fluorine (F <sub>2</sub> ) | 157             |

5.6.2 Excimer Lasers

The sources currently capable of producing the highest power output and the shortest wavelengths are excimer lasers<sup>11</sup> (47). Table 5.7 lists some excimer lasing materials and their wavelengths (48). Such sources have been used for contact (49–51), proximity (52), and projection printing (53). These sources are capable of 10–20 W and thus, even with relatively insensitive resists, could be used for highthroughput systems. Because of the high power associated with each laser pulse, reciprocity effects will probably dictate more total energy to expose the resist than would be predicted from standard resist sensitivity calculations.

The high degree of spatial coherence present in most lasers generates image noise and has prevented their widespread use as light sources for equipment requiring good image definition. The excimer lasers are multimode, have poor spatial coherence, and hence can produce low noise images.

The shorter wavelengths of deep UV require fused silica masks and either fused silica or reflecting optics. Below about 200 nm, not even fused silica is appropriate. At  $157^{12}$  nm, LiF has been used for a lens material (light source collimator) and CaF<sub>2</sub> for the mask (50). Because of the high attenuation of the 157 nm light at atmospheric pressure, the light path must be in vacuum. (Such short-wavelength UV is often referred to as "vacuum ultraviolet.")

Contact printing is a lensless system in which the mask is held in direct contact with the wafer. Before exposing, the mask must be aligned with features already on the surface, such as oxide holes.

5.7

CONTACT PRINTING

<sup>&</sup>lt;sup>11</sup>An excimer is two atoms or molecules that are unstable in the ground state but that form a stable entity when excited. An example of the reaction to produce an excimer is  $Xe' + Cl_2 \rightarrow XeCl + Cl$ , where Xe' is an excited atom.

<sup>&</sup>quot;The 1500 Å wavelength UV is only about 100 times as long as the soft X rays currently used for lithography. It seems likely that there will be a continual narrowing of this gap and that the deep UV and X-ray proximity printing technologies will eventually merge.

For this operation, the mask and wafer are slightly separated so that they can be moved relative to each other without damaging either. A split-field microscope, with its two objectives positioned to look diametrically opposite each other near the edges of the wafer, is used to simultaneously view the mask and the wafer below it at these two positions. Using x, y, and  $\Theta$  movements, an operator can shift the relative positions until alignment markers13 on the two overlap. After alignment, the mask and wafer are clamped together, the microscope is moved away, and the resist is exposed with UV light. The equipment is relatively inexpensive, but the process suffers from a serious difficulty. Since the mask comes in direct contact with the resist on the wafer, sometimes the resist will stick to the mask and be pulled away from the wafer, thus causing a defect on the wafer and a flaw on the mask that may not be removed before the next use. Additionally, abrasive particles or epi spikes on the wafer will permanently damage the mask so that it degrades during each use. Economics do not justify throwing away a mask after only one use, and as the size of the chip increases, the same number of defects per unit area causes increasingly lower yields. (The impact of defects introduced during processing on yield is covered in detail in Chapter 11.) Thus, the move to progressively larger chips has caused the gradual disappearance of contact printing, even though it does have the potential for fine-line definition. The good definition occurs because of lack of diffraction effects if the mask is in direct contact with a thin resist. If the wafer is so wavy that good contact does not occur or if the resist is thick, some diffraction effects can still occur, and, of course, the properties of the resist itself affect the resolution actually observed. Using deep UV and conventional resist, a resolution of 0.5 µm has been reported (54).

In order to minimize wafer and mask damage, instead of a "hard clamp" of the mask and wafer, reduced pressure is sometimes used, in which case, the machine operation mode is referred to as "soft contact." If the mask is kept slightly separated from the slice, then no contact damage occurs, although resolution suffers. This type of printing is referred to as proximity printing and requires that some additional features be built into the printing machine.

The additional mechanical requirement of a proximity printer is that provisions be made to hold the mask parallel to and a specified distance away from the wafer. To minimize mask damage, a wide sep-

5.8

OPTICAL PROXIMITY PRINTING

<sup>&</sup>quot;Alignment markers are used in all forms of printing and take many shapes. In most cases, alignment is based on overlaying and centering one geometric pattern, such as a square or a cross, within a slightly larger one.

aration is desirable, but to minimize diffraction and divergence effects, the smaller the gap the better. Since the separation will be only a few microns, the Fraunhofer far-field approximations cannot be used to calculate diffraction effects. Rather, the more complex Fresnel equations must be solved for a complete picture of the theoretical diffraction effects.

As a rough estimate, the minimum line width  $W_m$  that can be satisfactorily printed is

$$W_m = (d\lambda)^{1/2}$$
 5.10

where  $\lambda$  is the wavelength of the light being used and d is the maskwafer separation (55). Eq. 5.10 is graphed in Fig. 5.16 for wavelengths of 436 nm, 249 nm, and 10 Å (typical of the length of X rays used in X-ray lithography). Actual line-width values will also have a dependence on the kind of resist used and the way in which it is developed. Experimental values for 249 nm UV exposure onto AZ1350J resist are also shown in the figure.

To minimize penumbra and its associated line broadening, a well-collimated (coherent) light source is desirable, but since such light also enhances diffraction effects, light a few degrees off collimation may be used in practice to strike a balance between the two effects (56). The penumbra contribution of a total beam divergence of  $2\theta$  to an image of width W is given by

$$\Delta W = 2d \tan \theta \tag{5.11}$$





Thus, if, for example, a nominal 2  $\mu$ m line is to be widened no more than 10% because of divergent light when printed with a mask-wafer spacing of 10  $\mu$ m, the angle of divergence must be less than 0.5°.

# 5.9

FUNDAMENTAL OPTICAL LIMITATIONS OF PROJECTION PRINTING

5.9.1 Resolution

The two kinds of printing discussed thus far use optical elements only in the light source, and their performance is little affected by optical lens limitations. However, the projection printing discussed in the next two sections depends on optical elements to form the printed image. Performance limitations caused by the inability to construct perfect systems are discussed in these sections. In addition, theoretically perfect optical projection systems still have limitations, some of which are discussed here.

Resolution is a measure of the ability of a system to form separate images of closely spaced objects. What constitutes a separate image depends somewhat on the use of the image. For example, if all that is required is that the eye be able to surmise that probably two objects exist, then an image intensity like the one shown in Fig. 5.17a would suffice. However, if an image is to print two separate objects on photoresist, then the image should be more like the one in Fig. 5.17b. The two objects are clearly discernible as separate images, but whereas the object had equal lines and space, the resist image has wide lines and a narrow space. If the object is to print a faithful reproduction, the image must be yet better defined, as in Fig. 5.17c. Note that because of the clipping action of the resist's exposure behavior, a square wave can be printed even though the exposing light intensity does not have square shoulders. It is this property that allows optical printing to be used for much finer geometries than was originally expected. A criterion<sup>14</sup> for resolution that is often used to

## FIGURE 5.17

Effect of increasing object separation when spacing is near diffraction limit.

| $\square$ | $\mathcal{M}$ | $\bigwedge$ | Image intensity   |
|-----------|---------------|-------------|-------------------|
|           |               |             | Developed pattern |
| (a)       | (b)           | (c)         |                   |

"Remember that this is the diffraction limit—that is, the limit for a perfect lens. Design or manufacturing flaws reduce this number. Lens manufacturers often rate their resolution limits at about 1.5 times that predicted by Eq. 5.12. describe lens capability and that corresponds roughly to Fig. 5.17a is the Rayleigh limit

$$S_r = \frac{0.6\lambda}{NA}$$
 5.12

where S, is the separation of two barely resolvable objects,  $\lambda$  is the wavelength of the light being used, and NA is the numerical aperature of the lens. NA is a lens design parameter and is given by

$$NA = n \sin \phi$$
 5.13

where *n* is the index of refraction of the medium outside the lens and  $\phi$  is the half-angle of the two most divergent rays of light that can pass through the lens. The *f*-number *F* is defined as

$$F = \frac{f}{D}$$
 5.14

where f is the focal length of the lens and D is the diameter of the entrance pupil. When the image is formed near the focal point, as in the case of printing equipment and microscopes,  $F \approx 1/2NA$ .

The modulation transfer function (MTF) of an optical system is the ratio of the modulation of an optical image (see Eq. 5.5) to that of the object. For a diffraction-limited optical system, it is a function of the spatial frequency of the object,<sup>15</sup> the light wavelength, the numerical aperture of the lens being used, and the degree of spatial coherence of the light source. Spatial frequency is often expressed in terms of lines/mm so that, for example, a series of 5  $\mu$ m lines and spaces would have a period of 10  $\mu$ m and hence would be 100 lines/ mm. The light source is spatially coherent if it is a point source and incoherent if it is infinite in extent. Practically, if the source is just large enough to fill the entrance pupil of the lens, then the light is assumed incoherent. A common description of the degree of incoherence is the ratio of the diameter of the light source to entrance pupil. Thus, a point source is 0% incoherent, and a source as large or larger than the entrance pupil is 100% incoherent.

Fig. 5.18 shows how MTF varies with spatial frequency for various degrees of light coherence (57). The two limiting cases are 0%and 100% spatial light coherence. The cutoff frequency  $\zeta$  for incoherence is given by

$$\zeta = \frac{2NA}{\lambda} = \frac{1}{F\lambda}$$

5.15

<sup>&</sup>lt;sup>15</sup>The remaining discussion assumes that the intensity of the object is sinusoidal rather than square wave.

### FIGURE 5.18

Modulation transfer function (MTF) versus normalized spatial frequency.



That is, by the time the wavelength  $L = 1/\zeta$  is reduced to  $\lambda/2NA$ , there is no contrast between the two lines. Note that this value is comparable to the Rayleigh spacing  $S_r$  of Eq. 5.12, or  $0.6\lambda/NA$ . The MTF derivation assumes a sinusoidal-varying intensity, whereas, in printing, the object and the desired image are square waves. However, since a square wave can be described by a Fourier series, it is possible to calculate how a square wave object intensity I(x) will fare by multiplying each term of the Fourier series that describes it by the appropriate MTF<sub>kw</sub> and then summing:

$$I(x) = I_0 + \Sigma \operatorname{MTF}_{kv} I_k \sin(2\pi k v x)$$
 5.16

where  $\nu$  is the fundamental spatial frequency of I(x).

EXAMPLE

Using this concept, calculate what the image intensity profile will be for a mask with 1  $\mu$ m lines and spaces projected through a lens system with an NA of 0.35, a 100% incoherence factor, and a wavelength of 436 nm.

The first four terms of the square-wave expansion are as follows:

$$I_0 + I_0\left(\frac{4}{\pi}\right)\sin\left(\frac{2\pi x}{L}\right) + I_0\left(\frac{4}{3\pi}\right)\sin\left(\frac{6\pi x}{L}\right) \\ + I_0\left(\frac{4}{5\pi}\right)\sin\left(\frac{10\pi x}{L}\right)$$

The cutoff frequency  $\zeta = (2 \times 0.35)/0.436 = 1.6/\mu m$ . The wavelength of the mask pattern is a 1  $\mu m$  line plus a 1  $\mu m$  space, or 2  $\mu m$ . The fundamental frequency is thus 0.5/ $\mu m$ , giving a normalized spatial frequency (actual frequency/cutoff frequency) of 0.31. The third harmonic is 0.93, and the fifth is 1.53. The MTF for each of these frequencies can be read from Fig. 5.18. For the lowest frequency (0.31), MTF  $\approx$  0.6. For the others, it is negligible. Thus, even though a square wave of light and dark is on the mask, only a sinusoidal light pattern will be projected onto the wafer in this case. Since the MTF for the first harmonic is 0.6, the sinewave image modulation will also be 0.6, and the normalized intensity will swing between 0.25 and 1.0 (see Problem 7 at the end of the chapter).

5.9.2 Depth of Focus An optical image degrades as the system is defocused, and the amount of defocusing that can be tolerated is called depth of focus. The amount to be tolerated of course depends on the use of the image and thus is somewhat subjective. The Rayleigh depth of field criterion is one that is widely used for comparing lenses, and it is defined as

$$\delta = \frac{\lambda}{2(NA)^2}$$
 5.17

where  $\delta$  is the depth of field. In lithography, the limiting item is how much line-width change can be tolerated. In some cases, the nonlinear resist behavior allows much more defocus tolerance than would be predicted from the Rayleigh expression, as shown in Fig. 5.19. Comparing the depth of field expression with the resolution expression shows that resolution is obtained at the expense of field depth. Thus, the higher the NA of the lens being used, the more difficult it is to keep the whole wafer properly focused. For a given resolution, a shorter wavelength and smaller NA will improve depth of field.

After the depth of field for a particular lens has been established, there remains the task of providing an environment that allows the whole wafer to stay within this depth of field during exposure. The first step is to determine each item that causes the image focus to be other than exactly on the wafer surface. Then, a budget can be assigned to each of these items and improvements made as necessary to ensure that the printing is satisfactory. A list of such items along with some typical values is given in Table 5.8. The total focus de-

5.9.3 Variation in Position of Best Focus



| TABLE 5.8                                 | Item                                                    | Typical Value      | Comments                                                                                                 |
|-------------------------------------------|---------------------------------------------------------|--------------------|----------------------------------------------------------------------------------------------------------|
| Factors Affecting Image Focus<br>on Slice | Process-Related<br>Wafer topography<br>height variation | ±lμm               | Due to metallization, vias,<br>and so on; can be reduced<br>with multilayer resist.                      |
|                                           | Incoming-Slice-Related*                                 |                    |                                                                                                          |
|                                           | Slice flatness                                          | ±3 µm              | With steppers, the area per<br>step is reduced, and<br>deviation over a step can<br>be less than 0.5 µm. |
|                                           | Slice taper (150 mm)                                    | 20 µm              | Applicable only to printers<br>with backside reference.                                                  |
|                                           | Mask-Related*                                           |                    |                                                                                                          |
|                                           | Mask bow $(5 \times 5 \text{ inch plate})$              | ±5 μm              | Printers using reduction<br>lenses reduce the effective<br>value by square of<br>reduction.              |
|                                           | Machine-Related*                                        |                    |                                                                                                          |
|                                           | Autofocus accuracy                                      | $\pm 0.25 \ \mu m$ | Stepping printers.                                                                                       |
|                                           | Best focus setting                                      | $\pm 0.05 \ \mu m$ | Applicable only to printers<br>with autofocus.                                                           |
|                                           | Chuck flatness (150 mm)                                 | ±0.3 µm            |                                                                                                          |
|                                           | Stage travel                                            | $\pm 0.05 \ \mu m$ | Per step on stepping printers.                                                                           |
|                                           | *These items are expected to                            | improve with tim   | ie.                                                                                                      |

199

viation can be estimated (58) by assuming that the values are all independent and thus can be combined by taking the square root of the sum of the squares. If there are constant, systematic errors, as, for example, the "best focus" setting, they should be linearly added to the root of the sum of the squares of the others.

The curvature of the slice-holding chuck is listed in Table 5.8, but another element of chuck flatness exists that can cause local printing defects as well as damage to the silicon itself. If the chuck is damaged, resulting in a raised burr, or if a particle is trapped between the chuck and the slice, the wafer surface may be enough out of focus to cause printing flaws. To minimize this problem, many chuck designs allow very little actual contact area between the chuck and the wafer.

As an alternative to proximity printing, which is difficult to control if the gap is narrow and which is troubled with excessive diffraction if the gap is wide, projection printing has evolved. The early machines (late 1960s) used full-wafer projection through a multipleelement lens onto 2 inch diameter wafers. It was difficult to get quality lenses that could cover even this small area, and as the slice diameter increased, alternative approaches were required. Excluding diffraction effects, which for a given wavelength and numerical aperture are independent of design, lens performance problems can be lumped into two general categories. One is excessive background light from internal reflections and scattering. This excessive light reduces the contrast of the image. The other is a series of aberrations that either blur or geometrically distort the image. These aberrations can be broadly categorized as follows:

> Spherical aberration Coma Astigmatism Field curvature Chromatic aberration Distortion

Chromatic aberration is eliminated by the use of reflective optics. The others can be made very small for an image lying in a narrow ring concentric with the optical axis of the reflecting mirrors. Despite the restriction of using only the narrow ring, a full-size mask image can be projected onto a wafer by synchronously moving the mask and wafer while the mask is illuminated by a segment of a circular band of light as shown in Fig. 5.20. In 1973, Perkin–Elmer introduced a 3 inch diameter full-wafer scanning projection printer based on this concept (59). Since that time, all full-slice printers

## 5.10 FULL-WAFER PRO-JECTION PRINTING



#### TABLE 5.9

Estimated Percentage of Wafers Printed by Various Technologies

| Year | Contact/<br>Proximity | Full-Slice<br>Projection | Steppers | E-Beam |
|------|-----------------------|--------------------------|----------|--------|
| 1975 | 90%                   | 10%                      | 0        | 0      |
| 1980 | 70%                   | 29.8%                    | 0.2%     | 0      |
| 1985 | 25%                   | 65%                      | 9.7%     | 0.3%   |
| 1990 | 15%                   | 60%                      | 24.6%    | 0.4%   |

have used that basic design, although newer large-slice machines have two primary mirrors symmetrically placed in order to further reduce distortion. The wafer size capability has been increased to 150 mm, and the resolution has been improved from 3  $\mu$ m to 1  $\mu$ m. As Table 5.9 shows, full-wafer projection printing continues to be the major method used, with contact and proximity printing decreasing and stepper printing increasing.

## 5.10.1 Operation

Because the image is scanned instead of being projected at one time, exposure time is generally described in terms of scan speed rather than total time. Greater slit widths (width of the ring of light used) allow more light to the resist. Thus, for a given resist, a faster scan speed can be used, giving a higher machine throughput. The use of wider slits allows more aberration so that resolution suffers. For example, in going from zero slit width (diffraction limit) to 2.7 mm on a Perkin–Elmer 200 series printer, the resolution changes from about 1.5  $\mu$ m to about 2  $\mu$ m (60). Also, often a light source aperture setting will change the effective *f*-number of the light source, the amount of light to the wafer, and hence the throughput. It will also affect the resolution since the size of the aperture stop changes the light coherence. As was mentioned in the discussion on pellicles, the amount of separation between the pellicle and the mask for a given degree of protection depends on the source *f*-number. To determine the impact of these parameters on image quality and throughput, manufacturers' specifications for the specific equipment must be consulted. In addition, as an aid in using their products, resist literature often includes performance curves based on some of these specialized parameters.

Regardless of whether manual or automatic focusing is used, two factors determine the accuracy of the final focus. One factor is the choice of the position of best focus. On full-wafer printers, this choice is generally determined by using a focus wedge. The focus wedge is a mask designed so that it is tilted at a known angle to the optical axis and contains a series of identical resolution patterns. After an exposure is made at what is assumed to be the proper focus. the pattern with the best image is located. The combination of its position on the mask with the known angle of inclination of the mask to the optical axis allows the position of best focus to be determined relative to the initial setting. After this position is determined as accurately as possible, the problem of reproducing it from wafer to wafer still remains. If automatic focusing is used, an error will be associated with it. With manual focusing, the problem of reproducing the best focus setting exists. Some full-slice printers position the front of the wafer against three reference pads. If buildup of resist on the pads occurs, focus will be affected. Other designs eliminate this problem by using air gage sensors instead of pads so that the wafers never contact the reference surfaces. Since wafers are almost never completely flat, if the distribution of bow changes, focusing may also need to change.

5.10.3 Registration

5.10.2 Focusing

Accuracy

Table 5.10 lists potential causes of misregistration. Misalignment is the amount of misregistration between alignment markers of differ-

**TABLE 5.10** 

Causes of Misregistration During Scan Projection Printing

Misalignment Internal mask errors Optical distortion

Magnification change Wafer expansion Mask expansion ent levels. It can occur because of inaccuracies in automatic aligning and may be exacerbated by processing steps that cause the edge slope of alignment markers to degrade. Manual alignment is subject to operator error and will, in general, vary from operator to operator. The misplacement of alignment markers with respect to the rest of the mask patterns will cause misregistration error when alignment is perfect. Unlike other mask errors, this error can be corrected by an operator's aligning by pattern geometry rather than by markers. Fig. 5.21a shows how feature overlay would appear with x-y or  $\theta$ misalignment.

Mask errors were discussed earlier in the section on mask making. Optical distortion can be pincushion, barrel, or random and will cause square arrays to appear as in Fig. 5.21b. If the same printer can be used on all levels, no overlay problem will occur. Unfortunately, this luxury is usually available only during the R&D phase. Two different types of printers—for example, steppers and full-slice printers—may have different kinds of dominant distortion and hence may overlay more poorly than two printers of similar nature. Distortion can be caused not only by errors in mirror grinding but also by temperature changes or other mechanical stresses that cause flexing of the mirror.

Magnification changes, as illustrated by Fig. 5.21c, are generally negligible compared to the others since all 1:1 projection printers are designed to be telecentric,<sup>16</sup> at least on the wafer side. Wafer or mask thermal expansion appears as an apparent magnification change. Expansion effects can be minimized by judicious mask cooling and a temperature-controlled printroom. The cooling air must be carefully filtered; otherwise, particles will collect on the mask and/or wafer and cause printing defects that can easily overshadow the expansion problems.

### FIGURE 5.21

Types of misregistration.





<sup>16</sup>A telecentric design is one that produces unity magnification even if the mask and/or wafer is substantially out of focus. Such a design does not, however, prevent image distortion when defocused. 5.11

5.11.1 Focus

## STEPPER PRINTING

As discussed in an earlier section, the capability to design and construct optical projection lenses to provide both high resolution and full coverage for large-diameter wafers does not currently exist. One method of circumventing this problem is to design lenses with a relatively small field and then print the full-wafer surface by a series of adjacent exposures.<sup>17</sup> In the late 1970s, GCA introduced a system based on their mask-making equipment that used a large, multielement lens to project an area of only 1 cm<sup>2</sup> onto a wafer at one time. A carefully constructed stage then moved the wafer to successive positions until the entire wafer was exposed.

The stepper provides improved resolution but, because of the multiple steps involved, is slower than full-slice printing. Table 5.11 summarizes characteristics of many of the lenses currently available. The table shows that these lenses have predominately a  $5 \times$  or  $10 \times$  reduction ratio and cover fields of from  $1 \times 1$  cm to  $2 \times 2$  cm. The resolutions of several of the lenses go below 1  $\mu$ m. The reduction lens used in most stepping printers is comprised of a large number of refracting elements in order to compensate for various optical aberrations. Fig. 5.22a shows the complexity to be found in such dioptric lens designs. By using a catadioptric design,<sup>18</sup> which is more appropriate to a 1:1 stepper, the lens system can be substantially simplified and still provide comparable image quality. An example, often referred to as a Wynne–Dyson design (61), is shown in Fig. 5.22b.

The reduction lens used in most steppers has an advantage over the lens used in 1:1 printing systems in that there is much less sensitivity to dust on the mask. Many particles that would be troublesome in a 1:1 printer are reduced to the point that they are below the resolution of the lens and hence will not print. The thickness of the mask itself is enough to keep particles on the back surface out of focus enough to offer reasonable protection. For the front surface, pellicles (see the section on mask protection) can be used to keep particles from being in focus. Some steppers have built-in particle detectors for periodically scanning the mask or pellicle surface for particles large enough to cause a yield loss.

Steppers can be refocused at each site so that they are relatively insensitive to wafer surface undulations even though their high-

<sup>&</sup>lt;sup>47</sup>The actual lens field is, of course, circular, but the field that includes the patterns is either square or rectangular. Hence, the effective field is the largest square or rectangle that can be inscribed in the circle.

<sup>&</sup>quot;An optical system using both refractive and reflecting elements.

| TA | BL | E | 5. | 11 |
|----|----|---|----|----|
| -  |    |   | ~  |    |

.

| Wavelength<br>(nm) | NA    | Advertised<br>Resolution | Field Size<br>(mm) | Lens<br>Reduction | Manufacture |
|--------------------|-------|--------------------------|--------------------|-------------------|-------------|
| 365                | 0.32  | 0.9                      | 14×14              | 5X                | Zeiss       |
| 365                | 0.35  | 0.75                     | 11×11              | 5X                | Tropel      |
| 365                | 0.35  | 0.75                     | 15×15              | 5X                | Tropel      |
| 365                | 0.35  | 0.8                      | $10 \times 10$     | 10X               | Nikon       |
| 365                | 0.42  | 0.7                      | $10 \times 10$     | 10X               | Zeiss       |
| 405                | 0.28  | 1.25                     | $20 \times 20$     | 5X                | Zeiss       |
| 405                | 0.30  | 1.0                      | $10 \times 10$     | 5X                | Cerco       |
| 405                | 0.35  | 1.0                      | 17×17              | 5X                | Wild        |
| 405-436            | 0.30  | 1.25                     | $10 \times 10$     | 5X                | Cerco       |
| 436                | 0.16  | 2.0                      | $30 \times 30$     | 2.5X              | Nikon       |
| 436                | 0.20  | 1.75                     | $20 \times 20$     | 5X                | Zeiss       |
| 436                | 0.24  | 1.25                     | $20 \times 20$     | 5X                | Tropel      |
| 436                | 0.28  | 1.25                     | $20 \times 20$     | 5X                | Zeiss       |
| 436                | 0.28  | 1.25                     | $10 \times 10$     | 10X               | Zeiss       |
| 436                | 0.30  | 1.2                      | $20 \times 20$     | 5X                | Nikon       |
| 436                | 0.30  | 1.1                      | $14 \times 14$     | 5X                | Zeiss       |
| 436                | 0.32  | 1.0                      | $15.5 \times 15.5$ | 5X                | Tropel      |
| 436                | 0.32  | 1.0                      | $10 \times 10$     | 10X               | Tropel      |
| 436                | 0.35  | 1.0                      | $14 \times 14$     | 5X                | Canon       |
| 436                | 0.35  | 1.0                      | 15×15              | 5X                | Nikon       |
| 436                | 0.35  | 1.0                      | $14 \times 14$     | 5X                | Tropel      |
| 436                | 0.35  | 1.0                      | $10 \times 10$     | 10X               | Zeiss       |
| 436                | 0.38  | 0.9                      | $14 \times 14$     | 5X                | Zeiss       |
| 436                | 0.38  | 0.9                      | $10 \times 10$     | 10X               | Zeiss       |
| 436                | 0.43  | 0.8                      | 15×15              | 5X                | Canon       |
| 436                | 0.48  | 0.7                      | $15 \times 15$     | 5X                | Canon       |
| 436                | 0.45  | 0.75                     | 15×15              | 5X                | Nikon       |
| 436                | 0.60  | 0.6                      | 5×5                | 10X               | Nikon       |
| 390-450            | 0.28  | 1.25                     | 27.5×10.6          | 1X                | Ultratech   |
| 390-450            | 0.315 | 1.0                      | $27.5 \times 10.6$ | 1X                | Ultratech   |
| 390-450            | 0.40  | 0.7                      | 31.1×11.1          | 1X                | Ultratech   |

Source: Data from Pieter Burggraaf, "Wafer Steppers and Lens Options," Semiconductor International, pp. 56-63, March 1986, and Pieter Burggraaf, "Stepper Lens Options for VLSI," Semiconductor International, pp. 44-49, February 1988.

### 206 CHAPTER 5 LITHOGRAPHY

### FIGURE 5.22

Cross section showing relative simplicity of a catadioptric design.



resolution lenses have a relatively small depth of field. However, without wafer leveling, automatic focus cannot compensate for the amount of taper typically found in wafers. (This item is discussed further in the section on wafer flatness.)

Three stages of alignment can be involved. The first is a rough prealignment step, which positions the wafer so that sensors can detect its alignment markers. The next is global alignment, which uses only two marks per wafer, just as do the full-wafer printers just discussed. Each site is then indexed from these markers by precision wafer-stage motion. When steppers were first introduced for making masks from  $10 \times$  reticles, screw rotation was used to determine the position of the stage, and relatively large cyclic errors occurred. Precision verniers etched in fused silica glass were next used to reduce the error, but all later machines have had laser interferometry and are much more accurate. Since global alignment is usually done with auxiliary alignment optics, if the mask position is not accurately known relative to the optical axis of the alignment optics, errors will occur. By placing alignment markers on each chip pattern, each site

### 5.11.2 Alignment

can be aligned individually. Aligning each site is more time consuming than is single global aligning and. if the added precision is not required, is often not used. However, some machines are designed to always do site-by-site alignment first, and only if an insufficient signal is available from a given site, do they resort to positioning this site from the global markers.

5.11.3 Registration In order to fully utilize the high-resolution lenses now available, a method of maintaining level-to-level registration to accuracies approaching 20% of the minimum feature size is necessary. In many cases, a lack of registration and not lack of resolution limits feature size. Various elements causing lack of registration are given in Table 5.12. While no value is given for magnification change when a dedicated machine is used, a change in barometric pressure will cause a change in the index of refraction of air. This change can cause a change in magnification of multiple-element lenses that incorporate a light path through air. The effect is usually small, but most manufacturers provide some form of compensation. A distinction must be made between machines using global alignment and machines using site-by-site alignment. Global alignment has additional sources of error and, consequently, is substantially less accurate than is siteby-site alignment. It is, however, faster to use since only one alignment per wafer rather than one per step is required. It is also a simpler and less expensive system to build into the stepper. As the data of Table 5.12 show, with the current stage of development, the range of misregistration of machines using global alignment is such

|                                                                           | Exhibited by         |                      |                      |  |  |
|---------------------------------------------------------------------------|----------------------|----------------------|----------------------|--|--|
| İtem                                                                      | Dedicated<br>Stepper | Multiple<br>Steppers | Global<br>Alignment* |  |  |
| Wafer expansion                                                           | Xt                   | X†                   |                      |  |  |
| Mask expansion                                                            | X                    | X                    |                      |  |  |
| Internal mask error                                                       | X                    | X                    |                      |  |  |
| Misalignment                                                              | $X \pm 0.1$          | $X \pm 0.2$          |                      |  |  |
| Lens distortion                                                           |                      | $X \pm 0.15$         |                      |  |  |
| Magnification change                                                      |                      | $X \pm 0.15$         |                      |  |  |
| Reticle rotation                                                          |                      |                      | $X \pm 0.2$          |  |  |
| Stage orthogonality                                                       |                      |                      | $X \pm 0.2$          |  |  |
| Stage motion precision                                                    |                      |                      | $X \pm 0.2$          |  |  |
| Gløbal offset                                                             |                      |                      | $X \pm 0.1$          |  |  |
| RMS 20 limits (µm)                                                        | $\pm 0.1$            | $\pm 0.3$            | ± 0.35               |  |  |
| *Additional errors due to glo<br>$\forall X$ is determined by temperative |                      | earlier discussion   | 5.                   |  |  |

### **TABLE 5.12**

Causes of Misregistration during Stepper Operation 5.12 ELECTRON BEAM PRINTING\*

5.12.1 Pattern Generation

5.12.2 Resolution

Electron beam exposure of resist directly on a wafer offers two potential advantages over optical methods. One advantage is that because of the short wavelength associated with an electron beam, few diffraction effects should occur at geometries currently visualized. The other advantage is that by directly scanning the E-beam over the wafer, the mask can be eliminated. The cycle time from design to completed wafer can thus be substantially reduced. Unfortunately, these advantages have not been realized in a timely fashion. Whereas optical systems can be designed that are very close to being diffraction limited, other limitations in E-beam printing have severely limited its high-resolution potential. The relatively low throughput and high expense of a quality E-beam system and attendant facilities have prevented its widespread use even in applications where a premium is placed on quick turnaround.

Since an electron beam cannot be accurately deflected over the total area of a wafer, a combination of stage-travel, electromagnetic, and electrostatic deflection is used to cover the complete wafer. In some systems, a small spot beam is swept back and forth as the stage moves in serpentine fashion over the slice. Some systems form a variable-length beam as the stage travels, and others, as in optical steppers, stop the stage and expose an area by various scanning methods and then move to the next field.

The smallest feature that can be exposed is the diameter of the electron beam. Diffraction, electron-electron interactions, and E-beam lens design all contribute to the beam diameter. The wavelength  $\lambda$  associated with an electron beam is very short and is approximately given by

$$\lambda = \frac{12.3}{\sqrt{V}}$$
 5.20

where  $\lambda$  is in Å and V is the accelerating voltage in V. Thus, a beam accelerated through 30 kV would have a wavelength of 0.07 Å and would have negligible diffraction when printing a 0.5 µm feature. Beam expansion due to electrons interacting with other electrons in the beam is dependent on the electron density and is manageable by controlling beam current. Focusing and deflection optics are a major limitation, but beam diameters of from 0.125 µm to 0.2 µm are cur-

See references 62-63.

rently available. Thus, none of these limitations would prohibit the printing of 0.5 µm features.

What is a limitation is the scattering of electrons by the resist and the underlying substrate. Not only does scattering tend to make individual exposures cover more area, but also it gives rise to proximity effects in which the scattering from one feature affects the exposure of other parts of the same feature as well as that of adjacent features (64–66). This problem is minimized by programmed reduction of beam intensity in adjacent regions and the use of as low a beam energy as possible.

5.12.3 Alignment Since a scanning beam is already available, it, along with suitable detectors, is used to locate alignment markers. In general, as in the other systems, a flat finder is first used to provide rough azimuthal orientation. Then, some coarse markers near the edge of the wafer are used to determine the position well enough for the small markers on individual chip fields to be located. Quoted overlay tolerances, which include alignment and distortion, vary from 0.15 μm to 0.5 μm.

5.13 X-RAY PRINTING

X-ray wavelengths are very much shorter than that of the UV light used in conventional lithography and thus, when they are used for exposing photoresist, have minimal diffraction effects. The wavelengths that are ordinarily used are in the 5-10 Å range. Because X rays cannot be readily focused, printing must be either contact or proximity. For the same reason, the only way to get a reasonably collimated beam is to have a small source and have the wafer far from the source. However, small-size sources limit the intensity of the X-ray beam, as do large distances. Typical values are a source diameter of 2-4 mm, a source-to-wafer distance of 30-40 cm, and a mask-wafer spacing of 10-40 µm (67-68). Fig. 5.24 is a schematic diagram of an X-ray printing system. Since the soft (low-energy, long-wavelength) X rays suffer appreciable attenuation when they pass through a 40 cm air path, either a helium-filled or vacuum chamber is provided. Because many of the masks are opaque to visible light (see the earlier section on mask making), conventional alignment systems have, in many cases, been replaced by indirect systems. Furthermore, the main purpose of X-ray lithography is to allow finer geometries to be printed than are possible with UV (probably line widths less than 0.5 µm). The finer geometries require more precise alignment and hence more elegant systems than are often found on UV systems.

### 212 CHAPTER 5 LITHOGRAPHY



5.13.1 Image Distortion

The lack of a parallel beam causes the image to be slightly larger than the mask, as shown in Fig. 5.25. The magnification varies with the mask–wafer separation, and if the separation is not uniform, image distortion will occur. A variable separation can arise from a non-flat wafer, a bowing mask, or a mask–wafer tilt. The magnification m is given by

$$m = \frac{B}{B - d}$$
 5.21

where *B* is the source-to-wafer distance and *d* is the mask-wafer separation. For a typical mask-wafer separation of 40  $\mu$ m and a 40 cm wafer-source separation, m = 1.0001. For an image covering a full 100 mm wafer, this amounts to a 10  $\mu$ m increase in image size over mask size.

Factors that affect X-ray printed line widths (see Fig. 5.25) are diffraction, shadowing from the finite-size X-ray source, leakage of X rays through the edges of the mask, and electrons generated when the X rays are absorbed in the resist. The width of the diffraction blur  $\delta_d$  is approximately (69)

$$\delta_d = 0.4 \left(\frac{\lambda d}{2}\right)^{1/2}$$

5.22

### FIGURE 5.25

Factors affecting resolution of X-ray printing.



where d is the wafer–mask separation. The width  $\delta_s$  of the penumbral shadow is

$$\delta_s = \frac{S_d d}{B}$$
 5.23

where  $S_d$  is the source diameter and B is the distance from source to wafer. Since  $d \ll B$ , the distance from the source to mask B' is assumed equal to B, and the mask edge shadow  $\delta_m$  can then be approximated as (2)

$$\delta_m = \frac{rt}{B}$$
 5.24

where r is the distance from the centerline of the source-wafer to a particular mask edge and t is the thickness of the masking material. The range  $\delta_e$  of the electrons generated in the resist has been estimated as

$$\delta_{\nu} = 10^{-23} \lambda^{-7/4} \qquad 5.25$$

where  $\lambda$  is in meters. Table 5.13 compares these factors for some typical system parameters. From the values shown, it can be con-

| TABLE 5 | 11 | 3 |
|---------|----|---|
|---------|----|---|

Values for Components of **Resolution Limitations** 

| Wavelength<br>(Å) | $\delta_d(\mu m)$ | δ, (μm) | δ <sub>m</sub> (μm) | $\delta_{e}(\mu m)$ |
|-------------------|-------------------|---------|---------------------|---------------------|
| 5                 | 0.04              | 0.3     | 0.05                | 0.19                |
| 10                | 0.057             | 0.3     | 0.05                | 0.056               |
| 20                | 0.08              | 0.3     | 0.05                | 0.017               |

cluded that for submicron geometries, the penumbral shadow can be a limiting item. Its significance can, however, be reduced by going to a much narrower gap, such as 10 µm. To minimize scattering and diffraction effects, it appears that a window in the 10-20 Å wavelength region is optimum. For shorter wavelengths, resist scattering becomes a problem; for longer ones, diffraction becomes a limitation.

Because of X-ray resist insensitivity, a high-intensity X-ray source is very desirable (70). One method of increasing power input to the target (anode) is by using a water-cooled rotating anode. The impinging electron beam is thus constantly presented with a freshly cooled surface, and the power input can be substantially greater than it is with a stationary water-cooled anode. However, some stationary targets have a thin-walled conical depression cooled by highvelocity water that can handle very-high power inputs (71) and are not plagued with the mechanical problems of rotating anodes. Pulsed laser-powered X-ray sources also have potential for X-ray lithography (69). With such systems, in which the individual pulses are from 1-100 ns long, the peak power may be hundreds of times greater than the power normally incident on the X-ray mask and resist. Thus, if the resist shows a problem with reciprocity, exposure times may be considerably longer than anticipated. The mask absorbing layer can be subjected to substantial stress and heating by such pulses, and it has been suggested that the peak energy per pulse should be limited to about 10 mJ/cm2. Synchrotron radiation is also a possible X-ray lithography source (72, 73). Conventional synchrotrons are of course very large, but the small, storage-ring models may prove practical.

5.13.4 X-Ray Steppers Because of the difficulty in producing the large-area, very-thin membranes used in X-ray masks, steppers offer some advantages. With their use, the movement to ever larger slices will pose no additional mask-making problem. However, as long as X-ray sources remain relatively weak and the resist relatively insensitive, the requirement for many exposures per wafer will dictate a low throughput.

## 5.13.3 X-Ray Sources

5.14

ION BEAM PRINTING 5.14.1 Focused Ion Beams

## 5.14.2 Masked Ion Beam Printing

Focused ion beam printing (74, 75) uses a focused beam of ions, accelerated to perhaps 150 kV, that is scanned over a small area just as in E-beam lithography. Currently, the available ion beam density is much lower than the electron beam density. However, because the ions are so much heavier than the electrons, resist sensitivity is much greater for ions. The net result is that exposure times are comparable. Ions that have been used include protons and doubly charged silicon. A double-charged ion has the advantage of providing greater resist penetration for the same accelerating voltage.

As an alternative to E-beam exposure, focused ion beams appear to offer several advantages. Diffraction effects, as in E-beam optics, are negligible, because of the higher mass of ions, scattering of the beam by the resist is minimal, and no problem occurs with the proximity effect. Beams of 0.1  $\mu$ m diameter can be formed and can, in principle, be printed. Printing speeds may be substantially slower than those of E-beam systems, however, because of the slower scan speeds associated with longer ion beam transit times. To prevent charge buildup from causing a widening of the beam and thus setting a lower limit on line width, conductive resists may also be necessary.

Masked ion beam printing, analogous to optical proximity printing, E-beam flood-printing, or X-ray printing, uses a beam of 1-2 cm2 to sweep over a mask and expose the resist beneath (76, 77). It avoids difficulties in precision ion beam focusing and location but introduces the problem of masking. Some way of patterning the mask (presumably E-beam) is required. Either a stencil mask (holes in an ion beam impervious material) or an impervious material supported on a thin ion beam transparent material may be used. The stencil mask requires either that no ring-and-dot type geometries be used or that a complementary mask pair be used. The thin membrane approach has all of the problems associated with X-ray masks plus scattering of the beam by thin amorphous materials. One solution to scattering is to use a thin silicon membrane oriented so that channeling can occur. Alignment, as with X-ray printing, is more difficult than it is with either optical or E-beam stepping and must be done indirectly.

## 5.15 PRINTER SELECTION

Many factors must be considered in choosing the most suitable printer for a particular operation. The final choice should be one that satisfies the technical requirements, achieves a satisfactory chip cost, and is compatible with the long-term plans for the wafer fabTABLE 5.14

Factors Affecting Choice of Printer

Initial Investment Printer cost Printer facility cost Mask retooling cost Printer Operating Cost Repair and maintenance cost Mask cost Operator labor Depreciation Facility upkeep Printer Throughput Inherent speed Uptime Setup time Process Applicability Wafer diameter Chip size Defects generated by printer relative to defects in incoming mask Process yield of each printer process Compatibility of printer processes Resolution of printer

Time until obsolescence

5.16

LINE-WIDTH CONTROL rication facility. Table 5.14 lists factors that must be considered. The first three groups can be classified as purely economic; the last group contains technical issues. Of prime importance in printer selection is technical capability: Immediately rule out machines that cannot print wafers of the desired diameter with the required resolution. Fig. 5.26 shows the current (1988) resolution limits for each type of printing technology.

Most of the remaining items contained in the four groups are interrelated so that numerous trade-offs must be evaluated. Which ones are most important will vary with the situation, but as examples, consider the following. Contact printing is capable of good resolution, high throughput, and low downtime but has high mask usage, high defect density, and marginal overlay capability. Initial cost and operating cost are low: the yield will, however, be low because of the high defect density introduced; and the mask cost could be high. E-beam machines and their associated facilities are very expensive, and the throughput is low. However, they allow quick turnaround from design to wafer printing, and the printing defects are low. Stepper throughput is low relative to full-wafer scanners, but scanner mask costs for large-diameter wafers may be excessive.

In some cases, the use of two or more different kinds of printers in the same wafer-fab area, each assigned to its own particular levels, may be advantageous. For example, steppers might be used on levels requiring the smallest feature size and full-wafer scanners on the others. If there is some level, such as the final lead pattern of a gate array, that is customized and needs quick turnaround for good customer responsiveness, then it might be printed with E-beam machines. This approach, referred to as "mix and match" makes use of the strengths of each printer type. Its successful implementation requires that differences in distortion between the machines are acceptable and that the mask alignment markers are compatible. In some cases, multiple sets of markers may be required since, for example, stepper site-by-site alignment requires a marker on each die and full-wafer printers need only two markers per wafer. Also, printers by different manufacturers may use completely different alignment sensing. For example, one might use Fresnel zones and another the scanning of crosses.

As mentioned earlier, providing line-width control is one of two major goals of a good lithography process (the other is maintaining adherent pinhole-free resist layers). Control is lost when the feature size becomes resolution limited, and this aspect has already been discussed. However, even if the feature size is well above the resolution limit, several things limit size reproducibility. Ordinarily,

### FIGURE 5.26

Printing technology versus current (1988) minimum feature size.



 $\pm$  10% of a line width on an IC is considered satisfactory, and processes are designed with this percentage as a minimum goal. Not all of this tolerance can be used for the pattern width variation since pattern undercutting and initial mask tolerance all contribute to the final feature size. Undercutting can be both an etching and a lithographic problem since undercutting is accentuated if the resist peels during etching. Mask control 2 $\sigma$  limits are typically 0.2  $\mu$ m plate-toplate and 0.1  $\mu$ m within a plate (78). Thus, for features of 2  $\mu$ m and less, all of the desired tolerance is used in the mask alone when 1:1 printing is done. However, when pattern reduction and stepping printing are used, the numerical (not the percentage) variation of the image line width is reduced by the same factor as the image reduction being used.

d Table 5.15 lists processing factors that affect line width. In addition, the table shows, for positive resist, the effect of a change in these factors on feature size. Not shown in the table is the effect of resist and developer compositional variability. A distribution of line widths will occur just because of this normal variability. Moreover, on occasion, errors in formulation that occur must be considered. When they do, line width may change, and such changes must be separated from the variables listed in Table 5.15. In order to study how the various items shown in the table affect line width, curves can be constructed that plot line width versus each of the individual variables, with all of the others held constant, and the sensitivity determined. These kinds of curves are referenced in Table 5.15 and

## 5.16.1 Process-Initiated Control Problems

#### **TABLE 5.15**

### Process Variables Affecting Feature Width

| Item                              | Direction of<br>Change | Effect on Oxide Feature Width*<br>for Positive Resist | Reference  |
|-----------------------------------|------------------------|-------------------------------------------------------|------------|
| Exposure time                     | *                      | +                                                     | Fig. 5.27  |
| Develop time                      | ÷                      | +                                                     | Fig. 5,29  |
| Developer concentration           | +                      | +                                                     | Fig. 5.30  |
| Time between exposure and develop | +                      | -                                                     | Fig. 5.31  |
| Resist thickness                  | +                      |                                                       | Fig. 5.32  |
| Printer defocus                   | +                      | +                                                     | Fig. 5.33‡ |
| Soft-bake time or temperature     | -                      | +                                                     |            |
| Hard-bake temperature             | +                      | \$.                                                   |            |
| Resist adhesion                   | -                      | +                                                     |            |
| Printer NA                        | +                      | +                                                     |            |

\*Note that if the feature is a cut in an oxide, a negative value for resist removal, as shown in Fig. 5.27, gives a larger feature. However, if a metal pattern is being defined, a negative value means a narrower lead line width,

†Depending on the amount of exposure, the line width can widen or narrow.

The intensity changes described in this graph must be translated into line-width changes.

\$If the resist flows, the line will be too narrow; if the high temperature causes gas to be trapped and poor adhesion, the line will be wider.

This item is not the maximum system NA but rather that set by the aperture actually used during exposure.

show, for example, line width versus resist thickness. Alternatively, a "best" value for some particular variable can be plotted against a range of some other variable. An example would be the best exposure time as the resist thickness is varied. The first approach is most useful in developing a wide-latitude process, while the latter is more useful in providing operational information for a factory. Unfortunately, neither approach gives a clear picture of the interactions between multiple variables. For example, depending on the soft-bake time or temperature, the best exposure time versus resist thickness curve may vary substantially. One approach to clarifying the situation, and one that minimizes the number of experiments, is to use a set of statistically designed experiments to examine the interactions among a whole series of variables (79). The series of curves shown in the next few sections (and referred to in Table 5.15) are of the "change one variable at a time" variety in order to give more insight into the individual process variables.

The amount of exposure will affect line width, and a plot of line width versus exposure is often used as the criterion for proper exposure. Alternatively, a resolution test pattern can be exposed for various times and the exposure chosen on the basis of best resolution. The energy for proper exposure is in the 10 mJ/cm<sup>2</sup> range for negative resist and in the 40 to 60 mJ/cm<sup>2</sup> range for positive resist. Table 5.16 gives conversion factors to other units.

For most negative resists, underexposure will cause poor resist adherence and the resist lines to be narrower than normal, while overexposure will cause them to be wider. For positive resists, the opposite is true; overexposure produces narrower lines.<sup>20</sup> Typical data for positive resists are given in Fig. 5.27. In working with a specific resist, curves applicable to this particular resist should be used. Because of the nonlinearity of the exposure/line-width curve, the sensitivity of line width to exposure is less for line widths less than nominal. Thus, better control can be achieved by using an un-



### FIGURE 5.27

Location of resist edge versus amount of exposure for positive resist, 1.8 µm thick. (Source: Data from Terry V. Nordstrom, Semiconductor International, p. 158, September 1985.)



<sup>&</sup>lt;sup>20</sup>Note that these are narrower *resist* lines. In the literature, a *line width* will usually refer to the etched feature and not to the resist.

dersized mask (80). Because of interference effects changing the amount of energy absorbed by the resist, the correct exposure and, in turn, the line width can be very sensitive to rather small changes in resist and oxide thicknesses. The kinds of intensity changes to be expected are discussed in a later section. When lines of different widths are being exposed simultaneously by projection printing, the narrower ones may receive substantially less energy per unit area and hence be underexposed. The magnitude to be expected is shown in Fig. 5.28. The smaller the NA of the lens, the sooner the effect becomes noticeable. The decreasing intensity arises because more and more of the light incident on a slit is diffracted outside its boundary as it becomes narrower. Behavior for other lens and line-width combinations can be calculated from Eq. 5.16.

The development time affects the line width as shown in Fig. 5.29 for positive resist. Note that at the nominal size opening (where the resist size matches the mask size), the change in size with time is very rapid. Better line-width control can often be maintained by making the mask opening somewhat smaller and by developing longer (81). The strength of the developer (the dilution) can affect line width. Fig. 5.30 shows data for positive resist. Even the length of time between exposure and develop has an effect on line width, as Fig. 5.31 shows, which again underscores the fact that cycle time through the lithography process should be minimized. For a given exposure, changing resist thickness will change line width, as shown in Fig. 5.32. For this reason, resist thickness must be kept under good control.

When the image is defocused, it becomes smeared, and linewidth control deteriorates. Therefore, much attention is paid to



### FIGURE 5.29

FIGURE 5.30

Effect of developer dilution on location of resist edge for

AZ1350J positive resist, 2 µm

thick, 50 s exposure, 60 s de-

velop. (Source: Data from David

J. Elliot, Solid State Technology,

p. 66, September 1977.)

Location of resist edge versus development time for AZ1350J positive resist, 580 nm thick, developed with 1:1 AZ developer: H<sub>2</sub>O. (*Source:* Data from F.H. Dill et al., *IEEE Trans. on Electron Dev. ED-22*, p. 456, 1975.)



depth of focus of the lenses being used and to the flatness of the wafers being printed. Fortunately, because of the resist characteristics, the useful depth is often substantially greater than the Rayleigh criterion of Eq. 5.17. Fig. 5.33 shows how defocusing changes the light intensity near the mask edge. The idealized curve shows the abrupt light-dark transition that is usually visualized. As shown

Elm Exhibit 2159, Page 238



Resist thickness (µm)

by the effect of diffraction curve, the transition is always somewhat sloped. Defocusing makes the curve even more sloping. However, a pivot point (A of Fig. 5.33) somewhere between I = 0.25 and I = 0.5 (depending on the light coherence) is independent of the amount of defocus. Thus, if the exposing intensity and resist can be matched so that the resist cutoff energy ( $E_f$  of Fig. 5.8) equals the intensity  $I_f$ 



at point A times the exposure time, then over a broad range of defocus, no change in line width should occur. Increasing exposure energy causes  $I_f$  to be reached lower on the intensity axis so that it will move from C to A to B. If, for the exposure time chosen,  $I_f$  is at B, a change from the idealized to the defocused curve causes this intensity to be reached well under the edge of the mask, and a negative resist line will widen with defocus. If  $E_f$  is at C, then the line will narrow. Experimental data are given in Fig. 5.34. The two exposure times of 5.3 s and 8.3 s bracket the pivot point A. Other data not shown on this graph substantiate the thesis that, with  $I_i$  set at point A, a substantial amount of defocus can occur with negligible line broadening. There are limits, of course. Fig. 5.33 is idealized and will not hold over an infinite range of defocus. Furthermore, as the slope decreases, the position of the resist boundary becomes less defined.

## 5.16.2 Interference Effects

### FIGURE 5.35

Effect of standing waves on the cross section of developed resist.



The use of monochromatic light21 to expose the resist, coupled with a highly reflecting surface such as silicon or metal beneath it, can give standing waves in the resist (82-85). When standing waves do exist, the light intensity through the resist in a direction normal to the wafer surface will vary in a cyclic fashion, causing resist development normal to the surface to vary in a similar fashion and leading to undulating instead of straight walls as shown schematically in Fig. 5.35. Since there is a minimum in intensity of a standing wave at a highly reflective surface, adjacent to this surface, the resist will always be underexposed. If an oxide is between the reflecting surface and the resist, the intensity of the standing wave at the oxide-resist interface will depend on the thickness of the oxide. It can range from minimum to maximum intensity, and since it does depend on the oxide thickness, it is often possible to choose the oxide thickness so that maximum intensity occurs at the interface. In this way, the resist next to the oxide is more likely to be fully exposed. While printers using broadband illumination are, in principle, not subject to standing waves, the use of a resist with a narrow spectral transmissivity can still cause them.

The E-field  $E_l$  of the incident light can be expressed as

$$E_t = E_0 e^{i\omega t} e^{-2\pi i N z/\lambda}$$
5.26

where N is the complex index of refraction n - ik,  $\lambda$  is the wavelength in vacuum, and k is given by  $\alpha\lambda/4\pi$ , where  $\alpha$  is the absorption coefficient. The n, k, and  $\alpha$  values for various materials encountered in lithography are given in Table 5.17. The wave is considered to be traveling in the -z direction, which is normal to, and toward, the wafer surface. The primary reflecting surface (the silicon or metal) is at z = 0.

<sup>&</sup>lt;sup>21</sup>High-performance lenses using all refractive elements can only be designed for use with monochromatic light. Lenses using partially reflecting optics, such as full-wafer scanners and 1:1 steppers, can use a broader light spectrum and are much less susceptible to interference effects.

| Material | n |
|----------|---|

**TABLE 5.17** 

Index of Refraction and Absorption Coefficient

| Material                  | n    | $\alpha$ (/cm)      | k    | Wavelength  | Reference |
|---------------------------|------|---------------------|------|-------------|-----------|
| Si                        | 4.6  | $2 \times 10^4$     | 0.07 | 450 nm      | (1)       |
| Au                        | 1.4  | $5 \times 10^{5}$   | 1.88 | 450 nm      | (2)       |
| Al                        | 0.49 | $1.2 \times 10^{6}$ | 4.32 | 450 nm      | (2)       |
| Cr                        | 0.6  |                     | 1.55 | (not known) | (3)       |
| Air                       | 1.0  |                     |      |             |           |
| SiO <sub>2</sub>          | 1.46 |                     |      | 450 nm      |           |
| AZ1350J resist            | 1.68 |                     | 0.02 | 450 nm      | (4)       |
| KTFR (negative resist)    | 1,54 |                     |      | Na light    | (5)       |
| KMER (negative<br>resist) | 1.55 |                     |      | Na light    | (5)       |
|                           |      |                     |      |             |           |

References: (1) H.R. Philipp and E.A. Taft, Phys. Rev. 120, p. 37, 1960, (2) O.S. Heavens, Optical Properties of Thin Films, Dover, New York, 1965. (3) Reference in J.H. Altman and H.C. Schmitt, Proc. Kodak Photoresist Seminar, p. 12, Los Angeles, 1968. (4) Frederick H. Dill et al., IEEE Trans. Electron Dev. ED-22, p. 456, 1975. (5) Kenneth G. Clark, Solid State Technology, pp. 52–56, June 1971.

Instead of exponential notation, the following expressions can also be used. First,

$$E_l = E_0 \cos(\omega t - \beta z) \qquad 5.27$$

And, the E-field  $E_R$  of the wave reflected back at the silicon or metal interface is given by

$$E_R = RE_0 \cos(\omega t + \beta z)$$
 5.28

where  $\beta = 2\pi (n - ik)/\lambda$  and R is the reflection coefficient such that

$$R = \left(\frac{N_1 - N_2}{N_1 + N_2}\right)^2$$
 5.29

 $N_2$  is the complex index for the medium doing the reflecting, and  $N_1$  is for the medium in which the wave is traveling. In the case of either oxide or resist, k is small and, for purposes of Eq. 5.28, can be disregarded. Thus,  $\beta$  is real and given by  $2\pi n/\lambda$ .

When  $E_l$  and  $E_R$  are added together, they have a standing wave component

$$E_{s} = E_{so} \sin \omega t \sin \left( \frac{2\pi nz}{\lambda} + \theta \right)$$
 5.30

It has the same wavelength  $\lambda/n$  as the incoming wave, but its phase  $\theta$  may not match that of either  $E_I$  or  $E_R$ , as is shown in Fig. 5.36.



The minima and maxima are each separated by  $\lambda/2n$ . The minima will occur whenever  $\sin(2\pi nz/\lambda + \theta) = 0$  —that is, when

$$\frac{2\pi nz}{\lambda} + \theta = m\pi \qquad 5.31$$

where *m* is zero or a positive integer. If the phase shift is  $180^{\circ}(\pi)$ , as is the case of Si or GaAs or when the reflector is metallic and perfect, the first minimum will be at the interface, and the others will be located at

$$z = \frac{m\lambda}{2n}$$
 5.32

Similarly, the maxima (antinodes) will be located at

$$z = \frac{(2m+1)\lambda}{4n}$$
 5.33

Otherwise, the position of the first minimum will be given by

$$z = \frac{\lambda \theta}{2\pi n}$$
 5.34

where 0 is given by

$$\theta = \frac{\sin(-2\pi nz/\lambda) + r \sin(2\pi nz/\lambda + \phi)}{\cos(-2\pi nz/\lambda) + \cos(2\pi nz/\lambda + \phi)}$$

The intensity I of the light is proportional to the square of the E-field so that I(z) is given by

 $I(z) = I_a \sin^2 \beta z \quad \text{or} \quad I_a \sin^2 (\beta z + \phi) \qquad 5.35$ 

depending on whether the reflector is silicon or metallization.

Resist that is exposed by light with an intensity varying in the manner of Eq. 5.33 will reflect this variation in the manner that it develops. The resist near the wafer interface will be poorly exposed. If it is positive resist, complete removal may not be achieved, and bridging will occur between closely spaced resist lines. If the resist is negative, the bottom part, being underexposed, may adhere poorly as well as being undercut during development. If the resist line is narrow, undercutting may be severe enough to allow the whole line to lift off. Depending on the resist thickness, there may be anything from maximum to minimum intensity at the surface. Thus, the top layer may range from fully developed to underdeveloped.

When the waves reflected from the wafer surface that are rereflected from the resist are considered, it is found that, while the location of the nodes of the standing wave does not change, the amplitude may. It will depend on the distance between the two reflecting surfaces.

If the removal rate of resist is assumed to be proportional to the product of I(z) and exposure time, the removal rate at any depth below the original surface can be calculated. Numerical computer analysis techniques based on this approach have been developed (86) that give good simulations of the experimentally observed profiles of resist exposed by standing waves.

When ordinary resist lies directly on silicon or metal, the position of the first node cannot be adjusted, and without additional precautions, a layer of underdeveloped resist near the interface must be accepted. If the resist is on top of a layer of oxide, the optical properties of the two are so close that the standing wave can be considered to be continuous across the oxide-resist interface. If the thickness of the oxide is  $z_1$ , there will be an antinode at its surface if  $z_1 = (2m + 1)\lambda/4n$  and a node if  $z_1 = m\lambda/2n$ . Thus, when resist is on top of an oxide, the oxide thickness can be varied to give an antinode at the interface and ensure proper exposure at the interface.

## 5.16.3 Minimizing Effect of Standing Waves

By using a very thin highly absorbing layer between the substrate and the resist, standing waves can be eliminated. This technique is not widely used, however. Dye can be incorporated into the resist to absorb additional light energy and minimize standing waves (87) and light scattering from rough surfaces. Some commercially available resists have such a dye in them. The incorporation of dye may cause the resist sidewalls to be excessively tapered during development. However, by using an additional additive that forms a thin patternable skin on the top surface of the resist during softbake, nearly straight sidewalls can be maintained (88).

A bake after exposure, but before developing, can cause the sensitizer to diffuse enough to smooth out the undulations so that the profile is not so ragged and so that the bottom of the resist layer will develop properly (89). A light plasma etch can be used after development to remove any scum left in the bottom of windows due to poor exposure and development.

Line-width variations will usually occur where resist lines cross steps. These variations have been ascribed to light scattering at the step exposing resist in unexpected regions and to changes in the interference effects causing variable development (90). In either case, reducing reflectivity helps, as does making the resist thicker. Reflectivity of metal or polysilicon can sometimes be decreased by changing the grain structure and surface roughness. Care must be taken to ensure that such an approach does not adversely affect the electrical current carrying capability of the conductor. A complete, albeit more complex, solution is to use multilayer resist as described next.

Topographical changes in going from the top level of metallization through a contact opening down to the silicon surface can easily be a micron on wafers with multilevel metallization. Such large distances use up an appreciable part of the focus budget when printing is done with high *NA* lenses and must be avoided whenever possible. Exposure difficulties arise in trying to pattern metal going over high steps, and thick layers of resist are needed to prevent resist failure at steps. Thick resist, in general, gives poorer optical resolution than does thin resist, and in the case of E-beam printing, thick resist presents more resist to scatter the electrons and thus reduce resolution. To simultaneously correct these problems and minimize problems with reflective substrates, multilayered resist can be used (91–95).

In this process, shown in Fig. 5.37, a thick layer of one kind of resist is applied to the wafer to smooth out surface variations. A second layer of a thin resist of different composition and spectral

5.16.4 Resist over Steps

5.16.5 Multilayer Resist



response is then added. Because of the different spectral responses. the pattern in the top layer can be exposed independently of the bottom one. After development, the pattern is then transferred to the bottom layer. A major problem with this approach is interfacial mixing of the resists during processing. To minimize this difficulty, the basic "bilayer" approach is sometimes modified by adding a third layer ("trilayer" approach) of a relatively inert material such as plasma-deposited polysilicon between the two organic resists. After patterning the top layer, conventional etching can be used to remove the middle layer. In either process, the bottom layer can be patterned by plasma etching or by using the top layer as a contact mask, flood-exposing with the proper wavelength light, and developing.

Tables 5.18 and 5.19 show two bilayer<sup>22</sup> and one trilayer<sup>22</sup> flow and illustrate different approaches for minimizing interactions between the resist layers and for transferring the pattern from the top to the bottom layer.

As lateral feature size becomes smaller and approaches the vertical step size, more attention must be paid to the manner in which the thick bottom layer covers the surface contours. Viscosity is important, as well as thermal flow during baking operations. The geometric layout of the features can affect the way the resist flows during its application, and indeed it may be helpful to add dummy geometries in some cases in order to promote more uniform coverage (96).

<sup>&</sup>quot;Sometimes, these flows are referred to as "bilevel" and "trilevel," and sometimes the general approach is referred to as "portable conformal masking."

TABLE 5.18a

| Bilayer | Resist | Flow | 1 | (91) |  |
|---------|--------|------|---|------|--|
|---------|--------|------|---|------|--|

| Step | Operation                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
|------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1    | Spin on PMMA 1.5-2 µm thick.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| 2    | Bake 60 minutes at 150°C.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| 3a*  | Spin on 0.5-0.9 µm of Hunt HPR 204 resist; bake.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| 3b*  | Or: Apply layer of germanium selenide inorganic resist.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| 4    | Expose and develop top resist.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
| 5    | Flood-expose PMMA with deep UV light.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 6    | Develop PMMA.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
|      | 그는 사람이 집에 집에 집에 집에 있는 것이 같은 것이 없는 것이 가지 않는 것이 없다. 것이 많은 것이 없는 것이 없는 것이 없는 것이 없는 것이 없는 것이 없는 것이 없다. 것이 없는 것 않이 |

\*Both of these resists are sensitive to light in the 400 nm range and opaque in the 200-250 nm range where PMMA is most sensitive.

| TABLE | 5.18 | b |
|-------|------|---|
|       |      |   |

Bilayer Resist Flow 2 (93)

| Step | Operation                                                                  |  |
|------|----------------------------------------------------------------------------|--|
| 1    | Spin on polyimide for a postbake thickness of 1.3 µm.                      |  |
| 2    | Bake 60 minutes at 350°C.                                                  |  |
| 3    | Spin on silicon containing positive resist; bake.                          |  |
| 4    | Expose and then develop with sodium hydroxide solution.                    |  |
| 5    | Transfer resist pattern to polyimide using oxygen<br>reactive ion etching. |  |

## TABLE 5.19

Trilayer Resist Flow (92)

| Step | Operation                                                  |  |  |
|------|------------------------------------------------------------|--|--|
| 1    | Apply HMDS.                                                |  |  |
| 2    | Spin on PMMA 1 µm thick.                                   |  |  |
| 3    | Bake 15 minutes at 85°C.                                   |  |  |
| 4    | Spin on PMMA 1 µm thick.                                   |  |  |
| 5    | Bake 60 minutes at 160°C.                                  |  |  |
| 6    | Apply Si layer by plasma CVD or by E-beam evaporation.*    |  |  |
| 7    | Bake 60 minutes in air at 100°C.                           |  |  |
| 8    | Apply HMDS.                                                |  |  |
| 9    | Spin on 0.45 µm of thinned AZ1350J resist.                 |  |  |
| 10   | Bake 15 minutes at 85°C.                                   |  |  |
| 11   | Expose and develop AZ1350J.                                |  |  |
| 12   | Bake 30 minutes at 100°C.                                  |  |  |
| 13   | Reactive ion etch silicon in CF <sub>4</sub> .             |  |  |
| 14   | Reactive ion etch PMMA with O2 or flood-expose and develop |  |  |

layer.

# 5.17 WAFER FLATNESS

Because wafer flatness can be one of the larger contributors to focus problems, some attention must be given to it. Wafer flatness may be described in several different ways, and some are better suited than others for use with a particular printing machine. Flatness measurements are sometimes made at various stages of processing to see whether additional warpage is being introduced, but most measurements are made on the slice just after it is polished.

Total indicated runout (TIR), also sometimes called total indicated reading, is made while the back of the slice is clamped to a flat surface such as a vacuum chuck. As shown in Fig. 5.38a, it is the difference between the highest point above a selected reference plane lying in the front surface of the slice and the lowest point below the reference plane. This measurement is most useful for wafers to be used for contact or proximity printing.

A similar measurement is the nonlinear thickness variation

FIGURE 5.38

Methods of defining wafer flatness.



(a) TIR (total indicated runout) or PV (peak to valley)



(b) NTV (nonlinear thickness variation)



(c) FPD (focal plane deviation)

(NTV). However, instead of scans over the whole slice, measurements are made at five points, and NTV is calculated as shown in Fig. 5.38b.

Focal plane deviation (FPD) has been tailored to the evaluation of slices to be used with full-wafer scanning projection printers. Instead of using an arbitrary reference plane, it uses the plane established by a replica of the pins the front surface of the wafer is held against during printing. FPD is defined as the maximum deviation (+ or -) between the slice surface and the reference plane (Fig. 5.38c). It can range in value from (TIR)/2 to TIR, depending on the shape of the surface. Its utility is that, in principle, if the allowable depth of field is less than FPD, then the whole wafer will print. The surface contour of a wafer will have either the general appearance shown in Fig. 5.39a or the one shown in Fig. 5.39b. Then, if the printer is adjusted so that the plane of best focus lies in the plane of the reference pins, both wafers should print satisfactorily if neither exceeds the FPD limit. If, however, an "a" wafer were used to set focus, the plane of best focus would be as shown by the horizontal dashed line rather than in the plane defined by the reference pins. All "a" wafers meeting the FPD spec will be well within the focus tolerance, but a "b" wafer can be substantially out of focus, as shown in Fig. 5.39b. The polishing process determines the relative



 $(A - C)^2 + (B - D)^2$ 

.C

D

Back reference plane

Local slope =

C

(b) Local slope (the vector sum of orthogonal pairs

of elevations from a reference plane divided by the pair separation)

numbers of "a" and "b" wafers, but over a long period of time, the numbers appear to be about equal.

If a printer is being used that references from the back of the wafer, a measurement that takes both wafer taper and front surface contour into account is required. Total thickness variation (TTV), which is the difference between the high and low points on the front surface of the wafer as measured while the back is held clamped to a reference plane, has been used. As can be seen from Fig. 5.40a, the front surface variation is generally overshadowed by wafer taper. For application to steppers, which are the only machines that use backside reference (and not all of them do), two other measurements have been used. They are local slope (LS) and local FPD (LFPD). The requirement for such information can be seen from Fig. 5.41. Even though a stepper may expose only 1 cm<sup>2</sup> at a time, if there were 10 µm of taper across a 100 mm slice and the back of the slice was the reference surface, the variation in height over each site would be 1 µm. Local slope at a particular location is determined by taking four closely spaced slice elevation readings, as shown in Fig. 5.40b. The total change in elevation over a stepper site is then approximated by the product of LS and the step size. Because of the sensitivity of LS to the separation distance, local FPD is a better measurement. In local FPD, shown in Fig. 5.40c, a

> Front surface

> > D



Flatness measurements that include taper.







(c) Local FPD (the FPD of each stepper site referenced against a plane through the center of each site)





reference plane through the center of each stepper site is used. The center is chosen because stepper autofocusing is done on the center of each site. Flatness measuring machines are available that make LFPD measurements on the same sites to be used in actual printing so that wafers can be precisely evaluated.

While using desk-top computers to model various steps of the lithography process is very convenient, modeling of the whole process is much more complex and requires more computing power. A widely used integrated model is SAMPLE (simulation and modeling of profiles for lithography and etching), available from the University of California at Berkeley (33, 85, 91). SAMPLE combines a model of resist behavior during development with optical models of the printing equipment to predict properties such as line width and edge slope. Because of the unpredictability of negative resist swelling, the model has generally been restricted to organic positive resist and to inorganic resists.

Positive resist can be considered to be composed of a base resin and a photosensitive inhibitor that breaks down under the influence of light and that prevents resist dissolution when present. The optical behavior can be characterized by three constants (92). One, C, relates the rate of destruction of inhibitor at some point z within the resist to the intensity of light I at z. That is,

$$\frac{-\partial M(z,t)}{\partial t} = I(z,t)M(z,t)C \qquad 5.36$$

where M is the relative concentration of inhibitor at time t. The other two constants A and B relate the optical absorption coefficient to the fractional amount of inhibitor remaining:

5.18 LITHOGRAPHY PROCESS MODELING TABLE 5.20

SAMPLE Inputs

| P | rinter lens NA                                                                                      |
|---|-----------------------------------------------------------------------------------------------------|
| 1 | lluminator coherence                                                                                |
| A | Amount of defocus                                                                                   |
| C | Optical properties of substrate (could, for example, be silicon, polysilicon, or metal)             |
| C | Optical properties of thin-films between wafer and resist (for example SiO <sub>2</sub> or nitride) |
| T | hickness of thin films                                                                              |
| R | Resist optical properties                                                                           |
| R | Resist thickness                                                                                    |
| E | Exposure dose (and wavelengths)                                                                     |
| R | Resist development properties (constants discussed earlier)                                         |
|   | Development time                                                                                    |

$$\alpha = AM(z,t) + B \qquad 5.37$$

The optical printer is described in terms of the lens MTF, which can be calculated from the wavelength and numerical aperture of the lens. Alternatively, a measured MTF can be used by the program. Using the MTF, the optical properties of the resist and any other layers present, and the underlying reflecting surface, the intensity of the light at any point z can be calculated. From I(z,t) and the total exposure time, the amount of inhibitor remaining in the resist is calculated. An etching model (92) is then used to calculate the profile of the resist remaining after development (which is really a dissolution or etching process).

Inputs required for the simulator are given in Table 5.20. From these inputs, the actual line widths can be calculated, as well as the vertical profile of the resist (as in Fig. 5.35). Thus, the effect of any of the variables of Table 5.20 can be quickly modeled. One can, for example, study the effect of defocus on line width or the effect of oxide thickness on edge scalloping.

SAFETY 5

Since the conventional short-arc UV bulb operates at 35–40 atm, the chance of a bulb's exploding always exists. Under normal circumstances, it will be shielded by the light collection optics so that the only damage from an explosion is to the optics. When cool, mercury bulbs have less than an atmosphere of pressure and hence pose little safety hazard during handling and installation. Xenon/mercury bulbs may have a few atmospheres of pressure when cold and thus must be handled with care.

High-intensity UV generates ozone in the air, but with normal

ventilation, the ozone should not cause a health problem. The odor threshold for ozone is about 0.01 ppm in air, whereas the OSHAproposed maximum allowable for an 8 hour period is 0.10 ppm.

Since the high-intensity UV itself can be an eye hazard, caution should be exercised to ensure that shielding is always in place. Particular care should be taken when the high-power excimer-laser sources are used. It should also be remembered that the eye does not see in the UV range so that if the visible portions are filtered out or if lasers are being used, there will be no warning prior to a burned cornea.

Most of the solvents used in and with resists are combustible and should be kept away from open flames. Where they are used in quantity, special ventilation is necessary. With prolonged contact, some of the solvents may cause skin irritation. The toxicity of the newer deep UV resists should be carefully reviewed before using.

The possibility of pressure buildup in containers should not be overlooked. In particular, the diazoquinone in positive resists slowly decomposes to give off nitrogen, particularly when it is stored above 70°F. Storage at 50°F is recommended by some manufacturers.

# KEY IDEAS 5

- Large-feature geometry photomasks are made with optical pattern generators because of their speed. E-beam writing, while slow, is predominately used for the smallfeature geometry masks, although laser optical generators show promise as an alternative.
- Masks for X-ray lithography are very difficult to make and currently are a major limitation to X-ray usage.
- Contact printing, while inexpensive and capable of printing very fine geometries, is practically obsolete because of its inability to print defect-free patterns.
- E-beam direct wafer writing is capable of the finest feature size definition currently visualized, but it is so slow as to be economically undesirable in most cases.
  - Optical projection printing is capable of printing features at least as small as 0.75 µm and is a relatively economical process.

- Because of mask defects and lack of wafer flatness, the full resolution of printer lenses is seldom used.
- Negative resists are generally at least five times more sensitive than positive resists.
- Positive resists do not exhibit swelling during development and thus are preferred over negative resists when the spacings between long resist runs is very narrow.
- Multilayer resists afford a good way of minimizing the effect of a very narrow depth of focus in printing rough wafer topography while using a high-resolution lens.
- Optical interference producing standing waves in the resist can cause regions of it to be improperly exposed. The use of a dye in the resist to increase optical absorption minimizes the effect.
- Quite good computer models of the lithography process are available.

# PROBLEMS 5

- Show that if a flat wafer of diameter D is uniformly bowed so that the center deflection is δ, its radius of curvature is approximately D<sup>2</sup>/8δ.
- 2 Draw a mask that will allow the letter I to be etched into the surface of a silicon wafer if negative resist is to be used. Draw a mask that will allow a relief letter L to remain after etching if positive resist is used.
- 3. If AZ1350J resist is to be properly exposed in 1 s, what should the light intensity in mJ/cm<sup>2</sup>·s be? If a resist has a contrast ratio of 4 and the sensitivity of AZ1350J, what is the minimum swing in exposure energy that will give a satisfactory image?
- 4. If, using a standard light source, the exposure time for a particular resist is 2 s, what would you expect the exposure time to be if a newly developed laser light source 1000 times as intense were used? Explain your answer.
- What functions do wafer prebake, resist softbake, and resist hard-bake serve?
- 6. Why is polyisoprene-based negative resist not as

# REFEFENCES 5

- Mike V. Sullivan, "Iron Oxide See-Through Masks," J. Electrochem. Soc. 120, pp. 545–550, 1973.
- R.K. Watts, K.E. Bean, and T.L. Brewer, "X-Ray Lithography with Aluminum Radiation and SiC Mask," in Robert Bakish, ed., *Electron and Ion Beam Science and Technology* 78–5, Electrochemical Society, 1978.
- Phil E. Mauger and Alex R. Shimkunas, "Additive X-Ray Mask Patterning," Semiconductor International, pp. 70–74, March 1986.
- Dervin L, Flowers and Henry G. Hughes, "On the Use of Photomask Coatings in Photolithography," J. Electrochem. Soc. 124, pp. 1599–1602, 1977.
- 5. D.L. Flowers, "Lubrication in Photolithography-

suitable for fine-line geometry as novalac-based positive resist?

- Show that for an image modulation of 0.6, the normalized intensity swings between 0.25 and 1.0.
- Prove that for a given resolution, the use of a lens with the shortest wavelength and smallest numerical aperture gives the greatest depth of field.
- 9. Using the Rayleigh criterion for resolution limits, first plot resolution versus NA from NA = 0.2 to NA = 0.6 for a wavelength of 436 nm. Next, plot on the same graph the reported resolution of the lenses listed in Table 5.11 for all lenses with an NA of 0.4 or greater. Do they all seem reasonable?
- 10. If standing waves cause a resist development problem such as shown in Fig. 5.35, what will be the spacing of the undulations if the exposing wavelength is 0.436 µm?
- Sketch the surface profile of a wafer that has both an FPD and an LFPD of 0.5 μm.

Resist Protect Coats," J. Electrochem. Soc, 124, pp. 1608–1612, 1977.

- Eugene R. Blome, "Photolithographic Mask Release Coating," *Semiconductor International*, pp. 75–78, November 1979.
- Ron Hershel, "Pellicle Protection of IC Masks," Semiconductor International. pp. 97–106, August 1981.
- John M. Jerke et al., "Accurate Linewidth Measurements at the National Bureau of Standards," *Proc. Kodak Microelectronics Seminar*, pp. 51–59, Monterey, Calif., 1978.
- Kiwao Nakazawa et al., "Measuring System for Fine-Line Lithography," Proc. Kodak Microelectronics Seminar, San Diego, 1980.
- 10. Peter H. Singer, "Linewidth Measurement: Ap-

proaching the Submicron Dimension," Semiconductor International, pp. 48-54, March 1983.

- I. Tanabe, "Production of High Quality MOS LSIs," Proc. Koduk Microelectronics Seminar, pp. 91–99, 1975
- Masana Minami et al., "A New Mask Defect Inspecting Device," Proc. Kodak Microelectronics Seminar, pp. 67–80, Monterey, Calif., 1976.
- K. Levy and P. Sandland, "Automatic Mask Inspection: What Can It Find, What Will It Miss. and What Can It Do?" Proc. Kodak Microelectronics Seminar, pp. 84–94, 1977.
- L. K. Gailbraith, "Advances in Automated Detection of Wafer Surface Defects," Proc. ASTM Symposium, January 1982.
- Mikio Makito et al., "Analysis of Registration Errors in 1:1 Projection Mask Aligners," Proc. Kodak Microelectronics Seminar, San Diego, 1980.
- G.L. Resor and A.C. Tobey. "The Role of Direct Step-on-the-Wafer in Microlithography Strategy for the 80's," *Solid State Technology*, p. 101, August 1979.
- H.R. Rottmann, "Overlay in Lithography," Semiconductor International, pp. 83-94, December 1980.
- David S. Perloff, "A Four-Point Electrical Measurement Technique for Characterizing Mask Superposition Errors on Semiconductor Wafers," *IEEE J. Solid State Cir. SC-13*, pp. 436– 444, 1978.
- T.J. Russell, T.F. Leedy, and R.L. Mattis, "A Comparison of Electrical and Visual Alignment Test Structures for Evaluating Photomask Alignment in Integrated Circuit Manufacturing," *IEDM Technical Digest*, 1977.
- William S. DeForest, *Photoresist Materials and Processes*, McGraw-Hill Book Co., New York, 1975.
- Donald J. Sykes, "Recent Advances in Negative and Positive Photoresist Technology," Solid State Technology, pp. 53–57, August 1973.
- Alvin Stein, "The Chemistry and Technology of Negative Photoresists," Philip A. Hunt Chemical Corporation brochure no. 1014/0535R1.
- Alvin Stein, "The Chemistry and Technology of Positive Photoresists," Philip A. Hunt Chemical Corporation brochure no. 1015/0535R1.
- John Hock et al., "Resist Image Reversal for Next-Generation VLSI Circuit Fabrication,"

Semiconductor International, pp. 164-168 and references therein. September 1987.

- Materials for Microlithography, ACS Symposium Series 266, American Chemical Society, Washington, D.C., 1984
- Daryl Ann Doane and Adam Heller, eds., Proc. Inorganic Resist Systems Symposium 82-9, Electrochemical Society, Pennington, N.J., 1982.
- T.C. Penn, "Forecast of VLSI Processing—A Historical Review of the First Dry-Processed 1C," *IEEE Trans. on Electron Dev. ED-26*, pp. 640–643, 1979.
- J.N. Smith et al., "A Production-Viable Plasma Developable Photoresist," Semiconductor International, pp. 41–47, 1979.
- 29. Gary N. Taylor and Thomas M. Wolf, Chapter 4, "The Status of Dry-Developed Resists for Each Lithographic Technology," in Norman G. Einspruch and Graydon B. Larrabee, eds., VLSI Electronics, Vol. 6, Academic Press, New York, 1983.
- G.N. Taylor, L.E. Stillwagon, and T. Venkatesan, "Gas-Phase-Functionalized Plasma-Developed Resists: Initial Concepts and Results for Electron-Beam Exposure," J. Electrochem. Soc. 131, pp. 1658–1664, 1984.
- B.F. Griffing and P.R. West, "Contrast Enhanced Lithography," Solid State Technology, pp. 152– 157 and references therein, May 1985.
- Don R. Strom, "Optical Lithography and Contrast Enhancement," Semiconductor International, pp. 162–167, May 1986.
- A.R. Neureuther and W.G. Oldham, "Resist Modeling and Profile Simulation," Solid State Technology, pp. 139–144, May 1985.
- Shigeki Shimizu and George R. Bird, "Chemical Mechanisms in Photoresist Systems," J. Electrochem. Soc. 126, pp. 273–277, 1979.
- J.N. Helbert, "Photoresist Adhesion Promoter for Gold Metallization Processing," J. Electrochem. Soc. 131, pp. 451–452, 1984.
- B.D. Washo, "Rheology and Modeling of the Spin Coating Process," *IBM J. Res. Develop.* 21, pp. 190-198, 1977.
- Patrick O'Hagen and William J. Daughton, "An Analysis of the Thickness Variance of Spun-On Photoresist," Proc. Kodak Microminiaturization Seminar, 1977.

- G. F. Damon, "The Effects of Whirler Acceleration on the Properties of the Photoresist Film," Kodak pub. P195, pp. 34–41, a collection of papers from the 1965 and 1966 Kodak seminars.
- Hans Denk, "Optimization of Photoresist Processing with the IBM 7840 Film Thickness Analyzer," Proc. Kodak Microelectronics Seminar, pp. 28–34, Monterey, Calif., 1976
- 40. Kodak pub. G-96 (positive resist 809).
- Mary Long, "Quantitative Evaluation of a Photoresist," Proc. Kodak Microelectronics Seminar, pp. 60–69, San Diego, 1970.
- Burke Leon, "The Nature of Negative Photoresist Scurnming: Part II," Solid State Technology, pp. 58-61, May 1977.
- Ted C. Bettes, "UV Exposure, Systems, and Control," *Semiconductor International*, pp. 83– 95, April 1982,
- Jerry Bachur, "Deep UV Exposure Technology," Solid State Technology, pp. 124–127, February 1982.
- Hans Schellenberg, "Optical Mask Alignment and Exposure Systems," Proc. Kodak Photoresist Seminar, pp. 54–63, Philadelphia, 1969.
- Jeanne Roussel, "Step-and-Repeat Wafer Imaging," Solid State Technology, pp. 67–71, May 1978.
- James J, Ewing, "Rare-Gas Halide Lasers," *Physics Today*, pp. 32–39, May 1978.
- R.A. Lawes. "Use of Excimer Lasers in Photolithography," Semiconductor International, pp. 76-77, July 1986.
- K. Jain, C.G. Wilson, and B.J. Lin, "Ultrafast Deep UV Lithography with Excimer Lasers," *IEEE Elec. Dev. Lett. EDL3*, pp. 53–55, 1982.
- H. Craighead et al., "Contact Lithography at 157 nm with an F<sub>2</sub> Excimer Laser," J. Vac. Sci. Tech. B1, pp. 186–189, 1983.
- Elmar Cullmann, "Excimer Laser Applications in Contact Printing," Semiconductor International, pp. 332–334, May 1985.
- 52. Karl Suss product literature.
- K. Jain and R.T. Kerth, "Excimer Laser Projection Lithography," Appl. Opt. 23, pp. 648–630, 1984.
- 54. T. Kaneko, T. Umegaki, and Y. Kawakami, "A Practical Approach to Sub-Micron Lithography." Proc. Kodak Microelectronics Seminar, San Diego, October 1980.

- R.K. Watts and J.H. Bruning, "A Review of Fine-Line Lithographic Techniques: Present and Future," Solid State Technology, pp, 99–105, May 1981.
- Richard C. Heim, "Practical Aspects of Contact/ Proximity Photomask/Wafer Exposure," Proc. SPIE 100, in Semiconductor Microlithography II, pp. 104–114, 1977.
- Michael C. King, "Future Developments for 1:1 Projection Photolithography," Semicon/Europa, Zurich, 1978, and "The Characteristics of Optical Lithography," Proc. Kodak Microelectronics Seminar, San Diego, October 1980.
- John W. Bossung, "Projection Printing Characteristics," Proc. SPIE 100, in Semiconductor Microlithography II, 1977.
- David A. Markle, "A New Projection Printer," Solid State Technology, pp. 50-53, June 1974.
- 60. Perkin-Elmer product literature.
- David A. Markle, "Submicron 1:1 Optical Lithography," Semiconductor International, pp. 137-142 and references therein, May 1986.
- Y. Lida and K. Mori, "A Modified S.E.M. Type EB Direct Writing System and Its Application on MOS LSI Fabrication," J. Electrochem. Soc. 128, pp. 2429–2434, 1981.
- H.C. Pfeiffer, "Direct Write Electron Beam Lithography—A Production Line Reality," Solid State Technology, pp. 223–227, September 1984.
- J.S. Greeneich, "Impact of Electron Scattering, on Linewidth Control in Electron-Beam Lithography," J. Vac. Sci. Technol. 16, pp. 1749–1953, 1979.
- D.F. Kyser and C.H. Ting, "Voltage Dependence of Proximity Effects in Electron Beam Lithography," J. Vac. Sci. Technol. 16, pp. 1759–1763, 1979.
- Nao-aki Aizaki, "Proximity Effect Dependence on Substrate Material," J. Vac. Sci. Technol. 16, pp. 1726–1933, 1979.
- Shoichiro Yoshida et al., "X-Ray Lithographic System," Proc. Kodak Microelectronics Seminar, San Diego, October 1980.
- W.D. Buckley et al., "The Design and Performance of an Experimental X-Ray Lithography System and the Fabrication, Patterning, and Dimensional Stability of Titanium Membrane X-Ray Lithography Masks," Proc. Kodak Microelectronics Seminar, October 1980.

- H.A. Hyman et al., "Intense-Pulsed Plasma X-Ray Sources for Lithography: Mask Damage Effects," J. Vac. Sci. Technol. 21, pp. 1021–1026, 1982.
- Armand P. Neukermans, "Current Status of X-Ray Lithography," Solid State Technology, pp. 185–188 and references therein, September 1984.
- J.R. Maldonada et al., "X-Ray Lithography Source Using a Stationary Solid Pd Source," J. Vac. Sci. Technol. 16, pp. 1942–1945, 1979.
- E. Spiller et al., "Application of Synchrotron Radiation to X-Ray Lithography," J. Appl. Phys. 47, pp. 5450-5459, 1976.
- H. Aritomi et al., "X-Ray Lithography by Synchrotron Radiation of the SOR-Ring Storage Ring," J. Vac. Sci. Technol. 16, pp. 1939–1941, 1979.
- 74. John A. Doherty et al., "Focus Ion Beams in Microelectronic Fabrication," *IEEE Trans. on Components, Hybrids, and Manufac. Tech.*, CHMT6, pp. 329–333, 1983.
- Kenji Gamo and Susumu Namba, "Ion Beam Lithography," Ultramicroscopy 15, pp. 261–270, 1984.
- J.N. Randall, "Prospects for Printing Very Large Scale Integrated Circuits with Masked Ion Beam Lithography," J.Vac. Sci. Technol. A, pp. 777– 783, 1985.
- John L. Bartelt, "Masked Ion Beam Lithography: An Emerging Technology," Solid State Technology, pp. 215–220, May 1986.
- Ronald C. Bracken and Syed A. Risvi, Chapter V, "Microlithography in Semiconductor Device Processing," in Norman G. Einspruch and Graydon B. Larrabee, eds., VLSI Electronics: Microstructure Science, Academic Press, New York, 1983.
- Michael Johnson and Karen Lee, "Optimization of a Photoresist Process Using Statistical Design of Experiments," *Solid State Technology*, pp. 281-284, September 1984.
- Terry V. Nordstrom, "Reticle Sizing for Optimized CD Control." Semiconductor International, pp. 158–161, September 1985.
- Frederick H. Dill et al., "Modeling Projection Printing of Positive Resist," *IEEE Trans. on Electron Dev. ED-22*, pp. 456–464, 1975.

- J.H. Altman and H.C. Schmitt, "On the Optics of Thin Films of Resist over Chrome," *Proc. Kodak Photoresist Seminar 2*, pp. 12–19, Los Angeles, 1968.
- S. Middelhoek, "Projection Masking, Thin Photoresist Layers, and Interference Effects," *IBM J. Res. Develop.* 14, pp. 117–124, 1970.
- Dietrich W. Widmann, "Quantitative Evaluation of Photoresist Patterns in the 1-µm Range," *Appl. Opt. 14*, pp. 931–934, 1975.
- J.D. Cuthbert, "Optical Projection Printing," Solid State Technology, pp. 59-69, August 1977.
- William G. Oldham et al., "A General Simulator for VLS1 Lithography and Etching Processes: Part 1—Application to Projection Lithography," *IEEE Trans. on Electron Dev. ED-26*, pp. 717– 722, 1979.
- T.R. Pampalone and F.A. Kuyan, "Improving Linewidth Control over Reflective Surfaces Using Heavily Dyed Resist," J. Electrochem. Soc. 133, pp. 192–196, 1986.
- T.R. Pampalone and F.A. Kuyan, "Contrast Enhancing Additives for Positive Photoresist," J. Electrochem. Soc. 135, pp. 471–476, 1988.
- Edward John Walker, "Reduction of Photoresist Standing-Wave Effects by Post-Exposure Bake," *IEEE Trans. on Electron Dev. ED-22*, pp. 464–466, 1975.
- Dietrich W. Widmann and Hans Binder, "Linewidth Variations in Photoresist Patterns on Profiled Surfaces," *IEEE Trans. on Electron Dev. ED-22*, pp. 467–471, 1975.
- E. Ong et al., "Bilevel Resist Processing Techniques for Fine-Line Lithography," Proc. Kodak Microelectronics Seminar, pp. 91–97 and references therein, Dallas, 1981.
- E. Bassous et al., "A Three-Layer Resist System for Deep U.V. and RIE Microlithography on Nonplanar Surfaces," J. Electrochem. Soc. 130, pp. 478–484 and references therein, 1983.
- Y, Saotoma et al., "A Silicon Containing Positive Resist (SIPR) for a Bilayer Resist System," J. Electrochem. Soc. 132, pp. 909–913, 1985.
- H. Umezaki et al., "Deep-UV Contact Lithography Using a Trilevel Resist System for Magnetic Bubble Devices with Submicron Minimum Features," J. Electrochem. Soc. 132, pp. 2440– 2444, 1985.

- 95. C. Lyons and W. Moreau, "Aqueous Processable Portable Conformable Mask (PCM) for Submicron Lithography," J. Electrochem. Soc. 135, pp. 193–197 and references therein, 1988.
- L.K. White, "A Modeling Study of Superficial Topography for Improved Lithography," J. Electrochem. Soc. 132, pp. 3037–3041, 1985.
- 97. W.G. Oldham and A.R. Neureuther, "Projection Lithography with High Numerical Aperature

Optics," Solid State Technology, pp. 106-111, May 1981.

- F.H. Dill et al., "Characterization of Positive Photoresist," *IEEE Trans. on Electron Dev. ED*-22, pp. 445–452, 1975.
- R.E. Jewett et al., "Line-Profile Resist Development Simulation Techniques," *Polymer Eng. Sci.* 17, pp. 381–384, 1977.



# Etching

# 6.1 INTRODUCTION

In many IC manufacturing steps, whole wafers are completely coated with a layer or layers of various materials such as silicon dioxide, silicon nitride, or a metal. The unwanted material is then selectively removed by etching through a mask to leave, for example, holes in a thermal oxide where diffusions are to be made or long stripes of aluminum for electrical interconnects between individual circuit elements. In addition, various patterns must sometimes be etched directly into the semiconductor surface. Examples are circular holes or short grooves where trench capacitors are to be made in silicon; mesas that are required in the silicon dielectric isolation process (discussed later); and small, flat depressions in GaAs where the gate metal is to be deposited.

Possible kinds of etching are wet chemical, electrochemical, pure plasma etching, reactive ion etching (RIE), ion beam milling, sputtering, and high-temperature vapor etching. Wet etching, in which the wafers are immersed in aqueous etching solutions, is the oldest and, when applicable, the least expensive process. Electrochemical etching is done in aqueous-based solutions and has very limited applicability. Plasma etching and combination plasma/RIE are relatively new, are performed in a low-pressure gaseous plasma, and are most commonly used in fine-geometry applications. Plasma etching generally involves fewer safety hazards and spent chemical disposal problems, but the additional cost of plasma equipment is a deterrent to its use when fine-line definition (less than approximately 3 µm) is not necessary. Ion beam milling, done in vacuum, is seldom used. Sputtering, done in a relatively low vacuum, is quite slow, produces surface damage, and is used only for a surface cleanup. Vapor etching generally requires temperatures in the order of 1000°C and is mostly used as an in situ cleanup before epitaxial depositions.

A few etching procedures in IC manufacturing do not involve any local masking, These procedures include etching whole semiconductor slices to remove damage and/or to polish the surface and etching slices or chips to delineate crystallographic defects. In addition, before the advent of planar technology, a variety of germanium and silicon etching steps were used for removing damage from junctions. Because of this application, a body of literature exists that relates to junction cleanup etches (see, for example, reference 1); however, since these etches do not relate to modern IC technology, they will not be discussed in this context. Most IC etching does require material removal in selected regions only and hence requires a series of related processing steps, which are

- Coat the wafer with an adherent and etch-resistant photoresist.
- 2. Selectively remove the resist to leave the desired pattern.
- 3. Etch to transfer the mask pattern to the underlying material.
- Remove (strip) the photoresist and clean the wafer.

Of these steps, all but step 3 were discussed in Chapter 5 on lithography.

Etching completely through thin layers presents a somewhat different set of problems from the problems encountered in etching partway through thick layers. In the latter case, the thickness removed must generally be determined by time and etch rate. In the case of thin layers, the etch used is ordinarily selective and will slow or stop its etching action when the next layer is reached. It may, however, continue to etch laterally and give undercutting. Wet etching is particularly prone to this problem. Etches that behave thusly are isotropic; that is, they etch at the same rate in all directions. For etching to be truly isotropic, the material itself must be isotropic, as is, for example, thermal SiO2, and the etch rate must be controlled by surface kinetics. When the material is anisotropic, as is Si or GaAs, diffusion-controlled etching will remove the effects of any anisotropic surface kinetics and may still give isotropic etching. More likely, however, diffusion effects will cause enhanced diffusion near mask window edges (2). Fig. 6.1a shows an isotropic profile when etching is done through a window into a very thick material. In this geometry, the center of curvature for the undercut is assumed

### FIGURE 6.1

(a) Isotropic etching showing undercutting of an amount equal to depth of etch. (b) Isotropic etching of a thin layer that has just barely been etched through. (c) Etching at a later time showing increased undercutting and steeper sidewall slope.



to be at the mask edge. Fig. 6.1b shows the profile when etching is done through a thin layer. As undercutting progresses, the radius becomes larger, and the etching edge steeper (3), as Fig. 6.1c shows.

Anisotropic etching implies that a substantial difference exists in etch rates in different directions. There are, however, two usages of the term within this context. In one, which is commonly encountered in thin-film etching discussions, anisotropic etching means appreciable etching in a direction normal to the wafer surface and essentially no etching laterally. In the other, which may arise during the etching of single crystals, anisotropy is a measure of the differing etch rates in different crystallographic directions. Etches that show this latter behavior are anisotropic because their rate is limited, not by diffusion of etching reactants to and from the etching surface, but by differing reaction rates on different crystallographic planes. Such etches are not applicable to thin-film applications since most thin films are either amorphous or fine-grained polycrystalline material. An anisotropic etch mechanism sometimes operative in combination plasma/RIE restricts lateral etching and is independent of crystallography. As depicted in Fig. 6.2a, it depends on the sidewalls becoming coated during etching with a polymer that resists further etching (4). Etching will also be highly anisotropic, as shown in Fig. 6.2b, if the etching molecules or ions arrive at the etching surface in a parallel beam and have negligible surface mobility.

Many pattern-etching applications involve sequentially etching through multiple layers. Sometimes, the same etch formulation is used for all of the layers, but seldom will the etch rates be the same. When they are not, substantial undercutting of one or more of the layers may occur. The advantage of a single etch is that it requires only one etch step and is much less time consuming; however, when a single etch is used with wet etching, it is only applicable to large geometries. To minimize undercutting, the layers can be etched in sequence, using an etch specific to each layer, with each etched layer acting as the mask for the next one. Such an approach presumes that a series of etches is available that will attack the desired

FIGURE 6.2

(a) Anisotropic etching by sidewall passivation. (b) Anisotropic etching by directed reactive ions.

<sup>(</sup>a) (b)

layer at a useful rate and all of the rest either very slowly or not at all. Thus, the difference in etch rates (selectivity) between different materials is a very important factor. The individual rate values are also of importance since if the rate is too slow, the process may not be economically feasible and if it is too fast, the process may not be controllable.

In addition to rate, selectivity, and anisotropy, other etchant properties of interest are the effect of the etch on the mask material (whether it erodes the masking material or causes it to peel up from the surface) and the contaminants left behind. Etchant purity and cleanliness are particularly important because many liquid etchants can have heavy metal contaminants that will plate onto semiconductor surfaces and particulates that are difficult to remove during the rinse cycles. Plasma etching may leave solid reaction products in the windows, and sputter etching and ion milling sometimes transfer material from some part of the etching machinery hardware to the etching surface.

Wet etching in its simplest form consists of a beaker of etchant adjacent to several beakers of rinse water. The wafers are loaded into some sort of plastic holder and immersed in the etchant beaker for a specified time. They are then quickly transferred to the first rinse water beaker in order to abruptly stop etching (quench) and then moved in sequence to the others. To ensure that the final rinse water remains essentially free of etchant, the first beaker used is dumped after each use, the remaining beakers are moved forward, and a fresh one is added at the end. For reliable finished parts, it is imperative that no etch reactants remain on the surface. Otherwise, either the electric fields present during operation will cause surface ion migration and subsequent electrical failure or moisture diffusing to the chip surface through plastic packages will react with the ions and produce lead corrosion.

Several versions of wet etchers (often referred to as wet processors) are now available (5), but they all incorporate the same etchrinse steps used in the multiple-beaker approach. Usually, the etch bath is temperature controlled, and often the etching solution will be continually recirculated and filtered. The subsequent rinse operation uses hot deionized water and may have nitrogen bubbling through it for agitation. The resistivity of the water can be monitored to determine the time required for rinsing, based on the time for the water resistivity to recover to its value before wafer insertion. This time will be a function of water flow, the specific design of the rinse tank, and the etchant used. Of the common etchants, dilute HF is the easiest to remove as measured by a resistivity monitor; nitric

# 6.2 WET ETCHING

acid, the next easiest; and sulfuric acid, the most difficult. The spread of times is from 30% to 50%. For low flow rates, the time decreases as flow rate increases but becomes progressively less dependent on flow as the flow increases. Typically, times long enough to allow 4-6 changes of water in the tank are required. The resistivity of the rinse water is, however, misleading in that it does not measure the amount of ions that remain adsorbed on the semiconductor surface. Since fluorine ions are particularly difficult to remove from silicon, silicon wafers may require several additional minutes of water rinsing over that indicated by a resistivity monitor.

One early approach to automating the wet etching process used a single wafer chamber in which first the etchant and then the rinses were introduced (6). A more common approach is to use multiple tanks (much like the beakers) with a robot arm to transfer the wafer carrier from tank to tank. A sketch of a fume hood with an included etch tank is shown in Fig. 6.3. Not shown are the rinse tanks inside the hood. To minimize deionized water usage, the water rinse tanks are often cascaded so that water flows from tank to tank, with the pure water input going into the last tank used. Yet another method opens the bottom of the tank containing the wafer holder and etch solution when etching is complete so that the etchant is very rapidly



removed (dumped). The tank is then closed and flooded with water, and the dump step is repeated. The rinses can be repeated as many times as are necessary to ensure complete etchant removal. In some machines, the etch and then the rinse water are sequentially sprayed over the wafers, after which the holder containing the wafers is rapidly spun to remove the water (spin-rinse etching). During etching of deep, narrow geometries under stagnant conditions, diffusion of reactants into the small openings becomes a major limitation. Spray jets directed normal to the surface offer some improvement, but the thin, stagnant layer on the surface still must be penetrated by diffusion. It has been suggested that further improvement should be possible by making use of the fact that the etch solution and the reaction by-products have different densities and using a centrifugal field to promote circulation (7). However, since extensive undercutting generally occurs in wet etching, small geometries are almost always plasma etched.

### Etch composition is often described in terms of parts by volume for example, nitric, hydrofluoric, and acetic acids in the ratio of 1:1:2. Solids added are then specified in terms of grams per volume of solution. When composition is given in this way, one question that remains is the composition of the starting materials. Not only must the normal manufacturing spread in composition be considered, but also the fact that many of the acids of commerce are sold in different concentrations. For example, HF is commonly available as 49% by weight HF, remainder water, and also as 53% HF, remainder water. When water is part of the formulation, it must be made clear whether the water in the acid is included in the water specified. For example, does 1 part HF, 1 part water mean 1 part of 49% HF plus its water and an additional 1 part water, or does it mean a solution of 50% HF and 50% water?

Solution normality is another way to express concentration. A 1 normal solution is one that contains 1 gram equivalent weight (GEW) of solute per liter (I) of solution. A GEW is defined as the gram molecular weight divided by the number of available H<sup>+</sup> or OH ions per molecule. Molarity is also used, with a 1 molar solution containing I gram molecular weight of solute per liter of solution. A 1 molal solution is one with 1 gram molecular weight of solute per liter of solute per 1000 grams of solvent. Under most circumstances, when water is the solvent and the solute is a solid that singly ionizes, 1 molar and 1 molal solutions are nearly equivalent. Mole (abbreviated as mol) is a measure of the number of molecules in I gram molecular weight, or  $6.02 \times 10^{23}$ . The weight of 1 mol of an ion is given by the sum of the atomic weights of the ion constituents. So, for example, 1 mol of NH<sub>4</sub> ions =  $14 + (4 \times 1) = 18$  grams.

# 6.2.1 Etch Formulation

EXAMPLE How many grams of glacial acetic acid (CH,COOH) are required for 1 gram equivalent weight? How much of the acid must be added to I liter of water to give a 1 normal solution? How much for a 1 molar solution?

> First, the number of available hydrogen ions per molecule must be determined. The formula as written above (CH<sub>3</sub>COOH) gives no insight. However, when it is written in the alternate form of HC,H,O,, convention dictates that the beginning number of hydrogens is the available number. Hence, there is one hydrogen available per molecule, and a gram equivalent weight is the gram molecular weight, or 60 grams. Since glacial acetic acid is 99 % pure acid, assume that it is 100%. Since a 1 normal solution is I GEW per liter of solution, 60 grams (57 ml) must be added to 943 ml of water (assuming that the volume of the mixture equals the sum of the component volumes). Since, in this case, a gram equivalent weight and a gram molecular weight are the same, a 1 normal solution is the same as a 1 molar solution.

In the particular case of the hydronium ion, the concentration is usually expressed in terms of pH, which is - log10 of the concentration in moles per liter of solution. Thus, I gram molecular weight of strong acid such as HCl in a liter of solution will completely ionize, produce an  $H_{2}O^{+}$  concentration of 1 mol per liter and have a pH of 0. Experimentally, it is observed that by the time the ionizing material being added to the water is reduced to the point that only water is present, the water itself will very slightly dissociate into equal quantities of hydronium and hydroxide ions, each with a concentration of about 10<sup>-7</sup> mol.<sup>1</sup> Since the number of + and - ions are equal, the water solution is neutral. By the definition just given, it also has a pH of 7. Thus, acid solutions have a pH range of 0-7. When the hydrogen concentration is less than  $10^{-7}$ , the OH<sup>-</sup> ions are in the majority, and the pH of basic solutions ranges from 7 to 14.

Many reactions, including most etchants, will depend on the pH of the solution. As etching proceeds, the reactants are depleted so that the pH and the etch rate change. In order to provide a means of initially adjusting the pH, as well as to stabilize it during etching, buffered solutions are often used. Buffering entails adding a salt of the etching acid to the acid, and it is applicable to water solutions of weak acids.2 Thus, for example, the salt NH4F could be added to

The dissociation constant of water at room temperature is 10<sup>-14</sup> so that if the OH ions have a concentration of 1 mol per liter of water solution, the concentration of H<sub>i</sub>O ions will be 10<sup>14</sup>, and the solution will have a pH of 14.

<sup>&</sup>lt;sup>2</sup>For more details on buffering, pH, and the different ways of expressing concentration, see an elementary chemistry textbook.

the weak acid HF. A weak acid is one that does not completely dissociate in water at room temperature and includes all but six of the common acids. These six "strong" acids (perchloric, hydriodic, hydrobromic, hydrochloric, nitric, and sulfuric) completely dissociate and thus do not lend themselves to buffering. For a weak acid of concentration  $[C_a]$  above a small fraction of a mole, the hydronium ion concentration  $[H_3O^+]$  without buffering is given by<sup>3</sup>

$$[\mathbf{H}_{3}\mathbf{O}^{+}] \cong \{K_{a}[C_{a}]\}^{0.5}$$
6.1

where  $K_a$  is the dissociation constant of the acid. For a solution with a salt of concentration  $[C_s]$  added, the hydronium ion concentration is

$$[\mathrm{H}_{3}\mathrm{O}^{+}] \cong \frac{K_{a}[C_{a}]}{[C_{a}]}$$

$$6.2$$

It can be shown that for the  $[C_a]/[C_s]$  ratio near 1, the pH is quite insensitive to change when either a strong acid or a base is added to the solution (hence, the term "buffering").

Silicon etches that are nonselective with regard to crystallographic orientation and to crystallographic damage and that thus tend to produce smooth surfaces are primarily based on  $HF-HNO_3-H_2O$  mixtures. The etching is a two-step process in which the silicon is first oxidized by the nitric acid and the oxide is then dissolved by HF. The overall reaction is assumed to be (8)

$$18HF + 4HNO_1 + 3Si \rightarrow 3H_2SiF_6 + 4NO + 8H_2O = 6.3$$

Acetic acid is sometimes used either with or instead of additional water. Its role appears to be primarily as a diluent (9), although there is also the suggestion that the surface tension of the mixture is reduced so that better wetting and a smoother etched surface are achieved (10). This behavior is different from that observed during the etching of germanium, in which case the acetic acid is described as a moderator that prevents violent reaction (11). HF in water alone will also very slowly etch silicon (12). In this case, the OH<sup>-</sup> ion from water dissociation oxidizes the silicon. Since addition of HF will depress the OH<sup>-</sup> ion, higher HF concentrations reduce the reaction rate.

For the HF-HNO<sub>3</sub>-CH<sub>3</sub>COOH system, Fig. 6.4a shows how the rate varies with composition.<sup>4</sup> At each end of the HF-HNO<sub>3</sub> con-

6.2.2 Nonselective Silicon Etching

<sup>&</sup>lt;sup>3</sup>Brackets [] in chemical formulae are used to denote concentration. <sup>4</sup>For instructions on reading triangular graphs, see Appendix B.

centration range, the rate is determined by the concentration of the low-concentration species. In the low HNO<sub>3</sub> region, the rate is limited by oxidation and is sensitive to crystal orientation, crystal defects, and the presence of lower oxides of nitrogen, such as NO. In this region, the reaction may begin either slowly or not at all unless an initially damaged surface is present or a small amount of sodium nitrite (9) is added to the solution. In the low HF region, dissolution of the oxide is the limiting step, and the rate becomes diffusion controlled. These two regions are also characterized as giving rough and polished surfaces, respectively. The general location of the smoothrough boundary is shown in Fig. 6.4b.

The compositions of three nonselective silicon etches are as follows:

CP4: 3 HF, 5 HNO<sub>3</sub>, 3 acetic, 0.06 bromine (32 μm/minute) CP4A: 3 HF, 5 HNO<sub>3</sub>, 3 acetic (25 μm/minute) Planar: 2 HF, 15 HNO<sub>3</sub>, 5 acetic (3.5–5.5 μm/minute)

All parts are by volume, 49% HF, 70% HNO<sub>3</sub>, glacial acetic (100%). CP4 (chemical polish no. 4) was originally used for germanium and is a very rapidly acting etch. By leaving out the Br (CP4A), the rate is reduced, and a more tractable silicon etch results. As with many other data presented, published etch rates vary substantially. For example, the rates from Fig. 6.4a and the ones just tabulated differ by about a factor of 2. Such discrepancies are not necessarily the





#### FIGURE 6.4

(a) Effect of composition of HF-HNO<sub>3</sub>-acetic acid etching mixture on room-temperature etch rate of silicon. (b) Etchant composition region where silicon polishing occurs. (Source: B. Schwartz and H. Robbins, J. Electrochem. Soc. 123. p. 1903, 1976. Reprinted by permission of the publisher. The Electrochemical Society, Inc.)

Temperature dependence of silicon etch rate for a 20% HF, 45% HNO<sub>3</sub>, 35% acetic acid solution. (*Source:* Replotted from B. Schwartz and H. Robbins. J. Electrochem. Soc. 108, p. 365, 1961.)

## 6.2.3 Polycrystalline Silicon Etching



results of poor data but can stem from the fact that these etch rates are dependent on a number of difficult-to-control variables, such as a buildup of NO or the temperature at the etching surface. The effect of temperature for a composition near that of CP4A is shown in Fig. 6.5. Based on this curve, etching done at 30°C could easily be twice that done at 25°C. In the diffusion-controlled region, rate is sensitive to variables such as stirring and wafer spacing. (Dependency on these variables is often considered proof of a diffusion-controlled process.)

Polycrystalline silicon, particularly the fine-grained, low-temperature poly used for MOS gates and leads, etches very rapidly in most etches designed for single-crystal silicon. In pattern etching polysilicon, the polysilicon will usually have been deposited over SiO, so that HF-bearing etches may offer little polysilicon/oxide selectivity. In addition, undercutting makes control of the gate and lead geometries difficult. Thus, plasma etching is generally used in these applications (see section 6.3). However, a dilute (15%) KOH-water solution has been used, as has ethylenediamine-pyrocatechol-water (13). Also, by working far into the high nitric acid portion of the polish region-for example, 6 HF, 100 HNO<sub>3</sub>, 40 H<sub>2</sub>O-slow (<1 µm/minute), smooth edge definition and minimal oxide attack are possible (14). Choline, used as a sodium-free alkaline developer for positive resist and for wafer cleaning, can also be used for etching polycrystalline silicon. A 5% solution etches phosphorus-doped polysilicon at a rate of about 250 Å/minute at room temperature, while it etches LPCVD nitride and thermal oxide at rates of <1 A/minute.

6.2.4 Anisotropic Silicon Etching The etches described in section 6.2.2 are "hot" etches that are good for polishing and etching isotropically. There are also etches composed, for example, of chromic oxide, water, and HF that etch single-crystal silicon very slowly but regions with crystallographic damage much more rapidly. In a sense, they are anisotropic, but some silicon etchants are available that are truly anisotropic and etch extremely slowly in <111> directions and at a useful rate in other directions. This property can be used to form a variety of patterns in a silicon wafer that would be very difficult to achieve in any other manner. The slow etch rate in <111> directions is a consequence of the diamond lattice. A "(111) plane" is really a double layer bound together by more atomic bonds than are found between other planes, as is illustrated in Fig. 6.6. The figure shows a view of

#### FIGURE 6.6

Views of diamond or zinc blende lattice showing double layers of atoms encountered in moving through lattice in a [111] direction: (a) Looking in a [110] direction. (b) Looking in a slightly different direction, with perspective. (Source: Program for drawing lattice courtesy of Dr. Anthony Stephens, Texas Instruments Incorporated.)



Relative width of grooves of depth *d* etched into (100) silicon surfaces when isotropic and anisotropic etches are used.



the diamond lattice when looking in a [110] direction and a perspective view in which the line of sight is slightly off a [110] direction so that the bonds tying the double layer together can be more easily seen.

One of the oldest applications of anisotropic etching, also referred to as orientation-dependent etching (ODE), is the etching of mesas on silicon wafers as one step in the production of dielectrically isolated ICs. Mesas can be etched using an isotropic etch, but because of undercutting of the mask, they cannot be as closely spaced as when an anisotropic etch is used, as is shown in Fig. 6.7 for the particular case of (100) oriented wafers (15). Such circuits are used for radiation hardening and for high-voltage ICs (16, 17). More recently, silicon has been used for a variety of micromechanical applications, many of which require anisotropic etching (18).

Any structures etched using anisotropic etches will have the etched features bounded predominately by (111) planes, although in some cases facets of planes with intermediate etch rates will appear. To consider the shape of the features that can be etched, both the traces of the (111) planes intersecting the wafer surface and the angles of the (111) planes with the wafer surface must be considered. Anisotropic etching on a (111) wafer is not possible since the first plane encounter is the slow-etching (111) plane. Etching on (110) wafers is possible, but such wafers are not currently used for IC production. The widely used (100) wafers are, however, quite amenable to anisotropic etching. Traces of {111} planes intersecting a (100) surface make right angles with each other. If a mask with a slot opening in it is oriented azimuthally on the wafer<sup>5</sup> so that the slot edge is parallel to a (111) trace, a trench bounded by  $(\overline{111})$  and  $(\overline{111})$  planes on the sides can be etched. If etching is allowed to

Wafers that conform to SEMI standards will have an edge flat ground parallel to a (110) plane that can be used for orienting the mask.

(a) SEM view of anisotropically etched grooves in (100) silicon. (b) Cross section of a polycrystalline silicon wafer with dielectrically isolated single-crystal islands.



continue to completion, the trench will be V-shaped, with both the width and depth determined by the mask opening. Etching can be stopped earlier, however, leaving a flat-bottomed trench. In this case, the depth is a function of etching time. If, instead of a single slot opening, the mask is patterned with many slots at right angles to each other, a series of mesas can be etched. An example of the kind of etching required for dielectrically isolated (DI) ICs is shown in Fig. 6.8a. After the DI process is completed, the wafer has a series of regions isolated from the main body of the wafer by oxide layers, as shown in cross section in Fig. 6.8b.<sup>6</sup>

At outside corners (corners of mesas where two (111) planes intersect), etching may not stop abruptly. Instead, it will continue until an intermediate etching rate set of planes such as the {331} are exposed and, in cases in which the mesas are small, may produce pronounced corner faceting as shown in Fig. 6.9. This effect can be minimized by using a corner-compensation mask design that has

<sup>\*</sup>After mesa etching, a thermal oxide about 2  $\mu$ m thick is grown on the etched surface. Next, a polysilicon layer about as thick as the original wafer is deposited over the oxide. The wafer is then turned polysilicon side down, and the opposite side (the single-crystal side) is ground off and polished until only single-crystal islands isolated by the oxide layer and immersed in a polysilicon matrix are left. This leaves the structure shown in Fig. 6.8b.

Exposure of higher-index planes at corners during orientation-dependent etching of (100) silicon. (The etch used was KOH-propanol-water, and the exposed corner planes are of the {331} family. Other etchants may expose other planes.)



"ears" sticking out from each corner to slow down etching at the corners (19-21).

When [110] wafers are used, since several of the {111} family of planes are perpendicular to the [110] surface, slots with vertical walls can be etched out by aligning the edge of the mask with the traces of the perpendicular (111) planes (22). However, since these {111} planes do not intersect one another at right angles, rather than being able to have square or rectangular mesas, they must be rhombic-shaped. It is also not possible to etch deep, short slots because other low-angle planes of the {111} family come in from the ends and limit the depth. However, when the slots go to the wafer edge, the depths can be up to hundreds of microns deep since the difference in etch rates in the (110) and (111) directions is at least 600 to 1 (23), Fig. 6.10 shows an example of a series of deep vertical grooves.

Table 6.1 lists several silicon anisotropic etches that have been reported. None of them involve the HF-HNO<sub>3</sub> system previously discussed. The KOH reaction, which is also often used for damage removal even though it gives a faceted surface, probably proceeds as

$$Si + H_2O + 2KOH \rightarrow K_2SiO_1 + 2H_2$$
 6.4

The advantages of a KOH etch for damage removal (which does not require the anisotropic feature) are that both the fresh and spent etches pose less of a safety hazard than do the HF–HNO<sub>3</sub> mixtures and that it is often more economical.

EXAMPLE

Prove the assertion that square or rectangular mesas can be etched into (100) silicon and that the walls of the mesas will make angles of  $54.74^{\circ}$  with the (100) surface.

Since the (111) planes are the slowest in etching, the mesas

Cross-sectional views of etching into a (110) silicon plane using 80°C KOH and water. (In the higher magnification view (*bottom*), the silicon dioxide layer used as mask during the etching can be seen. Photoresist cannot be used since it is rapidly attacked by the alkaline etch at this temperature.)



(110) Silicon ODE 80  $\mu$  deep, 10  $\mu$  centers



5000×

10,000×

will be bounded by them. There is no (100) projection in Appendix A, but there is a (001) projection, and it is equivalent in that the wafer surface could just as easily have been labeled (001) as (100). Looking at the (001) projection shows that four (111) planes have their traces on the (001) plane perpendicular to each other:  $(\overline{111})$ ,  $(\overline{111})$ ,  $(\overline{111})$ , and  $(1\overline{11})$ . To find the angle that each makes with the (001) plane, either calculate it from the equation given in Appendix A or read it from the table of angles between planes. Either method will verify that 54.74° is the correct angle.

This form of etching is often referred to as concentration-dependent or selective etching. Some etchants have large differences in etch rates depending on whether the resistivity is high or low and on whether the silicon is n- or p-type. This kind of behavior has long been used to delineate high-low resistivity and pn junction regions for characterization (24). However, the low etch rate region may also be used as an etch stop and produce structures such as very thin membranes (not applicable to mainstream IC production, but sometimes useful in sensor construction or for processing siliconon-insulator structures). The general approach is to use the slow

6.2.5 Resistivity-Sensitive Silicon Etching

#### TABLE 6.1

Silicon Anisotropic Etches

| Solution Components                  | Reference |  |
|--------------------------------------|-----------|--|
| KOH, water, normal propanol          | (1)       |  |
| KOH, water                           | (2)       |  |
| Hydrazine, pyrocatechol              | (3)       |  |
| Ethylenediamine, pyrocatechol, water | (4)       |  |
| Ethylenediamine, water               | (4)       |  |
| Hydrazine, iso-2-propyl alcohol      | (5)       |  |
| Hydrazine, water                     | (5)       |  |

| Etch                                                                       | Composition                                                      | Comments                                                                                                                    |
|----------------------------------------------------------------------------|------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------|
| (100) ODE: Etches [100] ~100 × [111]<br>direction                          | KOH-normal propanol-H <sub>2</sub> O<br>KOH, 250 grams           | ~1 μm/minute at 80°C, [100];<br>"stops" at p*-interface; etches                                                             |
|                                                                            | Normal propanol, 200 grams<br>H <sub>2</sub> O, 800 grams        | Si <sub>3</sub> N <sub>4</sub> at 14 Å/hour; etches SiO <sub>2</sub><br>at 20 Å/minute.                                     |
| (110) ODE: Etches [110] 600 × [111]<br>direction                           | KOH-H <sub>2</sub> O<br>50-50 volume                             | ~0.8 µm/minute at 80°C in (110) silicon.                                                                                    |
| Ethylenediamine: Orientation-<br>dependent and concentration-<br>dependent | Ethylenediamine-<br>pyrocatechol-H <sub>2</sub> O<br>EDA, 255 cc | ~1.1 μm/minute at 100°C in [100];<br>"stops" at p <sup>++</sup> -interface; very<br>slow etching of SiO <sub>2</sub> (~3 Å/ |
|                                                                            | H <sub>2</sub> O, 120 cc<br>PC, 45 grams                         | minute); negligible etch of Al,<br>Au, Ag, Cu, Ni, or Ta.                                                                   |

Note: Ethylenediamine is NH<sub>2</sub>(CH<sub>2</sub>)<sub>2</sub>NH<sub>2</sub>; hydrazine, N<sub>2</sub>H<sub>4</sub>; and pyrocatechol, C<sub>6</sub>H<sub>4</sub>(OH)<sub>2</sub>.

References: (1) H.A. Waggener et al., International Electron Devices Meeting, October 1967; J.B. Price, p. 339, in Howard R. Huff and Ronald R. Burgess, eds., Semiconductor Silicon/73, Electrochemical Society, Princeton, N.J., 1973. (2) A.I. Stoller, RCA Review 31, p. 271, 1970; Kenneth E. Bean, IEEE Trans. on Electron Dev. ED-25, p. 1185, 1978; Don L. Kendall, Ann. Rev. Mater. Sci. 9, p. 373, 1979. (3) J.M. Crishal and A.L. Harrington, Electrochemical Society Spring Meeting Abstracts, abst. no. 89, p. 202, 1962. (4) R.M. Finne and D.L. Klein, J. Electrochem. Soc. 114, p. 965, 1967. (5) D.B. Lee, J. Appl. Phys. 40, p. 4569, 1969; Michel J. Declercq et al., J. Electrochem. Soc. 122, p. 545, 1975.

> etching variety as an etch stop. Thus, for example, if lightly doped n-type silicon is unattacked in a particular etch, a thin n-layer could be epitaxially deposited on a heavily doped substrate and the substrate then etched away. One difficulty with these techniques is that the etched surface is often not smooth, due in part to the etch sensitivity to resistivity nonuniformities,

> By changing the concentration of the HF-HNO<sub>3</sub>-CH<sub>3</sub>COOH mixture used for polish etching to the ratio of 1:3:8, the etch rate of silicon, n- or p-type, increases by about a factor of 15 (from ~0.2 to

 $\sim$ 3 µm/minute) when the doping concentration increases above  $\sim$ 10<sup>19</sup> atoms/cc (25). A KOH-H<sub>2</sub>O-isopropyl alcohol solution at 80°C etches silicon with a boron concentration of 2 × 10<sup>18</sup> atoms/cc about 25 times faster than when the silicon is doped to a concentration of 2 × 10<sup>20</sup> atoms/cc (26). Ethylenediamine-pyrocatechol-water etches high-resistivity silicon but not heavily doped p-type (27, 28). By heavily doping with boron and compensating with germanium, a very effective etch stop is produced. Etch ratios of 1:1500 between the lightly doped and heavily doped regions have been observed.

**6.2.6 Electrolytic Silicon Etching** Low-resistivity n-type silicon and p-type silicon can be electrolytically etched when the silicon is made the anode (+ electrode) in a suitable electrolyte, such as 5% by weight HF in water (29, 30). When silicon is electrolytically etched, it is removed in either divalent or tetravalent states, depending on the current density and the electrolyte concentration (31–32). With a high concentration of HF solution and a current density *J* below some critical value  $J_c$ , reactions such as the following are thought to occur (32):

$$Si + 2HF + 2e^+ \rightarrow SiF_2 + 2H^+$$
 6.5

The SiF<sub>2</sub> then reacts to give an amorphous silicon layer (stain) that is often seen after etching:

$$2SiF_2 \rightarrow Si + SiF_4$$
 6.6

The reaction in Eq. 6.6 is then followed by

$$SiF_4 + 2HF \rightarrow H_2SiF_6$$
 6.7

For low HF concentrations, when silicon is dissolved in the divalent state, the reactions can proceed without the formation of amorphous silicon, perhaps as follows:

$$Si + 2H_2O + 2e^+ \rightarrow Si(OH)_2 + 2H^+$$
 6.8a

$$Si(OH)_2 + 2H_2O \rightarrow Si(OH)_4 + H_2$$
 6.8b

$$Si(OH)_4 \rightarrow SiO_7 + 2H_2O$$
 6.8c

$$SiO_2 + 6HF \rightarrow H_2SiF_6 + 2H_2O$$
 6.8d

At higher current densities (and higher anode potentials), silicon dissolves in the tetravalent state, possibly as follows:

$$Si + 4H_2O + 4e^+ \rightarrow Si(OH)_4 + 4H^+$$

It continues according to Eqs. 6.8c and 6.8d and is usually smooth and specular. However, in etching p <sup>+</sup> from n-layers, it is reported that the remaining surface will be rough unless  $H_2SO_4$  is added to the HF solution (33). HF/organic mixtures, such as alcohols, glycols, and glycerine, help maintain a smooth surface during etching of p-type silicon (31). Shining light on n-type silicon will increase the supply of holes and thus the etch rate, and, in fact, the lighted area can be used to produce selective etching (31). Under some conditions, etching of n-type silicon will also produce tunnels in the silicon (8, 34).

Early applications of electrolytic etching of silicon were for polishing and for attempting to slice silicon using very thin wire electrodes. The polishing application was only partially successful, and the slicing application was a complete failure. More recently, electrolytic etching has been used for the thinning of wafers in order to leave thin membranes (29, 30) and for the production of porous silicon, which can then be rapidly oxidized and used for electrical isolation between components (35, 36). It has also been reported that when hot KOH-based anisotropic etches are used, the use of an applied potential allows more precise control of the silicon etch rate (37).

## 6.2.7 Silicon Staining

6.2.8 Gallium Arsenide Etching (General) When a silicon surface is exposed to HNO<sub>3</sub>-HF mixtures either during intentional etching or as a result of being contacted by the solution at the completion of contact window etching, stains will often form. In addition, during boron diffusions, relatively insoluble layers at the silicon-oxide interface may form. They will remain after normal contact etching and appear to be a stain.

Films can be produced on silicon either anodically in HF (p-type silicon) or by straight immersion in a concentrated HF solution containing a small amount of nitric acid or sodium nitrite (38, 39). The exact composition and appearance depend on the resistivity and type, but apparently they are polymerized silicon hydrides (39). The manner in which nitric-HF etchants are quenched will affect the amount of staining. A much thicker film is reported when the nitric acid concentration is low and quenching is by large quantities of water than when the nitric acid concentration is first greatly increased and then quenched with water. Also, apparently enough current flow can occur during oxide contact window etching to cause staining of p-type silicon after the window is completely open unless the back of the wafer is covered with oxide or resist (40).

The etching of gallium arsenide proceeds in a manner analogous to that of silicon: The gallium and arsenic are first oxidized, and then the oxides are dissolved. The GaAs case, however, has a wider choice of oxidants and oxides that are much easier to dissolve. The net result is a wide selection of GaAs etches. They may be broadly categorized as acidic or basic, depending on whether an acid or a base is used to dissolve the oxides. Some selections reported are HCl, HNO<sub>3</sub>, H<sub>2</sub>SO<sub>4</sub>, H<sub>3</sub>PO<sub>4</sub>, citric acid, NaOH, and NH<sub>4</sub>OH. Methanol, an organic, can also be used for oxide dissolution. Common oxidants are hydrogen peroxide and bromine, but  $K_2Cr_2O_7$ , KMnO<sub>4</sub>,  $K_3Fe(CN)_6$ , and Ce(SO<sub>4</sub>)<sub>2</sub> have also been used. The many choices for each function in the etch have given rise to a large number of etchant formulations. Some are less harsh on masks than others, some give better surface polishing, some give more desirable etch profiles, and some are easier and safer to formulate and use. Table 6.2 gives the constituents of a few of the etches that have been reported.

The two different mechanisms of dissolution are straight chemical and those involving charge transfer. The latter may either require an external source of current or be electroless and derive carriers from reduction of the oxidant (redox reactions) (41). In either case, there is generally a kinetic-limited etching region that is high in oxidant and a diffusion-limited region that is high in oxide etchant. As composition is changed from high oxidant to high etchant, a peak

#### TABLE 6.2

Gallium Arsenide Etches

| Composition                                                                               | Comments                                                                     | Reference |
|-------------------------------------------------------------------------------------------|------------------------------------------------------------------------------|-----------|
| 1 NaOCI, 20 H,O                                                                           | Used with mechanical wafer rotation for polishing.                           | (1)       |
| NH <sub>4</sub> OH, H <sub>2</sub> O <sub>2</sub>                                         | Also used with mechanical wafer rotation for<br>polishing.                   | (2)       |
| 3 H <sub>2</sub> SO <sub>4*</sub> I H <sub>2</sub> O <sub>2</sub> , I H <sub>2</sub> O    | For giving polished surface; not good for patterning:<br>erodes photoresist. | (3)       |
| 0,3N-NH4OH, 0.1N-H2O2                                                                     | Minimal undercutting of SiO <sub>2</sub> masking; gives flat<br>surface.     | (4)       |
| 1 HCl, 1 acetic, 1 K,Cr.O7*                                                               | Gentle with photoresist.                                                     | (5)       |
| 10 citric acid, 1 H <sub>2</sub> O <sub>2</sub> †                                         | Gentle with photoresist; no enhanced etching at<br>mask edge.                | (6)       |
| 3 H <sub>3</sub> PO <sub>4</sub> , 1 H <sub>2</sub> O <sub>2</sub> , 50 H <sub>2</sub> O‡ | No enhanced etching at mask edge; can be used<br>with photoresist.           | (7)       |
|                                                                                           |                                                                              |           |

\*12N-HCl, 17N-acetic, 1N-K3Cr3O7.

\*Citric acid = 50% by weight  $C_3H_4(OH)(COOH)_3 \cdot H_2O + 50\%$  by weight  $H_3O$ . \$85% by weight  $H_3PO_4$ , 30% by weight  $H_3O_2$ .

References: (1) A. Reisman and R. Rohr, J. Electrochem. Soc. 111, p. 1425, 1964. (2) J.C. Dayment and G.A.
Rozgonyi, J. Electrochem. Soc. 118, p. 1346, 1971. (3) S. Jida and K. Ito, J. Electrochem. Soc. 118, p. 768, 1971.
(4) J.J. Gannon and C.J. Nuese, J. Electrochem. Soc. 121, p. 1215, 1974. (5) Sadao Adchi and Kunishige Oe, J. Electrochem. Soc. 131, p. 126, 1984. (6) Mutsuyuki Otsubo et al., J. Electrochem. Soc. 123, p. 676, 1976. (7) Yoshifumi Mori and Naozo Watanabe, J. Electrochem. Soc. 125, p. 1510, 1978.

in the etch rate occurs, and the etched profiles change from anisotropic to isotropic.

When the requirement is to etch a shallow, flat depression, such as is required for a MOS gate, neither of these regions is exactly appropriate. Anisotropic etching may leave a re-entrant corner that is difficult to run leads over, and isotropic diffusion-controlled etching will generally leave trenches next to mask edges. As can be seen from Table 6.2, several of the  $H_2O_2$  etches are reported as not exhibiting trenching. This is apparently because a layer forms on the surface of the GaAs during etching and limits the etch rate through the length of time for reactants and etch products to diffuse through it (42).

Gallium arsenide (111) planes, like those of silicon, generally etch slower than other planes. Unlike silicon, however, parallel pairs of planes such as (111) and (111) are not completely equivalent because in the case of gallium arsenide (zinc blende structure), one layer of a (111) double layer is totally comprised of gallium atoms, while the other layer is totally of arsenic. Thus, depending on whether a (111) plane is approached from the [111] or the [111] direction, either a sheet of gallium atoms or a sheet of arsenic atoms will be encountered. By convention, a (111) gallium face is referred to as a (111) A face, and the arsenic face is called the (111) B face. Also by convention, a (111) plane has an A gallium surface, while the (111) plane has a B arsenic surface. The other gallium face planes are (111), (111), and (111). As might be expected, the etch rates of the gallium and arsenic faces are different (the gallium face is slower). Because of this difference in etch rates, instead of the square-based pyramidal etch pits found during etching of an Si (100) plane, rectangular pits sometimes occur in (100) GaAs because one pair of opposing (111) planes etches faster than the other pair. During etching of mesas in (100) faces, the edges of the mask openings should, as in



## 6.2.9 Gallium Arsenide Etching (Anisotropic)

#### FIGURE 6.11

(a) Octahedron bounded by {111} family of planes. (A (100) plane passes through the corners marked 1–2–3–4.) (b) Cross section parallel to either a [011] or a [011] direction through a mesa etched into a (100) wafer when mask edges were oriented parallel to traces of (111) planes on the (100) surface and etch rate is same for A and B faces. (c) Cross section when B faces etch more rapidly than A faces. silicon, be aligned along the traces of {111} planes. However, because of the lack of symmetry just discussed, different slopes on the two pairs of mesa sides may exist, as shown in Fig. 6.11. The four planes of the upright pyramid are the planes that would be exposed if a mesa were etched onto (100) GaAs. A vertical cross section through the mesa parallel to either the [011] or the [011] direction will have identical profiles, as shown in Fig. 6.11b. For etches in which the GaAs B faces are rapidly etching and the A faces are very slowly etching, the slow-etching (111) and (111) planes of the lower half of the octahedron of Fig. 6.11a will be exposed instead of the fast-etching (111) and (111) planes in the upper half. Thus, for GaAs, a vertical section parallel to a [011] direction will look like the one shown in Fig. 6.11b, while one in the [011] direction will appear as shown in Fig. 6.11c.

Under some etching conditions, the A planes may not be the slowest etching, in which case the section profile may not look like those of Figs. 6.11b and 6.11c. In particular, for some etch compositions, (110), (221), (331), or even (111) B planes are exposed, while in others, etching is nearly isotropic. In the latter case, the profile will resemble that of Fig. 6.1. The edge profiles have been extensively studied for a variety of etchants (43, 44). Fig. 6.12 shows the

#### FIGURE 6.12

Mesa profiles that are observed when a sulfuric acid, hydrogen peroxide, water mixture is used for etching GaAs and in which the slow-etching planes in (011) section are (331) and (111). (Source: Don W. Shaw, J. Electrachem. Soc. 128, p. 874, 1981. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)

| Acid                           | Volume   | Etch rate<br>(100)    | Cross-sectional profiles |                            |
|--------------------------------|----------|-----------------------|--------------------------|----------------------------|
| Aciu                           | ratios*  | $(\mu m/minute^{-1})$ | (011) section            | $(01\overline{1})$ section |
| H <sub>2</sub> SO <sub>4</sub> | 1:8:1    | 14.6                  | T                        | 7                          |
| H <sub>2</sub> SO <sub>4</sub> | 1;8;40   | 1.2                   | T                        | 7                          |
| H <sub>2</sub> SO <sub>4</sub> | 1:8:80   | 0.54                  | $\overline{\langle}$     | 7                          |
| H <sub>2</sub> SO <sub>4</sub> | 1:8:160  | 0.26                  |                          | 7                          |
| H <sub>2</sub> SO <sub>4</sub> | 1:8:1000 | 0.038                 | 7                        | 7                          |
| H <sub>2</sub> SO <sub>4</sub> | 1:1:8    | 1.3                   | T                        | 1                          |
| H <sub>2</sub> SO <sub>4</sub> | 4:1:5    | 5.0                   | $\overline{\zeta}$       | 1                          |
| H <sub>2</sub> SO <sub>4</sub> | 8:1:1    | 1.2                   |                          | 1                          |
| H <sub>2</sub> SO <sub>4</sub> | 3:1:1    | 5.9                   | 7                        | 7                          |

\*Acid (concentrated):H2O2(30%): H2O

range of profiles observed for the  $H_2SO_4$ - $H_2O_2$ - $H_2O$  system as the relative proportions of the components change.

# 6.2.10 Preferential Etches

Preferential etches are used to delineate various crystallographic defects such as dislocations, stacking faults, and twin boundaries; polished-over mechanical damage; and swirl patterns in silicon. Many of these defects are associated with slice manufacture and will have been evaluated before the slice reaches a wafer-fab facility. However, since various high-temperature wafer-fab operations have the capability of introducing large quantities of defects, evaluation during processing is very important. (The various defects to be expected are discussed in the chapters on diffusion, oxidation, and epitaxy.)

The six most common silicon etches are given in Table 6.3, along with comments on their general behavior. Preferential gallium arsenide etches seem not to have been as thoroughly studied as those of silicon, but Table 6.4 lists some that have been reported. The performance of the various etches depends in part on the exact

#### TABLE 6.3

Preferential Silicon Etches for Defect Delineation

| Composition                                                                                                                                                               | Comments                                                                                                                                                                                                                                                                                                                                                                                                                                                | Reference                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1 HF, 3 HNO <sub>3</sub> , 10<br>CH <sub>3</sub> COOH                                                                                                                     | Delineates defects in (111) silicon; requires<br>long etch times; is concentration-<br>dependent.                                                                                                                                                                                                                                                                                                                                                       | (1)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 1 HF, 1 (5M–CrO <sub>3</sub> )                                                                                                                                            | Delineates defects in (111); needs agitation;<br>does not reveal etch pits in (100) very<br>well.                                                                                                                                                                                                                                                                                                                                                       | (2)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 2 HF, 1 (0.15M-K <sub>2</sub> Cr <sub>2</sub> O <sub>7</sub> )                                                                                                            | Delineates OSF in (100) silicon very well;<br>agitation reduces etch times.                                                                                                                                                                                                                                                                                                                                                                             | (3)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 60 ml HF, 30 ml HNO <sub>3</sub> ,<br>30 ml (5M–CrO <sub>3</sub> ), 2 grams<br>Cu (NO <sub>3</sub> ) <sub>2</sub> , 60 ml CH <sub>3</sub> COOH,<br>60 ml H <sub>2</sub> O | Delineates defects in (100) and (111)<br>silicon; requires agitation.                                                                                                                                                                                                                                                                                                                                                                                   | (4)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 2 HF, 1 (1M-CrO <sub>3</sub> )                                                                                                                                            | Delineates defects in (100) silicon without<br>agitation; works well on resistivities 0.6-<br>15.0 Ω-cm n- and p-types.                                                                                                                                                                                                                                                                                                                                 | (5)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 2 HF, 1 (M-CrO <sub>3</sub> ), 1.5 H <sub>2</sub> O                                                                                                                       | Works well on heavily doped (100) silicon.                                                                                                                                                                                                                                                                                                                                                                                                              | (5)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 1 HF, 1 (1.5M–CrO <sub>3</sub> )                                                                                                                                          | Delineates defects on (111), (100), and (110) silicon without agitation.                                                                                                                                                                                                                                                                                                                                                                                | (6)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|                                                                                                                                                                           | <ol> <li>1 HF, 3 HNO<sub>3</sub>, 10<br/>CH<sub>3</sub>COOH</li> <li>1 HF, 1 (5M–CrO<sub>3</sub>)</li> <li>2 HF, 1 (0.15M–K<sub>2</sub>Cr<sub>2</sub>O<sub>7</sub>)</li> <li>60 ml HF, 30 ml HNO<sub>3</sub>,<br/>30 ml (5M–CrO<sub>3</sub>), 2 grams<br/>Cu (NO<sub>3</sub>)<sub>2</sub>, 60 ml CH<sub>3</sub>COOH,<br/>60 ml H<sub>2</sub>O</li> <li>2 HF, 1 (1M–CrO<sub>3</sub>)</li> <li>2 HF, 1 (M–CrO<sub>3</sub>), 1.5 H<sub>2</sub>O</li> </ol> | <ul> <li>1 HF, 3 HNO<sub>3</sub>, 10<br/>CH<sub>3</sub>COOH</li> <li>1 HF, 1 (5M-CrO<sub>3</sub>)</li> <li>2 HF, 1 (0.15M-K<sub>2</sub>Cr<sub>2</sub>O<sub>7</sub>)</li> <li>60 ml HF, 30 ml HNO<sub>3</sub>,<br/>30 ml (5M-CrO<sub>3</sub>), 2 grams<br/>Cu (NO<sub>3</sub>)<sub>2</sub>, 60 ml CH<sub>3</sub>COOH,<br/>60 ml H<sub>2</sub>O</li> <li>2 HF, 1 (1M-CrO<sub>3</sub>), 1.5 H<sub>2</sub>O<br/>1 HF, 1 (1.5M-CrO<sub>3</sub>)</li> <li>Delineates defects in (100) silicon very well:<br/>agitation reduces etch times.<br/>Delineates defects in (100) and (111)<br/>silicon; requires agitation.</li> <li>Delineates defects in (100) and (111)<br/>silicon; requires agitation.</li> <li>Delineates defects in (100) silicon without<br/>agitation; works well on resistivities 0.6-<br/>15.0 Ω-cm n- and p-types.</li> <li>Works well on heavily doped (100) silicon.<br/>Delineates defects on (111), (100), and</li> </ul> |

Note: Agitation is ultrasonic.

References: (1) W.C. Dash, J. Appl. Phys. 27, p. 1193, 1956. (2) E. Sirtl and A. Adler, Z. Metallk, 52, p. 529, 1961. (3) F. Secco d'Aragona, J. Electrochem. Soc. 119, p. 948, 1972. (4) Margaret Wright Jenkins, J. Electrochem. Soc. 124, p. 757, 1977. (5) D.G. Schimmel, J. Electrochem, Soc. 126, p. 479, 1979. (6) K.H. Yang, J. Electrochem. Soc. 131, p. 1140, 1984.

| TA | BL | E | 6.4 |
|----|----|---|-----|
|    |    |   |     |

| Preferential Gallium |  |
|----------------------|--|
| Arsenide Etches      |  |

| Name   | Composition                                               | Comments                        | Reference |
|--------|-----------------------------------------------------------|---------------------------------|-----------|
| Schell | 1 HNO <sub>3</sub> , 2 H <sub>2</sub> O                   | Shows pits on Ga [111] faces.   | (1)       |
| R–C    | 1 HF, 5 HNO <sub>3</sub> , 10 AgNO <sub>3</sub> solution* | Shows pits on both [111] faces. | (2)       |
| W-R    | 2 HCl. 1 HNO <sub>3</sub> , 2 H <sub>2</sub> O            | Shows pits on Ga {111} faces.   | (3)       |
| A-B    | 1 HF, 2 H,O, Cr + Ag <sup>+</sup>                         | Shows pits on As {111} faces.   | (4)       |
| KOH    | Molten KOH at 400°C                                       | Shows pits on (100) faces.      | (5)       |
| Sirtl  | 1 HF, 1 (5M-CrO.) in H.O                                  | Leaves mounds instead of pits.  | (6)       |

\*1% by weight silver nitrate in water.

†Add 33% by weight of CrO, and 0.3% by weight of AgNO, to the HF-H<sub>2</sub>O solution. By premixing CrO, with half of water, and AgNO, with HF and remainder of water, the two solutions can be stored indefinitely and mixed together just before using.

References: (1) H.A. Schell, Z. Metallk. 48, p. 158, 1957. (2) J.L. Richards and A.J. Crocker, J. Appl. Phys. 31, p. 611, 1960. (3) J.G. White and W.C. Roth, J. Appl. Phys. 30, p. 946, 1959. (4) M.S. Abrahams and C.J. Buiocchi, J. Appl. Phys. 36, p. 2855, 1965. (5) Morgan Semiconductor product literature. (6) See Table 6.3.

experimental procedures used, especially including the surface cleanup given before the defect delineation etch. Too short a time will give small and poorly developed features. Excessive times may broaden and smear the features, leave only a replica of the defect etched into the underlying material, or completely remove the layer of interest. Thus, it would be well to study details in the original papers (references in Tables 6.3 and 6.4) before seriously using the etchants. Reference 24 discusses experimental techniques and interpretation of results.

6.2.11 Silicon Dioxide Etching HF in various dilutions in water and often buffered with ammonium fluoride is the standard silicon dioxide wet etchant. Si and GaAs etch in HF at a minuscule rate and thus provide an etch stop after an overlying oxide layer is patterned. Hot alkaline bases such as sodium or potassium hydroxide will also etch at useful rates. They are, however, very harsh on organic masking materials and also etch Si and GaAs.

In dilute water solutions, HF will partially dissociate and give

$$HF \rightarrow F^- + HF_2$$
 6.9

In more concentrated solutions, additional complex ions, perhaps  $H_3F_4$  or  $H_4F_5$ , are also present. The overall reaction with SiO<sub>2</sub> is (45)

 $SiO_2 + 6HF \rightarrow H_2SiF_6 + 2H_2O$  6.10

However, the actual mechanism is quite involved, and the SiO<sub>2</sub> etch rate depends on the HF and HF $_2^-$  concentrations but not on F<sup>-</sup> (46). This type of behavior is not unique to the HF–SiO<sub>2</sub> system, however. It is, for example, reported that zinc is attacked by molecular HCl and by molecular acetic acid (47). Since F<sup>-</sup> ions alone will not etch oxide, a salt of fluorine alone dissolved in water cannot be used for etching. The pH must be adjusted to provide HF $_2^-$  ions or HF. Stirring or ultrasonic agitation affects the etch rate only in the region where diffusion dominates, which is reported to be for buffer ratios<sup>7</sup> greater than 10:1 (48).

Hydroxides etch oxide through reactions giving water-soluble silicates; for example,

$$2\text{KOH} + \text{SiO}_2 \rightarrow \text{K}_2\text{SiO}_3 + \text{H}_2\text{O} \qquad 6.11$$

Typical room-temperature etch rate curves for thermal oxide in both HF and buffered HF are shown in Fig. 6.13. In both cases, the rate increases with increasing HF concentration, and despite the fact that  $F^-$  ions do not by themselves etch SiO<sub>2</sub>, the buffered HF has a somewhat higher etch rate, which is apparently due to the increased concentration of HF<sub>2</sub> ions (46). Increasing temperature increases the etch rate, as shown in Fig. 6.14, with buffered solutions having a slightly higher activation energy.



FIGURE 6.13

Etch rate of 25°C of thermal SiO<sub>2</sub> in HF solutions. (Source: From data in Robert Herring and J.B. Price, Electrochem, Soc. Ext. Abst. 73-2, abst. no. 160, 1973; G.I. Parisi et al., J. Electrochem. Soc. 124, p. 917, 1977; and Chao Chen Mai and James C. Looney, SPC and Solid State Technology, p. 19, January 1966.)

<sup>&</sup>lt;sup>7</sup>Buffer ratio as applied to etching of oxides with HF generally means the volume ratio of a solution of 40% by weight NH<sub>4</sub>F in water to 49% HF

Etch rate of thermal oxide in HF solutions versus temperature. (Source: Data from G.I. Parisi et al., J. Electrochem. Soc. 124, p. 917, 1977, and Robert Herring and J.B. Price, Electrochem. Soc. Ext. Abst. 73–2, abst. no. 160, 1973.)



A pure CVD oxide as deposited etches faster than a thermal oxide because of structural defects. Subsequent annealing at high temperatures will generally reduce the etch rate differential. Doped oxide etch rates (whether the oxide is doped during CVD growth or by subsequent doping of a thermal oxide) depend on the impurity and the specific etchant. As the silicon content of an oxide increases, the etch rate in HF decreases until in the limit of SiO, nitric acid must be added to the HF. The addition of arsenic or phosphorus to the SiO, increases the etch rate in HF or buffered HF. Before the application of X-ray fluorescence analysis, the etch rate was used as a measure of the phosphorus content of phosphorus-doped oxides. Small boron additions decrease the rate, while larger ones increase it (49). By changing the etch composition to one containing HNO<sub>3</sub>, such as in P-etch,8 the lightly boron-doped oxide will etch much faster than the thermal oxide (50). Stress in the films also affects the etch rate; increasing stress increases the rate (51).

Fig. 6.15 gives data for etching with KOH. The caustics are used very little for patterning oxide, and when they are, it is with an inorganic mask material. More often, the oxide etch rate is of interest when the oxide is used as a mask during anisotropic etching of silicon.

<sup>\*</sup>Named after W.A. Pliskin (W.A. Pliskin and R.P. Gnall, J. Electrochem. Soc. 111, p. 872, 1964), its composition is 15 ml HF, 10 ml HNO<sub>3</sub>, 300 ml H<sub>2</sub>O.

Etch rate of thermal silicon dioxide by hot KOH. (Source: Data courtesy of Richard Yeakley, Texas Instruments, Incorporated.)



6.2.12 Silicon Nitride Etching

Silicon nitride can be wet etched with either HF solutions or with hot phosphoric acid. Phosphoric acid is the "standard" wet nitride etch. In it, the nitride can etch more than 40 times as fast as CVD oxide, which is often used as a mask. The selectivity decreases at high temperatures, but in order to have useful etch rates, hightemperature boiling concentrated H<sub>3</sub>PO<sub>4</sub> must be used. For example, 91.5% H<sub>3</sub>PO<sub>4</sub> boils at 180°C, etches high-temperature tube nitride<sup>9</sup> at approximately 100 A/minute, and etches CVD oxide at about 10 A/minute (52). Under these same conditions, single-crystal silicon etches about 30% as fast as CVD oxide. HF solutions at low temperature etch the nitride much slower than oxide so that undercut control is difficult to maintain if etching nitride over oxide. Typically, the etch ratio is from 1:10 to 1:100, depending on the etch composition and nitride deposition conditions. However, by etching with a low HF concentration HF-H2O solution at temperatures near the boiling point, comparable etch rates of less than 100 Å/minute have been reported (53, 54). By using an HF-ethylene glycol mixture, comparable rates for tube nitride, thermal oxide, and CVD oxide are observed over a considerably wider temperature and composition range (55). However, wet etch patterning of silicon nitride

<sup>&</sup>quot;"Tube" nitride is deposited at temperatures of ~800°C or above in a quartz tube reactor and has a much lower etch rate than "plasma" nitride deposited at temperatures in the 400°C range in a plasma reactor.

is now seldom done, having been largely superseded by plasma etching.

6.2.13 Metal Etching Since metal is usually deposited over an insulating layer such as silicon oxide or nitride, the etchant used must not noticeably attack oxide or nitride in the time required to remove the metal. On rare occasions, stripping a metal from bare silicon may be desired, but in these cases, special care and formulations can be used if necessary. When a single metal, such as the commonly used aluminum. is to be patterned, there is usually little difficulty. However, when multiple layers must be removed, considerable undercutting can occur. Much of this undercutting is caused by the enhanced etch rate of a metal overlayed by another appreciably lower in the electromotive series (56). For the case of gold over aluminum and gold over molybdenum, rate enhancements of from 10 to 100 have been observed. Table 6.5 shows the position of some of the metals in the electromotive series and illustrates the point that when gold is used as the top metal, trouble with all other metals is a strong possibility. However, by the proper choice of etchant, this effect can be minimized.

Since metal films are neither single crystal nor amorphous, the differently oriented grains may etch at different rates. In addition, the grain boundaries will generally etch more rapidly. The result is that wet-etched metal edges will often be more ragged than oxide edges. When the film has precipitates of some other phase in it, such as silicon in aluminum doped with silicon, the precipitates may not etch in the primary etch and thus will be left behind scattered on the surface.

Aluminum is the most commonly used IC metallization. Except for nitric acid, in which it is passivated, aluminum can be etched by strong acids or bases; for example,

 $2AI + 6NaOH \rightarrow 2Na_3AIO_3 + 3H_2$  6.12

 $2AI + 6HCI \rightarrow 2AICI_1 + 3H_2$ 

Were it not for a protective layer forming on the surface, it would also react directly with water to give hydrogen and aluminum hydroxide. Since the etchants are all water based, reactions such as those of Eq. 6.12 are not complete in that they do not provide for removal of the protective layer. A typical etch for use on silicon ICs is

> 20 acetic acid (parts by volume) 3 nitric acid 77 phosphoric acid

TABLE 6.5

| Metal | Volts |
|-------|-------|
| Al    | +1.7  |
| Cr    | +0.56 |
| Co    | +0.28 |
| Ni    | +0.23 |
| Ag    | -0.8  |
| Pd    | -0.8  |
| Pt    | -0.86 |
| Au    | -1.5  |

This solution is not applicable to aluminum over GaAs since it will etch GaAs, but a 1 HCl, 2  $H_2O$  mixture can be used instead. (For wet etchants suitable for other metals, see, for example, the extensive tables in reference 54.) As in oxide etching, the trend is away from wet etching and to plasma etching, particularly as more interconnect systems incorporate silicides, which are difficult to wet etch.

# 6.3 PLASMA ETCHING

Although there are many plasma processing modes, the two embodiments that dominate the semiconductor plasma etch field are the plasma etching mode and reactive ion etching (RIE). Fig. 6.16 shows the major components of a parallel-plate multiple-slice etching machine. Energy is supplied by an RF generator. The wafers lie flat on one of the electrodes, and a plasma is generated between the electrodes. Biasing of the electrodes by the DC voltage from the RF generator power supply is excluded by capacitive coupling the RF to the electrode. The chemical species in the plasma are determined by the source gas(es) used. The operating pressure depends on the plasma mode being used and may range from a few torr to fractions of a millitorr.

In the plasma etching mode, the electrode areas are symmetrical, and the DC voltage between the plasma and either electrode is the same and relatively small (57). The various ions and free radicals that are generated in the plasma diffuse to the electrode and wafer surfaces, where they can react with the material being etched to form volatile products that are pumped away.



Reactive ion etching uses a negative self-bias DC voltage developed between the plasma and the wafer electrode to accelerate ions from the plasma to the wafers. An asymmetric DC bias between the plasma and the two electrodes is developed when the two electrodes have different areas, with the larger voltage being developed on the smaller electrode (57). Thus, when the electrode on which the wafers rest is deliberately made much smaller than the other electrode, plasma-to-wafer electrode voltages of several hundred volts are readily achievable. In the RIE mode, some etching occurs by free radicals, but more is by ions that are accelerated to the surface. Part of the ion etching is by the ions' chemically reacting at the surface, and part is by physical removal of material when it is struck by the incoming ion (sputtering).

A third kind of etching occasionally used is reactive ion beam etching. In it, ions are produced in a plasma as before. However, they are then extracted from the plasma and accelerated toward the wafer by voltages applied to suitable electrodes. In this manner, the ion density and the accelerating voltage can be set independently, and neutral radicals are essentially eliminated.

Pure plasma mode etching is substantially isotropic without specifically introducing additional reactants to produce sidewall passivation. Without such passivation, this mode is not particularly useful for fine-line etching. Reactive ion and reactive ion beam etching can be quite anisotropic, but their selectivity may not be as good as that of the plasma mode, particularly with regard to photoresist. In addition, because of the high-energy ions, radiation damage to the wafers may occur.

The accelerating voltage used in the RIE process is possible because, while the plasma is conductive and can have a relatively uniform DC potential throughout its volume, the dark space that separates it from the electrodes is guite nonconductive. The voltage difference that develops between the ground electrode and the plasma is greater than the lowest ionization potential of the gas in the chamber (>10-15 V). The voltage between the plasma and the RF-powered electrode can be much greater when the area of the grounded electrode is greater. This electrode is DC isolated from the RF generator by a capacitor and thus cannot transfer charge to the external circuitry. Hence, the negative electron charge that accumulates during half of the RF cycle must be exactly balanced by the positive ion charge during the next half-cycle. Since the electrons are more mobile than the ions, a negative potential will build up until the quantities match. The result is that the electrode becomes negative with respect to both the plasma and the ground electrode. Depending on conditions, the voltage differential may be several hundred volts. The general character of the voltage distribution across the plasma and dark spaces is shown in Fig. 6.17. As the chamber pressure increases, the RF electrode DC voltage decreases, as shown in Fig. 6.18. An ungrounded plate in the plasma will acquire a negative potential (floating potential) with respect to the plasma for the same reason that the hot electrode does. However, the voltage difference between the plasma and the floating





electrode will generally not be nearly as much as between the plasma and the hot electrode (58).

When wafers are placed on a floating plate, the etching is by free radicals and reactive ions and will be rather isotropic. If the wafers are attached to the RF electrode, any positive ions present will be accelerated from the plasma to the wafers and will more nearly travel in a path normal to the wafer surface, thus approximating the conditions of Fig. 6.2b. The number of ions deviating substantially from normal will depend on the amount of scattering, which, in turn, depends on the mean free path length relative to the width of the sheath (dark space).

Using classical kinetic gas theory and assuming that the molecules and ions in the reactor are in thermal equilibrium at some temperature  $T_1$ , the mean free path L of a specific molecule  $M_1$  (or fragment) moving among a large number of  $M_2$  molecules is approximated by<sup>10</sup>

$$L \approx \frac{1}{\xi^2 \psi \pi \sqrt{2}} \tag{6.13}$$

where  $\xi$  is the average diameter of the specific molecule  $M_1$  and a molecule  $M_2$  of the main body of molecules and  $\psi$  is the number of molecules per cc.<sup>11</sup> The equilibrium number of molecules  $\psi$  per cc in a vessel with pressure p and temperature  $T_1$  (K) is given by

$$\psi = \left(\frac{p}{p_{\text{atm}}}\right) \left(\frac{273}{T_1}\right) \times 2.7 \times 10^{19} \text{ molecules/cc} \qquad 6.14$$

However, just because the mean free path is comparable to the dark space width does not mean that all of the ions leaving the plasma reach the wafer without first having a collision and path change. The fraction f of ions that have *not* had a collision and thus a change in direction after traveling a distance X is given by

$$f = e^{-X/L} 6.15$$

For a sheath of width L, only 1/e (37%) of the ions leaving the plasma will reach the wafer traveling in the same direction as when they left.

EXAMPLE

Estimate the value of L for a Cl ion in the sheath for a system operating at 0.4 Pa with chlorine gas as the predominant constitu-

<sup>&</sup>quot;See a standard text on the kinetic theory of gases for various corrections, such as for the effect of differing masses of  $M_i$  and  $M_2$  and for the case of only a few of  $M_i$  among huge numbers of  $M_2$  versus comparable quantities of each.

<sup>&</sup>lt;sup>11</sup>For reactive species, the cross section may be substantially larger than the physical cross section.

ent. Assume that the effect of a DC voltage across the sheath is only to initially direct the ions toward the wafer. Further, assume that the gas temperature in this region is 200°C.

Atmospheric pressure is about 10<sup>5</sup> Pa, and the diameter of a chlorine ion is about 2 Å (59). Assume that the diameter of a chlorine molecule is twice that, or 4 Å. Thus  $\xi$  equals 3 Å. From Eq. 6.14,  $N = 6.3 \times 10^{13}$  molecules/cc. Substituting these values into Eq. 6.13 gives L = 4 cm.

In addition to the possible anisotropic etching, acceleration of ions also leads to an increased etching rate. This arises both because of normal sputtering and because the increased energy enhances the ion reactivity at the surface. The relative effect of nonreactive argon ions (atomic weight 40) and reactive  $CF_3$  ions (atomic weight 69) is shown in Fig. 6.19.

In all etching except that of photoresist (stripping), a halogen species is used. Because of their inherent safety, various Freons<sup>12</sup> or other fluorocarbons, many of which are listed in Table 6.6, are often used as input gases, but SiCl<sub>4</sub>, BCl<sub>3</sub>, Cl<sub>2</sub>, HCl, SF<sub>6</sub>, and NF<sub>3</sub> are also



<sup>&</sup>quot;Registered trademark of E.1. du Pont de Nemours & Co., Inc.

#### 6.3.1 Etching Reactions

#### FIGURE 6.19

Relative etch rate of  $SiO_2$  by plasma-generated argon and CF<sub>3</sub> ions versus ion energy. (*Source:* From data of J.M.E. Harper et al., *J. Electrochem. Soc.* 128, p. 1077, 1981.)

| TA | BL | E | 6. | 6 |
|----|----|---|----|---|
|    |    |   |    |   |

Fluorocarbon Gases

| Formula                       | Common Name | Chemical Name                                |
|-------------------------------|-------------|----------------------------------------------|
| CCLF                          | Freon* 11   | Trichlorofluoromethane                       |
| CCI,F,                        | Freon 12    | Dichlorodifluoromethane                      |
| CCIF,                         | Freon 13    | Chlorotrifluoromethane                       |
| CBrF <sub>3</sub>             | Freon 13B1  | Bromotrifluoromethane                        |
| CF.                           | Freon 14    | Tetrafluoromethane                           |
| C.F.                          | Freon 116   | Hexafluoroethane (perfluoroethane)           |
| C <sub>4</sub> F <sub>8</sub> | Freon 118   | Octafluoropropane (perfluoropropane)         |
| C <sub>4</sub> F <sub>8</sub> | Freon C318† | Octofluorocyclobutane (perfluorocyclobutane) |
| C,CLF                         | Freon 114   | 1.2-Dichlorotetrafluoroethane                |
| C,CIF,                        | Freon 115   | Chloropentafluoroethane                      |
| CHCl <sub>y</sub> F           | Freon 21    | Dichlorofluoromethane                        |
| CHCIF <sub>2</sub>            | Freon 22    | Chlorodifluoromethane                        |
| CHF,                          | Freon 23    | Trifluormethane (Fluoroform)                 |

\*Freen is a registered trademark of E.1. du Pont de Nemours & Co., Inc. †Do not confuse with  $C_aF_8$ , octofluoro-2-butene (perfluoro-2-butene), which is considered moderately toxic.

*Note:* Freons are considered relatively safe, but since chlorinated fluorocarbons containing no hydrogen, when allowed to escape into the air, appear to cause damage to the upper atmosphere, there is a chemical industry drive to replace them with alternative gases.

widely used.  $H_2$ ,  $O_2$ , and inert gases such as Ar are occasionally added to the primary etchant gas. Silicon, silicon dioxide, silicon nitride, many silicides, and some metals can be etched by fluorine compounds, but aluminum must be etched with chlorine in order to have volatile reaction products. Photoresist is stripped by oxidizing it in an oxygen plasma. The specific reactions involved are not necessarily known, and indeed several may take place simultaneously. Further, just because both ions and free radicals may be present does not mean that they both participate in the etching reaction.

For satisfactory etching, several kinds of reactions may be desirable, and to that end there may be a multiplicity of input gases to the plasma. The plasma products may etch the material, but some of the by-products may deposit on the etching surface and either slow or stop the etching. In this case, oxygen, for example, may be included to react with the deposit. Some etchants may react at a reasonable rate with the main material to be etched but hardly at all with its surface oxide film. However, some other plasma gas may be very good at removing the oxide but may otherwise be a slow etcher. By combining the two, a good compromise may be obtained. Occasionally, when reactive ion etching of some component is difficult, physical sputtering may need to be emphasized, in which case, an inert gas that can be ionized may be included in the plasma. Table 6.7 lists many of the feed gases that have been proposed for plasma etching specific semiconductor materials. Silicon dioxide apparently etches by reactions such as (60)

$$3SiO_2 + 4CF_3^+ \rightarrow 2CO + 2CO_2 + 3SiF_4 \qquad 6.16$$
  
$$SiO_2 + 2CHF_2^+ \rightarrow 2CO + H_2 + SiF_4$$

Silicon etching in fluorine plasmas appears to be due to the fluorine free-radical interaction (61)

$$Si + 4F^* \rightarrow SiF_4$$
 6.17

The silicon nitride etching reaction also involves the fluorine radical in the reaction (62):

$$Si_3N_4 + 12F' \rightarrow 3SiF_4 + 2N_2$$
 6.18

Many metals such as Ti, W, Ta, and Mo form volatile fluorides and can be etched rapidly with fluorine-containing plasmas. Other metals such as Al do not form volatile fluorides (63). However, in this case, chlorine-based plasmas can be used. In the case of aluminum, AlCl<sub>3</sub> is formed and, if the system is kept warm, will volatilize and be pumped out. Otherwise, it collects on the wafers and on equipment walls. If the AlCl<sub>3</sub> is exposed to moisture, it will react to form HCl and Al<sub>2</sub>O<sub>3</sub>. The HCl can cause corrosion problems, and the Al<sub>2</sub>O<sub>3</sub> can cause particulate problems. Other chlorine compounds left on the aluminum surface, and even in the overlaying resist, will also cause corrosion if not removed. Thus, a final fluorine-based plasma is often used to replace the chlorine compounds remaining behind with inert fluorine compounds. In addition, the resist can then be stripped in situ in an oxygen plasma to eliminate any chlorine compounds that may not have been removed.

| TA |  |  |
|----|--|--|
|    |  |  |
|    |  |  |

Plasma Etchants

| Material/Etchant     | Comments                                                                                                       | Reference |
|----------------------|----------------------------------------------------------------------------------------------------------------|-----------|
| Silicon Dioxide      | The second s |           |
| $CF_4 + O_2 (10\%)$  | Addition of O <sub>2</sub> increases etch rate.                                                                | (1)       |
| CHF <sub>3</sub>     | Better selectivity over silicon.                                                                               | (4)       |
| $C_2F_6$             |                                                                                                                | (5)       |
| $C_2F_6$<br>$C_3F_8$ | Increased rate over CF <sub>4</sub> .                                                                          | (4)       |
| Silicon Nitride      |                                                                                                                |           |
| $CF_4 + O_2(4\%)$    |                                                                                                                | (6)       |
| CHF                  |                                                                                                                | (5)       |
| $C_2F_6$             |                                                                                                                | (5)       |
|                      |                                                                                                                | (continue |

TABLE 6.7 (continued)

| Material/Etchant                                                            | Comments                                                                                | Reference |
|-----------------------------------------------------------------------------|-----------------------------------------------------------------------------------------|-----------|
| $SF_6 + He$                                                                 |                                                                                         | (7)       |
| Polysilicon                                                                 |                                                                                         |           |
| $CCIF_1 + CI_2$                                                             | Anisotropy and oxide selectivity changes with gas ratio.                                | (8)       |
| $CHCL_3 + Cl_2$                                                             | Use mixture to get uniform initiation; change to Cl <sub>2</sub> for oxide selectivity. | (9)       |
| SF <sub>6</sub>                                                             | Good oxide selectivity.                                                                 | (10)      |
| NF <sub>3</sub>                                                             | Isotropic; high (1 µm/minute) etch rate.                                                | (11)      |
| CCl <sub>4</sub>                                                            | Somewhat anisotropic.                                                                   | (11)      |
| $CF_4 + H_2$                                                                | Poor oxide selectivity; hydrogen gives anisotropy.                                      | (12)      |
| C <sub>2</sub> ClF <sub>5</sub>                                             | Poor oxide selectivity.                                                                 | (13)      |
| Single-Crystal Si<br>Same etchants as<br>polysilicon<br>Single-Crystal GaAs |                                                                                         |           |
| CCl <sub>2</sub> F <sub>2</sub>                                             | Gives rough surface.                                                                    | (22)      |
| $CCl_2F_2$<br>$CCl_2F_2 + O_2$                                              | $O_2$ minimizes surface film formation.                                                 | (23)      |
| Cl <sub>2</sub> -SiCl <sub>4</sub>                                          | Rough surfaces above 10% Cl <sub>2</sub> .                                              | (22)      |
| Aluminum                                                                    |                                                                                         |           |
|                                                                             | Low etch rate; rapid break through Al <sub>2</sub> O <sub>3</sub> .                     | (27)      |
| BCl <sub>3</sub>                                                            | Anisotropic; erratic behavior in presence of moisture.                                  | (15)      |
| CCl₄<br>HBr + Ar )                                                          | Argon sputtering breaks through aluminum oxide.                                         | (16)      |
| HCl + Ar<br>$Cl_2 + Ar$                                                     | Argon sputtering oreaks through animinant oxide.                                        | (10)      |
| $Br_2 + Ar$                                                                 |                                                                                         | (14)      |
| $BCl_3 + Cl_2$                                                              | High etch rate of $Cl_2$ plus rapid etch initiation.                                    | (14)      |
| $BBr_3 + Cl_2$                                                              | As above plus better Al-to-resist selectivity.                                          | (14)      |
| SiCl <sub>4</sub>                                                           | Anisotropics aread atch enter                                                           | (17)      |
| $\mathrm{BCl}_3 + \mathrm{Cl}_2 + \mathrm{CHCl}_3 + \mathrm{N}_2$           | Anisotropic; good etch rate.                                                            | (21)      |
| Molybdenum<br>CF <sub>4</sub>                                               |                                                                                         | (2, 24)   |
| Tungsten                                                                    |                                                                                         |           |
| $Cl_2, Cl_2 + BCl_3$                                                        | Addition of small amounts of BCl, increases etch rate.                                  | (18)      |
| $CBrF_3 + O_2 + He$                                                         | ruanton of shart anothes of berg norouses even rate.                                    | (19)      |
|                                                                             |                                                                                         | 1         |
| Titanium                                                                    |                                                                                         | (2)       |
| $C_2Cl_2F_4$                                                                |                                                                                         | (25)      |
| $CF_4$ , $CCIF_3$<br>$CBrF_3 + He + O_2$                                    | Good selectivity over Si nitride.                                                       | (25)      |
|                                                                             | Good selectivity over 51 million.                                                       | (25)      |
| $Ti:W \\ Cl_2 + O_2$                                                        |                                                                                         | (24)      |
| Ti Silicide                                                                 |                                                                                         |           |
| $CF_4 + O_2$                                                                |                                                                                         |           |
|                                                                             |                                                                                         |           |
| Ta Silicide<br>$SF_6 + C_2 ClF_5$                                           | 50/50 flow rate gives straight walls and good oxide selectivity.                        | (3)       |

276

TABLE 6.7 (continued)

| Material/Etchant                | Comments                                                            | Reference |
|---------------------------------|---------------------------------------------------------------------|-----------|
| $SF_6 + CCl_2F_2$               |                                                                     | (26)      |
| W Silicide                      |                                                                     |           |
| $SF_6 + C_2 C F_5$              | 50/50 flow rate gives straight walls and good oxide<br>selectivity. | (3)       |
| $Cl_2, Cl_3 + BCl_1$            | Addition of small amounts of BCl, increases etch rate.              | (18)      |
| CCl <sub>2</sub> F <sub>2</sub> | service construction part and spectrum of an of the                 | (20)      |
| Gold                            |                                                                     |           |
| $C_2Cl_2F_4$                    | Low oxide etch rate.                                                | (2)       |
| $C_1CI_1F_4 + O_2$              | Addition of oxygen increases gold etch rate.                        | (2)       |

*Note:* This table should not be considered as complete, but only as a starting point in considering possible etchants. The literature on plasma etches is quite voluminous. The references given in each of these references should also be checked.

References: (1) Alan R. Reinberg, Etching for Pattern Definition, p. 91, Electrochemical Society, Princeton, N.J., 1976.

(2) W.H. Legat and H. Schilling, Electrochem. Soc. Ext. Abst. 75-2, abst. no. 130, p. 336, 1975.

(3) Stephen E. Clark et al., Solid State Technology, p. 235, April 1984.

(4) Mitch Hamamoto, Electrochem, Soc. Ext. Abst. 75-2, abst. no. 129, p. 335, 1975.

(5) Aaron D. Weiss, Semiconductor International, p. 56, February 1983.

(6) D.L. Flamm et al., Electrochem, Soc. Ext. Abst. 81-2, abst. no. 270, 1981.

(7) Richard L. Bersin, Solid State Technology, p. 117, April 1978; International Plasma Corporation Specification PTE-4A, 1975.

(8) E. Laes et al., Electrochem. Soc. Ext. Abst. 83-1, p. 235, 1983.

(9) F. Faili and F. Wong, in G.S. Mathad et al., eds., *Plasma Processing*, pp. 458-469, Proc. Electrochem. Soc. 87-6, 1987.

(10) K.M. Eisele, J. Electrochem. Soc. 128, p. 123, 1981.

(11) Douglas H. Bower, J. Electrochem. Soc. 129, p. 795, 1982.

(12) J.W. Coburn and J.F. Winters, J. Vac. Sci. Technol. 16, p. 391, 1979.

(13) J. Hayes and T. Pandhumsoporn, Solid State Technology, p. 71, November 1980,

(14) H.B. Bell and R.W. Light in G.S. Mathad et al., eds., Plasma Processing, Proc. Electrochem. Soc. 87-6, 1987.

(15) K. Tokunaga and D.W. Hess, J. Electrochem. Soc. 127, p. 928, 1980.

(16) P.M. Schaible et al., J. Vac. Sci. Technol. 15, p. 334, 1978.

(17) E.O. Degenkolb, J. Electrochem. Soc. 129, p. 1150, 1982.

(18) D.S. Fischl and D.W. Hess, J. Electrochem. Soc. 134, p. 2265, 1987.

(19) M.E. Burba et al., J. Electrochem. Soc. 133, p. 2113, 1986.

(20) L. Ephrath and R. Bennett, in VLSI Science and Technology, Electrochemical Society, Pennington, N.J., 1982.

(21) Ching-Hwa Chen, Lam Research Corporation literature.

(22) C.B. Cooper et al., in G.S. Mathad et al., eds., Plasma Processing, Proc. Electrochem. Soc. 87-6, 1987.

(23) A.P. Webb, Semiconductor International, p. 154. May 1985.

(24) William Y. Hatta, in G.S. Mathad et al., eds., Plasma Processing, Proc. Electrochem. Soc. 87-6, 1987.

(25) C.J. Mogab and T.A. Shankoff, J. Electrochem. Soc. 124, p. 1766, 1977.

(26) Jeff Herrmann et al., in G.S. Mathad et al., eds., Plasma Processing, pp. 494-506, Proc. Electrochem. Soc. 87-6, 1987.

(27) S.M. Cabral et al., "Characterization of a BCI, Parallel Plate System for Aluminum Etching," Proc. Kodak. Microelectronics Seminar, pp. 57-60, Dallas, 1981. 6.3.2 Plasma Etch Rates The machine variables available when a typical plasma/RIE etcher is used are operating pressure, wafer temperature, electrode voltage, inlet gas composition, gas flow rate, and loading (area of etchable surface divided by area of electrode). Additionally, the electrode spacing, diameters, and composition can generally be changed before a run. By design, the operating frequency may range from a few hundred kHz to many MHz. The etch rate is so intertwined with these many variables that it is very difficult to make a priori rate predictions. These circumstances are to be contrasted with those of wet etching, where the variables are simply etch composition, temperature, and amount of agitation. Despite the complexity of plasma etching, some general observations can be made.

The etch rate will generally depend on the flux of active species (free radicals and chemically reactive ions), although in some cases a high flux will generate enough by-product to completely stop the reaction. Since the active species are primarily produced by collision of electrons with the neutral gas molecules, any method to increase the number of collisions should increase the flux and thus the etch rate. The generation rate  $r_i$  for a species i is given by (64)

$$r_i = K_i n_e N_{ee}$$

$$6.19$$

where  $K_i$  is the rate constant for the *i*th species,  $n_i$  is the number of electrons, and N<sub>0</sub> is the number of neutral gas molecules (proportional to the operating pressure P). Thus, increasing the pressure or the number of electrons will, in principle, increase r, and the etch rate. The number of electrons can be increased by a power increase or by adding extra ones from, for example, an auxiliary hot wire filament (an equipment redesign). By applying a properly directed magnetic field to the plasma, the electrons can be made to follow a spiral path and thus travel further and produce more ions before reaching a wall (again an equipment redesign). As pressure initially increases, there is also an increase in the number of ionizing collisions of each electron before it finally leaves the plasma since the mean free path is decreased. As the pressure continues to increase. at some point the mean free path will become so short that the electrons will be unable to acquire enough energy between collisions to ionize the gas, and the etch rate will drop. For a given electrode spacing and power setting, as the pressure is increased from a very low value, there will be no plasma and no etching until the pressure reaches some critical value where the plasma is initiated. Etching will then begin and increase in rate rather rapidly. When substantial etching is by reactive ions, the rate will then decrease apparently because the ion energy is reduced due to a decreasing sheath potential (wafer electrode to plasma voltage) as the pressure increases (65).

The etch rate generally increases with increasing power, as is expected from Eq. 6.19. The rate of etching in the RIE mode will also increase because of increased voltage across the sheath, as was illustrated in Fig. 6.19. The sheath voltage drop increases as the gap decreases, with increasing RF voltage, and as the ratio of the area of the ground electrode to hot electrode increases.

Initially, the etch rate will increase as the gas flow rate is increased since the reactant will not be as readily depleted. The rate will then remain essentially independent of flow until it is high enough for the residency time to be less than the lifetime of the various active species, at which point it will again decrease. The residency time  $t_c$  can be calculated from the gas volume divided by the gas flow in the volume. Gas flow  $f_4$  is generally measured outside the plasma chamber under standard conditions of temperature and pressure. Inside a chamber with a reduced pressure  $p_c$ , the flow  $f_c$  will be given by  $f_c = 760 f_A/p_c$ , where  $p_c$  is in torr. Thus,

$$t_r = \frac{60Vp_c}{760f_A}$$
 6.20

where V is the volume,  $f_A$  is in cm<sup>3</sup>/minute, and t, is in seconds.

Adding a gas diluent to the main processing gas may do no more than slow down the etch rate by reducing the concentration of reactant. However, if etching is by the RIE mode, the additional sputtering by the diluent may increase the removal rate either directly or by the removal of inert reaction by-products. An example of this is the removal of the polymers that sometimes form when halocarbons containing hydrogen are used as the plasma etching gas. The additional gas may also react with the species formed in the plasma. change their relative concentrations, and thus change the etch rate (66). For example, in a CF, plasma, both F and CF, are formed, but the CF<sub>2</sub> then recombines with F to form CF<sub>4</sub>. Added oxygen will also combine with  $CF_2$  to give  $CO_2 + F_2$ . Thus, when oxygen is present, some of the CF, will combine with it instead of with F. thereby raising the F concentration and increasing the rate of any etching depending on the F concentration. Hydrogen added will combine with F to form HF and with  $CF_3$  to form  $CF_3 + HF$  so that F is suppressed and the concentration of CF<sub>2</sub> increased.

Even though plasma processes are not in thermal equilibrium and the energy of the etching species is far higher than the thermal energy of the wafer, increasing the temperature of the wafers often slightly increases the etch rate r(T), with the dependency still generally being of the form

$$r(T) = r_0 e^{-E/RT} \tag{6.21}$$

where  $r_a$  is a constant, E is the activation energy in eV, k is the Boltzmann constant (given in Appendix C), and T is the absolute temperature.

### 6.3.3 Selectivity

The selectivity ratio is used in this context as the ratio of the etch rate of the layer being deliberately removed by etching to that of a layer exposed to the etch but not to be intentionally removed. Unlike wet etching, where selectivity is generally very good, plasma etch processes must generally be content with ratios of 25 or less. This ratio is important in determining how far into the next layer etching proceeds and how much the windows in masking material are enlarged during pattern etching.

Based on the expected film etch rate and film thickness uniformity, an estimate can be made of the selectivity required in order not to exceed the allowable penetration depth  $D_s$  into the next layer. Several approaches and approximations can be made, but a simple procedure is to assume that over a wafer (single-slice etching) or over all wafers of a batch process, the etch rate can be approximated by  $E_f = E_u \pm e$  where  $E_o$  is the nominal etch rate, and the film thickness can be approximated by  $D_f = D_o \pm d$  where  $D_o$  is the nominal thickness. The nominal etch time  $t_o$  is then  $D_o/E_o$ . However, any thin region in the area of high etch rate will be removed in a shorter time

$$v_1 = \frac{D_v - d}{E_v + e} \tag{6.22}$$

Thick-film regions in the slow-etching areas will not be removed until a later time

$$t_2 = \frac{D_o + d}{E_o - e}$$
 6.23

Thus, some portions of the underlying film will have been exposed to etch for a time  $\Delta t' = t_2 - t_1$  by the time all of the layer is removed. In some circumstances, the etch time may be extended an amount  $\Delta t''$  past  $\Delta t'$  in order to provide additional processing margin. The total over-etch time  $\Delta t$  is then given by  $\Delta t' + \Delta t''$ . To meet the requirement that no more than  $D_s$  of the second film be removed, its etch rate  $E_s$  must be less than  $D_s/\Delta t$ , and the necessary selectivity can be determined from these expressions.

Mask material removal during pattern etching can cause the window openings to increase in size. However, if the geometries are large and the enlargement is reproducible, it is usually possible to compensate by down-sizing the original mask openings. With geometries having feature sizes not much different from the resist thick-

#### FIGURE 6.20

Effect of mask removal during, etching on window widening.



ness, this approach is not feasible, and anisotropic etching and/or very high resist etching selectivity is required. Even with anisotropic etching, substantial window enlargement can still occur if the resist walls are sloped and selectivity is low. These effects are shown in Fig. 6.20. For isotropic removal of resist that has a sidewall angle  $\theta$ with the horizontal, the opening will increase in width by  $2\Delta W =$  $2\Delta R/\sin \theta$ .  $\Delta R$  is the thickness of resist removed from the top and is given by  $E_r t$  where  $E_r$  is the removal rate of resist. In the limit of  $\theta \rightarrow 90^\circ$ ,  $2\Delta W = 2\Delta R$ . If removal is anisotropic,  $2\Delta W = 2\Delta R \cot \theta$ . When the walls are perpendicular,  $\cot \theta = 0$ , and there is no undercutting. For the maximum  $\Delta W$  allowable in the process,  $\Delta R$  can be calculated for the particular etching regime used, and then the minimum resist/film etching selectivity can be determined from  $(D_u/\Delta R)(1 + d/D_u)(1 + e/E_u)$ .

The straightforward method of improving selectivity is by the choice of gases that do not readily react with one of the layers. For example, the selectivity of SiO<sub>2</sub> relative to aluminum is better when a fluorine-based etchant is used instead of one that is chlorine based since aluminum fluoride is quite nonvolatile and tends to form a protective layer over the aluminum. The mixing of gases can sometimes provide selective mechanisms. For example, the addition of either molecular hydrogen or a hydrogen-bearing compound to fluorine-based etch gases is often used to improve SiO<sub>2</sub> to silicon selectivity. This improved selectivity occurs because hydrogen suppresses the

| TAB | LE | 6.8 |
|-----|----|-----|
|     |    |     |

| Plasma | Etching | Selectivities |  |
|--------|---------|---------------|--|
|--------|---------|---------------|--|

| Material           | Etch                   | Selectivity with Respect to:                           |
|--------------------|------------------------|--------------------------------------------------------|
| Thermal SiO,       | $C_2F_6 + CHF_3$       | Si-15:1, resist-5:1                                    |
| Doped CVD SiO,     | $C_{1}F_{6} + CHF_{1}$ | Si-30:1, resist-10:1                                   |
| Polysilicon        | Cl                     | SiO,-25:1, resist-5:1                                  |
| Aluminum           | $BCI_1 + CI_2$         | SiO <sub>2</sub> -25:1, resist-5:1, polysilicon-3:1    |
| Si <sub>3</sub> N, | $CF_4 + O_3$           | CVD SiO <sub>2</sub> -1:1, resist-3:1, polysilicon-1:8 |
| Resist             | 0,                     | SiO,->1000:1, Si->1000:1                               |

fluorine atoms, and since Si is etched primarily by fluorine and SiO<sub>2</sub> by the  $CF_3^+$  ion, a reduction in the fluorine atom improves selectivity (67). Excessive hydrogen may, however, cause problems such as unwanted polymer buildup or structural damage. Table 6.8 lists selectivities reported for various etchant/material combinations. The actual values can vary substantially depending on the etch mode being used and on the power input and exact gas composition, but the table can serve as a guide.

Because of the poor selectivity associated with plasma etching, endpoint detection is usually used to minimize over-etch times. In principle, several approaches are possible. The thickness of the film being etched can be measured optically by interferometer, ellipsometer (68, 69), or light beam displacement (70). If the film is opaque, the difference in reflectivity between the film being etched and the one beneath it can be used to detect the disappearance of the film being etched (69). The presence of a reaction product or the absence of a reacting species can also be monitored and used to determine when the film etching is complete. Detection in these cases can be either by optical (71–75) or by mass (76) spectroscopy.

Thickness measurements require an appropriately sized area to be located in a fixed position in the etch reactor, and mass spectroscopy calls for a rapid sampling of the exit gas stream. Optical spectroscopy can be done independently of the position of the wafer(s) and does not disturb the gas flow. For these reasons, optical spectroscopy is currently used almost exclusively. Emission (rather than absorption) spectroscopy is used to follow either the increased reaction product concentration or the depression of a reactant concentration. For example, since silicon dioxide etching and photoresist removal both produce substantial amounts of carbon monoxide, one of the CO emission bands is used. During silicon etching with a fluorine containing gas, a depression in the concentration of fluorine

6.3.4 End-Point Detection occurs, and its level is commonly used for end-point detection. There are many emission lines in each spectrum, and the choice of the one to use for control depends on their relative strength, whether or not there are interfering lines from other plasma constituents, and the ease of detecting a particular wavelength. As an example of interference, etching aluminum produces AlCl, which is widely used for control. When the etching gas is one containing no molecular chlorine, such as  $CCI_4$  or  $SiCI_4$ , an intense band at 261 nm can be used for control. If  $CI_2$  is added to the input gases (as it sometimes is), a chlorine line at 257 nm produces substantial interference, but another AlCl line such as the one at 522 nm can then be used (74).

It is often observed that as more wafers are put into a plasma reactor or as masking is changed to allow more area on a wafer to be etched, the etch rate declines. Thus, etch rates generally must be experimentally determined for a specific number of wafers and/or photoresist coverage. The effect can also be very insidious in causing the rate to increase when an etching surface is clearing and reducing etching area. The loading effect occurs when there is a substantial depletion of the active species from the plasma during etching. By considering the active species' generation rate, the loss rate due to etching, recombination losses at positions other than on the etching surface, and losses due to the species being swept away by gas flow, an etch rate r versus a normalized etching area  $\alpha$  can be calculated. a is given by the ratio of the area of film exposed for etching to the total area available on the reactor platten. For only one active etching species and with various simplifying assumptions, an expression of the form

$$r = \frac{K_1}{1 + K_2 \alpha} \tag{6.24}$$

results (77).  $K_1$  and  $K_2$  are constants for the particular system configuration, operating conditions, and etchant used. For multipleslice reactors, Eq. 6.24 can be written as

$$r = \frac{K_1}{1 + K_2'N}$$
 6.25

where N is the number of slices being etched and  $K_2'$  includes a factor to normalize N to the number of slices at full load. Eq. 6.25 can be rewritten as

$$\frac{1}{r} = K_{o} + KN \tag{6.26}$$

### 6.3.5 Effect of Loading on Etch Rate

from which it can be seen that 1/r versus N will give a straight line if Eq. 6.25 is obeyed. If there are two or more active species, they will almost certainly be depleted at different rates, and a linear relation between 1/r and N will no longer exist (78). Further, under these circumstances, the etching character as well as the rate could change with increasing numbers of slices. For r to be relatively insensitive to N or pattern geometry,  $K_{2}\alpha$  or  $K'_{2}N$  must be much less than 1. Since  $K_{2}$  and  $K'_{2}$  are functions of the etchant gas and reactor geometry, for a given gas and reactor design, little can be done to minimize a loading effect other than by reducing  $\alpha$ . In single-slice reactors, this option is not available, and in multiple-slice reactors, severely reducing the number of slices generally makes the operation uneconomical.

The electrodes as well as the wafers are subject to sputtering, and when this happens, electrode material is introduced into the plasma. If Si or SiO<sub>2</sub> is used as an electrode (or electrode covering) in fluorine plasmas, the products are volatile and like those from pattern etching polysilicon and SiO<sub>2</sub>. However, if an aluminum electrode is used, it may be sputtered and redeposited. It is also possible for the plasma products to react with the electrode material and give deleterious compounds. It is, for example, thought that stainless steel electrodes promote polymerization of CCl<sub>4</sub> and reduce its effectiveness in etching aluminum (79).

Three kinds of damage have been observed: structural, foreign ion implantation, and rough surfaces. In addition, undesirable residues are sometimes left on the surface. The latter can, in principle, be eliminated by the proper choice of gases, but these same residues may contribute to an anisotropy that is desirable or to a selectivity that is necessary. Rough surfaces are primarily a problem when etching only partially through a material—for example, when etching trenches in silicon or gate depressions in GaAs. However, rough walls can also be a problem after etching, for example, vias completely through a wafer. Roughness can generally be eliminated or reduced to a tolerable level by the choice of etchant or, in some cases, by a change of electrode materials (66).

When RIE is used, the ion impact energy seems to be enough to always produce structural damage. In the case of silicon, either a furnace anneal, a rapid thermal anneal, or a removable thin sacrificial oxide removes or prevents most damage. Much of the concern has arisen over structural damage induced in the contact area and

See references 80-86.

### 6.3.6 Effect of Electrode Material

6.3.7 Plasma-Induced Damage\* the thin residue of fluorocarbon left over it during the etching of contact windows in thermal oxide over silicon. Structural damage from hydrogen either deliberately introduced with  $CF_4$  to promote selectivity or from hydrogen in the etchant gas, such as in  $CHF_3$ , is particularly severe. Amorphous layers are sometimes observed, and deep-level electron traps are found. A shift in the barrier height of Schottky diodes made on such etched surfaces occurs, and there is a reduction in MOS lifetime. A common way to remove both the surface film and the underlying damage is to first subject the etched area to an oxygen plasma to oxidize the film and a thin layer of silicon. The oxidized residue is then stripped in dilute HF.

The normal operating frequency for plasma etching equipment is 13,56 MHz (a frequency assigned for industrial use), although lower (100–300 kHz) and higher (microwave, such as 2.45 GHz) frequencies are used in some equipment. The gases used are primarily those listed in Table 6.7, Valves and flow meters required are the standard kind used in the semiconductor industry. The vacuum required varies with the mode of operation and is generally in one of the ranges shown in Fig. 6.21.

The plasma generation chamber and the associated wafer chamber designs vary widely from application to application and between manufacturers. Schematics of some of these designs are shown in Fig. 6.22. The barrel (tubular) reactor shown in Fig. 6.22a was the earliest geometry and was primarily used for stripping resist. To minimize wafer damage, the plasma was confined to the perimeter by a perforated metal shield, and the reactive species diffused to the wafers. The etch rate was much higher near the edge than it was in the center, and the machine was thus not suitable for pattern etching. To minimize the nonuniformity problem, the parallel-plate reactor (Fig. 6.22b) was introduced. Many new designs use singlewafer chambers (Fig. 6.22c). In this case, one wafer will cover the whole bottom electrode, and the gases must escape around the outer edge of one of the electrodes rather than through the center of the wafer platen. The geometry in Fig. 6.22d, shown with the chamber (ground electrode) raised, is an alternative to the parallel plate. This arrangement allows the ground electrode to have a much greater



## 6.3.8 Plasma Etching Equipment

#### FIGURE 6.21

Typical pressure ranges for ion milling, reactive ion etching, reactive ion beam etching, and free-radical plasma etching.

#### FIGURE 6.22

Various configurations of a reaction chamber.



area than the power electrode and thus increase the sheath voltage available for ion acceleration.

For cases where it is desirable to ensure that there are no highenergy ions, the plasma can be generated in a microwave cavity, generally smaller in cross section than a wafer, and then the active species is allowed to diffuse to the slice. This principle is not unlike that of the original barrel reactors, but by using a geometry as shown in Fig. 6.22e, a more uniform etching results.

If, instead, it is desirable to accent the high-energy ion aspect, the geometry of Fig. 6.22f can be used (87). The ions are produced in a plasma as before, but there is now an accelerating grid so that unlike RIE, the energy of the ions can be adjusted independently of the power used to produce them. If the input gas is chosen to give reactive ions, the process is reactive ion beam etching (RIBE). However, if an inert gas such as argon is used, then any substrate (wafer) etching will be strictly by physical erosion, and the process is ion milling.

At elevated temperatures, some gases will react directly with various semiconductor materials and, for some specialized applications, offer an alternative to either wet or plasma etching. The most common application of vapor phase etching is in the in situ cleanup of silicon surfaces before epitaxy. (This aspect is discussed in Chapter 7.) Some gases that will etch silicon directly at high temperature are  $Cl_2$ ,  $Br_2$ , HCl,  $SiCl_4$ , and  $SF_6$ . Anhydrous HF + water vapor will react with  $SiO_2$  even at room temperature, allowing its removal at rates of tens of Å/s (88).

CHAPTER TY 6

SAFETY

Wet etching involves the use of numerous acids or bases that can produce severe burns when they come into direct contact with the skin. In addition, fumes can produce skin irritation and/or eye damage. The etching process should be done in a well-vented hood. When acids or other corrosive liquids are handled, an apron, rubber gloves, and safety glasses should be worn. The gloves should not. however, be considered as protection that allows the hands to be immersed in the liquid since small difficult-to-detect pinholes can allow enough leakage to produce severe burns. The work area should be well lighted and free of obstructions that might cause etchant spills. Care must be taken in the mixing, handling, and disposal of etching solutions. In diluting acids or bases, always add the chemical to the water in order to prevent possible excessive heat buildup and explosion. One of the more insidious acids is HF, which is a rather weak acid but which will nonetheless penetrate the skin and produce deep and very painful burns that may not be apparent for several hours. Other acids produce an immediate burning sensation. In addition, concentrated HNO, produces a brown discoloration of the skin, while concentrated H2SO4 gives a blackening. If an accident involving acid occurs, immediately flush the affected area with copious amounts of water. When the burns appear severe or if HF is involved, medical attention is advisable.

Acids and organic solvents should be stored in separate cabinets and disposed of in separate containers; otherwise, they may react to cause a fire or explosion. Cyanide plating solutions or etchants are reasonably safe when they are kept alkaline, but if mixed with

## 6.4

VAPOR PHASE ETCHING an acid, the highly poisonous hydrogen cyanide gas may be liberated.

The plastic hoods and exhaust ducts commonly used in wet etching operations are quite flammable. Should they catch fire from an immersion heater, for example, the smoke and fumes can be a health hazard (89).

Plasma reactors use high-voltage RF and have high DC voltages associated with the RF generator. Hence, servicing should be done only by qualified personnel, and no electrical safety interlocks should be deactivated. Some of the plasma etchants are Freon based and thus guite nontoxic. However, metal etchants often use chlorine or some other toxic gas, albeit in small quantities. Even when using nontoxic gases, some of the reaction products are toxic and are often either condensed on the chamber walls or trapped and concentrated in the plasma reactor vacuum pump system. Thus, care should be taken in cleaning the reactor, servicing pumps, and disposing of spent oil (90). Some gases, such as NF<sub>3</sub>, become reactive at elevated temperatures and should be distributed through lines that have been carefully cleaned of organics and are free of gages and valves with combustible packing glands. The quest for new etching gases continues, but there is no assurance that they will be benign or will have been studied enough for their toxicity to have been properly evaluated when they are introduced.

KEY IDEAS 6

- Except when special etches are used for single-crystal material (such as Si or GaAs), wet etching is isotropic.
- The fact that etching is isotropic causes mask undercutting during etching of individual layers. Since it is difficult to match etch rates for different materials, extra undercutting of some layers will also occur when multilayer structures are being etched.
- By the proper choice of plasma etching mode (such as RIE) and etchant, plasma etching can be done with minimal undercutting.

- Wet etching is less expensive, but plasma etching is more appropriate for fine geometries.
- Because of the wide choice of wet etches, they can usually be more material-selective than plasma etches.
- Plasma anisotropic etching occurs because of an etch-resistant polymer buildup on vertical surfaces.
- Vapor phase etching is a high-temperature process and is seldom used except just prior to epitaxial deposition.

# PROBLEMS 6

- How much 49% HF should be mixed with 500 ml of water to give a 1 normal solution? (The specific gravity of the 49% HF is approximately 1.16.)
- After silicon slices are etched with CP4A, the slices usually have a pincushion cross section. Based on this evidence, would you judge that the etch region is diffusion or kinetically controlled? Why?
- If etching is to go to completion, how wide should the mask opening be if a groove 10 μm deep is to be anisotropically etched in a (100) silicon wafer? How should the mask opening be oriented with respect to the wafer flat? (See Chapter 2 for flat orientation according to SEMI standards.)
- 4. How could the Ga face of a (111) oriented GaAs slice be determined?
- Etch solutions are often filtered during the etch operation. When SiO<sub>1</sub> is etched with buffered HF<sub>7</sub> what materials might be removed by the filter?
- How many molecules of O<sub>2</sub> are in a roomtemperature 10 liter chamber at a pressure of 1

# REFERENCES 6

- "Chemical/ Metallurgical Properties of Silicon," in Integrated Silicon Device Technology, Vol. X, Research Triangle Institute Tech. Rpt. ASD-TDR-63-316, November 1965.
- H.K. Kuiken et al., "Etching Profiles at Resist Edges," J. Electrochem. Soc. 133, pp. 1217– 1226, 1986. (See also following article in journal.)
- R.G. Brandes and R.H. Dudley, "Wall Profiles. Produced during Photoresist Masked Isotropic Etching," J. Electrochem. Soc. 120, pp. 140– 142, 1973.
- 4. R. Sellamuthu et al., "A Study of Anisotropic

millitorr? How many molecules of CCl<sub>2</sub>F<sub>3</sub> would there be under the same conditions?

- Calculate the approximate residency of a gaseous species in a 1 liter chamber at a pressure of 4 millitorr if the flow of gas into the chamber is 100 cm<sup>3</sup>/minute when measured at atmoshperic pressure.
- 8. If 1  $\mu$ m of CVD SiO<sub>2</sub> over polysilicon is to be etched and no more than 100 Å of silicon are to be removed, what etch selectively will be required? Assume that the oxide film thickness varies by  $\pm 10\%$  and that the etching uniformity is also  $\pm 10\%$ . In addition, consider that an additional over-etch of 10% is required because of the variability in end-point detection.
- For a given plasma etch reactor configuration, sketch how the etch rate versus exposed area to be etched will probably vary. Indicate the region over which the concentration of an etching species could be used for end-point detection. Explain your answer.
- If etch damage is of particular concern, which configuration(s) shown in Fig. 6.22 would be most appropriate? Why?

Trench Etching of Si with NF<sub>1</sub>-Halocarbon," J. Vac. Sci. Technol. B5, pp. 342-346 and references therein, 1987.

- Chuck Murray, "Wet Etching Update," Semiconductor International, pp. 80–85, May 1986.
- D.R. Oswald, "Automatic Chemical Processing for Silicon Wafers," J. Electrochem. Soc. 123, pp. 531–534, 1976.
- H.K. Kuiken and R.P. Tijburg, "Centrifugal Etching: A Promising New Tool to Achieve Deep Etching Results," J. Electrochem. Soc. 130, pp. 1722–1729, 1983.
- 8. D.R. Turner, "On the Mechanism of Chemically

Etching Germanium and Silicon," J. Electrochem. Soc. 107, pp. 810-816, 1960.

- H. Robbins and B. Schwartz, "Chemical Etching of Silicon II. The System HF, HNO<sub>3</sub>, H<sub>2</sub>, and HC<sub>2</sub>H<sub>3</sub>O<sub>2</sub>," J. Electrochem. Soc. 107, pp. 108– 111, 1960. (For more information on the HF– HNO<sub>3</sub> system, see the other three papers in this series: Harry Robbins and Bertram Schwartz, "Chemical Etching of Silicon I. The System HF, HNO<sub>3</sub>, and H<sub>2</sub>O," J. Electrochem Soc. 106, pp. 505–508, 1959; B. Schwartz and H. Robbins, "Chemical Etching of Silicon III. A Temperature Study in the Acid System," J. Electrochem. Soc, 108, pp. 365–372, 1961; B. Schwartz and H. Robbins, "Chemical Etching of Silicon IV. Etching Technology," J. Electrochem. Soc. 123, pp. 1903–1909, 1976.)
- Margaret Wright Jenkins, "A New Preferential Etch for Defects in Silicon Crystals," J. Electrochem. Soc. 124, pp. 757–762, 1977.
- John N. Shive, Chap. 16, "Intermediate Surface Treatment," in *Transistor Technology*, Bell Telephone Laboratories, Inc., 1952.
- S.M. Hu and D.R. Kerr, "Observation of Etching of n-Type Silicon in Aqueous HF Solutions," J. Electrochem. Soc. 114, p. 414, 1967.
- A. Reisman et al., "The Controlled Etching of Silicon in Catalyzed Ethylenediamine-Pyrocatechol-Water Solutions," J. Electrochem. Soc. 126, pp. 1406–1415, 1979.
- C. Raetezl et al., "Etching Properties of Different Silicon Films," *Electrochem, Soc. Ext. Abst.* 74–2, abst. no. 142, October 1974.
- K.E. Bean and W.R. Runyan, "Dielectric Isolation: Comprehensive, Current, and Future," J. Electrochem. Soc. 124, pp. 5C–12C, January 1977.
- G.C. Messenger, IEEE Trans. on Nucl. Sci. NS-13, p. 141, 1966.
- J. Peter Ballantyne et al., "Small Chip Blocks Large Voltages," *Bell Laboratories Record*, pp. 91–94, April 1982.
- Kurt E. Petersen, "Silicon as a Mechanical Material," Proc. IEEE 70, pp. 420–457, 1982.
- H.A. Waggener et al., "Anisotropic Etching for Forming Isolation Slots in Silicon Beam Leaded Integrated Circuits," Int. Electron Dev. Conference, Washington, D.C., October 18–20, 1967.
- 20. Kenneth E. Bean, "Anisotropic Etching of Sili-

con," IEEE Trans. on Electron Dev. ED-25, pp. 1185-1193, 1978.

- M.M. Abu-Zeid, "Corner Undercutting in Anisotropically Etched Isolation Contours," J. Electrochem. Soc. 131, pp. 2138–2142, 1984.
- A.I. Stoller. "The Etching of Deep Vertical-Walled Patterns in Silicon," *RCA Review 31*, pp. 271–275, 1970.
- Don, L. Kendall, "Vertical Etching of Silicon at Very High Aspect Ratios," Ann. Rev. Mater. Sci. 9, pp. 373–403, 1979.
- W.R. Runyan, Semiconductor Measurements and Instrumentation, McGraw-Hill Book Co., New York, 1975.
- H. Muraoka et al., "Controlled Preferential Etching Technology," pp. 327–338, in Howard R, Huff and Ronald R. Burgess, eds., Semiconductor Silicon/73, Electrochemical Society, Princeton, N.J., 1973.
- C. John Rhee and Jack Saltich. "Integral Heat-Sink IMPATT Diodes Fabricated Using p' Etch Stop," Proc. IEEE 61, pp. 385–386, 1973.
- A. Bohg, "Ethylene Diamine-Pyrocatechol-Water Shows Etching Anomaly in Boron-Doped Silicon," J. Electrochem. Soc. 118, pp. 401–402, 1971.
- J.C. Greenwood, Ethylene Diamine-Catechol-Water Mixture Shows Preferential Etching of p-n Junction," J. Electrochem. Soc. 116, pp. 1325-1326, 1969.
- H.J.A. van Dijk and J. de Jonge, "Preparation of Thin Silicon Crystals by Electrochemical Thinning of Epitaxially Grown Structures," J. Electrochem. Soc. 117, pp. 553–554, 1970.
- Ronald L, Meek, "Electrochemically Thinned N/ N<sup>+</sup> Epitaxial Silicon—Method and Applications," J. Electrochem. Soc. 118, pp. 1240–1246, 1971.
- A. Uhlir, Jr., "Electrolytic Shaping of Germanium and Silicon," *Bell Syst. Tech. J.* 35, pp. 333–347, 1955.
- R. Memming and G. Schwandt, "Anodic Dissolution of Silicon in Hydrofluoric Acid Solutions," Surface Science 4, pp. 109–124, 1966.
- C.P. Wen and K.P. Weller, "Preferential Electrochemical Etching of P<sup>+</sup> Silicon in an Aqueous HF-H<sub>2</sub>SO<sub>4</sub> Electrolyte," *J. Electrochem. Soc. 119*, pp. 547–548, 1972.
- 34. M.J.J. Theunissen, "Etch Channel Formation

during Anodic Dissolution of N-Type Silicon in Aqueous Hydrofluoric Acid," J. Electrochem. Soc. 119, pp. 351-360, 1972.

- Y. Wantanabe et al., "Formation and Properties of Porous Silicon and Its Application," J. Electrochem. Soc. 122, pp. 1351–1355, 1975.
- Takasahi Unagami, "Formation Mechanism of Porous Silicon Layer by Anodization in HF Solution," J. Electrochem. Soc. 127, pp. 476–483, 1980.
- H.A. Waggener and J.V. Dalton, "Control of Silicon Etch Rates in Hot Alkaline Solutions by Externally Applied Potentials," *Electochem. Soc. Ext. Abst.*, abst. no. 273, Fall 1972.
- R.J. Archer, "Stain Films on Silicon," J. Phys. Chem. Solids 14, pp. 104–110, 1960.
- K.H. Beckmann, "Investigation of the Chemical Properties of Stain Films on Silicon by Means of Infrared Spectroscopy," *Surface Science 3*, pp. 314–332, 1965.
- R.E. Hines, "A Simple Cure for a Form of Window Staining Which Occurs during the Etching of Silicon Integrated Circuits," *Microelectronics* and *Reliability* 11, p. 537, 1972.
- H. Kuiken et al., "Etching Profiles at Resist Edges," J. Electrochem. Soc. 133, pp. 1217– 1226, 1986.
- E. Kohn, "A Correlation between Etch Characteristics of GaAs Etch Solutions Containing H<sub>2</sub>O<sub>2</sub> and Surface Film Characteristics," J. Electrochem. Soc. 127, pp. 505–508, 1980.
- Sadao Adachi and Kunishige Oe, "Chemical Etching Characteristics of (001) GaAs," J. Electrochem, Soc. 130, pp. 2427–2435, 1983.
- Don W, Shaw, "Localized GaAs Etching with Acidic Hydrogen Peroxide Solutions," J. Electrochem. Soc. 128, pp. 874–880, 1981.
- J. Kleinberg, W. Argersinger, and E. Griswold. Inorganic Chemistry, Heath and Co., Boston, 1960.
- John S. Judge, "A Study of the Dissolution of SiO, in Acidic Fluoride Solutions," J. Electrochem. Soc. 118, pp. 1772–1775, 1971.
- Robert S. Alwitt and Robert S. Kapner, "A Study of the Dissolution of Zinc in Buffered Acetic Acid Solutions," J. Electrochem. Soc. 112, pp, 204–207, 1965.
- John Lawrence, "Controlled Etching of Silicon Dioxide in Buffered Hydrofluoric Acid," Elec-

trochem. Soc. Ext. Abst. 72-2, abst. no. 191, pp. 466-468, 1972.

- A.S. Tenney and M. Ghezzo, "Etch Rates of Doped Oxides in Solutions of Buffered HF," J. Electrochem. Soc. 120, pp. 1091–1095, 1973.
- Lillian Rankel Plauger, "Etching Studies of Diffusion Source Boron Glass," J. Electrochem. Soc. 120, pp. 1428–1430, 1973.
- Lou Hall, "Etch Rate Characterization of Silane Silicon Dioxide Films," J. Electrochem. Soc. 118, pp. 1506–1507, 1971.
- W. van Gelder and V.E. Hauser, "The Etching of Silicon Nitride in Phosphoric Acid with Silicon Dioxide as a Mask," J. Electrochem. Soc. 114, pp. 869–872, 1967.
- Victor Harrap, "Equal Etch Rates of Si,N<sub>4</sub> and SiO<sub>2</sub> Utilizing HF Dilution and Temperature Dependence," pp. 354-362, in Howard R. Huff and Ronald R. Burgess, eds., Semiconductor Silicon/73, Electrochemical Society, Princeton, N.J., 1973.
- Werner Kern and Cheryl A. Deckert, Chap. V-I, "Chemical Etching," in John L. Vossen and Werner Kern, eds., *Thin Film Processes*, Academic Press, New York, 1978.
- Cheryl A. Deckert, "Pattern Etching of CVD Si<sub>3</sub>N<sub>4</sub>/SiO<sub>2</sub> Composites in HF/Glycerol Mixtures," J. Electrochem. Soc. 127, pp. 2433– 2438, 1980.
- J.J. Kelly and C.H. de Minjer, "An Electrochemical Study of Undercutting during Etching of Duplex Metals," J. Electrochem. Soc. 122, pp. 931–936, 1975.
- H.R. Koenig and L.I. Maissel, "Application of RF Discharges to Sputtering," *IBM J. Res. Develop.* 14, pp. 168–171, 1970.
- J.L. Vossen, "Glow Discharge Phenomena in Plasma Etching and Plasma Deposition," J. Electrochem. Soc. 126, pp. 319–324, 1979.
- Linus Pauling, The Nature of the Chemical Bond, Cornell University Press, Ithaca, New York, 1960.
- Ch. Steinbruchel et al., "Mechanism of Dry Etching of Silicon Dioxide," J. Electrochem. Soc. 132, pp. 180–186, 1985.
- H. Kawata et al., "The Dependence of Silicon Etching on an Applied DC Potential in CF<sub>4</sub> + O<sub>2</sub> Plasmas," J. Electrochem. Soc. 132, pp. 206– 211, 1985.

- H. Abe et al., "Etching Characteristics of Silicon and Its Compounds," *Jap. J. Appl. Phys. 12*, p. 154, 1973.
- Alan R. Reinberg, "Plasma Etching in Semiconductor Manufacturing—A Review," pp. 91-110, in Henry G. Hughes and Myron J. Rand, eds., *Etching for Pattern Definition*, Proc. Electrochem. Soc. 87-6, Princeton, N.J., 1976.
- 64. G.S. Mathad, "Design Considerations for a High Pressure, High Etch Rate Single Slice Reactor," pp. 134–142, in Proc. 6th Symposium on Plasma Processing, 87–6, Electrochemical Society, Pennington, N.J., 1987.
- Russ A. Morgan, *Plasma Etching in Semicon*ductor Fabrication, Elsevier Science Publishers B, V., New York, 1985.
- Daniel L. Flamm et al., "Basic Chemistry and Mechanisms of Plasma Etching," J. Vac. Sci. Technol. B1, pp. 23-30, 1983.
- L.M. Ephrath, "Selective Etching of Silicon Dioxide Using Reactive Ion Etching with CF<sub>4</sub>-H<sub>2</sub>," J. Electrochem. Soc. 126, pp. 1419–1421, 1979.
- K.L. Konnerth and F.H. Dill, "In-Situ Measurement of Dielectric Thickness during Etching or Developing Process," *IEEE Trans. on Electron Dev. ED-22*, pp. 452–456, 1975.
- Dave Johnson, "Optical Methods Detect End Point in Plasma Etching," *Industrial Research* and Development, pp. 181–185, October 1980.
- H.P. Kleinknecht and H. Meier, "Optical Monitoring of the Etching of SiO<sub>2</sub> and Si<sub>3</sub>N<sub>4</sub> on Si by the Use of Grating Test Patterns," J. Electrochem. Soc. 125, pp. 798-803, 1978.
- E.O. Degenkolb and J.E. Griffiths, "Simple Optical Devices for Detection of RF Oxygen Plasma Stripping of Photoresists," *Appl. Spectroscopy 31*, pp. 40–42, 1977.
- Bill B. Stafford and Georges J. Gorin, "Optical Emission End-Point Detecting for Monitoring Oxygen Plasma Photoresist Stripping," Solid State Technology, pp. 51-55, September 1977.
- 73. R.G. Poulsen and G.M. Smith, "Use of Optical Emission Spectra for End-Point Detection in Plasma Etching," pp. 1058-1070, in H.R. Huff and E. Sirtl, eds., Semiconductor Silicon/77, Electrochemical Society, Princeton, N.J., 1977.
- Kwang O. Park and Fredrick C. Rock, "End Point Detection for Reactive Ion Etching of Alu-

minum," J. Electrochem, Soc. 131, pp. 214-215, 1984.

- S. Mittal et al., "Endpoint Detection for Contacts and Vias in VLSI Technology," pp. 254–266, in *Proc. 6th Symposium on Plasma Processing* 87– 6, Electrochemical Society, Princeton, N.J., 1987.
- George B. Bunyard, "Plasma Process Development and Monitoring via Mass Spectrometry," *Solid State Technology*, pp. 53-57, December 1977.
- C.J. Mogab, "The Loading Effect in Plasma Etching," J. Electrochem. Soc. 124, pp. 1262– 1268, 1977.
- Daniel L. Flamm et al., "Multiple-Etchant Loading Effect and Silicon Etching in CIF, and Related Mixtures," J. Electrochem. Soc. 129, pp. 2755-2760, 1982.
- K. Tokunaga et al., "Comparison of Aluminum Etch Rates in Carbon Tetrachloride and Boron Trichloride Plasmas," J. Electrochem. Soc. 128, pp. 851–855, 1981.
- R.G. Frieser et al., "Silicon Damage Caused by Hydrogen Containing Plasmas," J. Electrochem. Soc. 130, pp. 2237-2241, 1983.
- J. Dieleman and F.H.M. Sanders, "Plasma Effluent Etching: Selective and Non-Damaging," *Solid State Technology*, pp. 191–196 and references therein, April 1984.
- S.J. Fonash, "Damage Effects in Dry Etching," Solid State Technology, pp. 201-204 and references therein, April 1985.
- G.S. Oehrlein et al., "Near-Surface Damage and Contamination after CF<sub>4</sub>/H<sub>2</sub> Reactive Ion Etching of Si," J. Electrochem. Soc. 132, pp. 1441– 1447, 1985.
- A. Rohatgi et al., "Characterization and Control of Silicon Surface Modification Produced by CCl<sub>4</sub> Reactive Ion Etching," J. Electrochem. Soc. 133, pp. 408-416, 1986.
- G.S. Oehrlein et al., "Investigation of Reactive-Ion-Etching-Related Fluorocarbon Film Deposition onto Silicon and a New Method for Surface Residue Removal," J. Electrochem. Soc. 133, pp. 1002–1008, 1986.
- D. Misra and E.L. Heasell, "A Study of Reactive Ion Etching (CF<sub>4</sub> + O<sub>2</sub> Plasma) Induced Deep Levels in Silicon," J. Electrochem. Soc. 134, pp. 956–958, 1987.

REFERENCES 293

- D.F. Downey et al., "Introduction to Reactive Ion Beam Etching," *Solid State Technology*, pp. 121–127, February 1981.
- Rinn Cleavelin and Gary T. Duranko, "Silicon Dioxide Removal in Anhydrous HF Gas," Semiconductor International, pp. 94–99, November 1987.
- Peter H. Singer, "Wet Bench Fire Suppression," Semiconductor International, pp. 154–157, September 1987.

.

90. Jean Ohlson, "Dry Etch Chemical Safety," Solid State Technology, pp. 69–73, July 1986.



# Epitaxy

# 7.1 INTRODUCTION

The epitaxial growth process involves the deposition of a thin layer of material onto the surface of a single-crystal wafer (substrate) in such a manner that the layer is also single crystal and has a fixed and predetermined crystallographic orientation with respect to the substrate. If the layer material is the same as the substrate, such as silicon on silicon, then it will have the same crystallographic orientation as the original crystal and become a crystallographic extension of the wafer. If deposition is from the vapor phase, this combination is sometimes referred to as *homoepitaxy*, *autoepitaxy*, or *isoepitaxy* but is generally simply called *epitaxy*. If the materials are different, as with silicon on sapphire, the combination is referred to as *heteroepitaxy*, and the orientation of the layer may well be different from that of the substrate.

If the layer is grown from the melt rather than by chemical vapor deposition (CVD), the process is referred to as *liquid phase epitaxy* (LPE). Silicon is seldom grown by LPE; gallium arsenide layers, however, often are. It is also possible to grow epitaxially from the solid phase. For example, the very thin amorphous layer left on top of a single-crystal slice after polishing or the thin amorphous layer formed on a wafer surface because of ion implant damage can be regrown as single crystal by a suitable high-temperature annealing cycle.

The term *epitaxy*, referring to oriented crystallographic overgrowth on a foreign substrate, was apparently coined in the 1920s by the French mineralogist Royer as he observed the overgrowth of water-soluble salts onto the cleaved surfaces of naturally occurring mineral specimens (1). Such studies were of considerable scientific interest and were expanded to include materials evaporated onto mineral surfaces (rock salt was a favorite substrate). However, no industrial applications of these studies took place until silicon and germanium began to be overgrown onto sapphire and spinel substrates in the mid-1960s. During the same time interval (1930–1960),

#### 7.1 INTRODUCTION 295

techniques for the vapor phase growth of crystals were becoming well developed, and, indeed, a patent filed in 1951 applied the vapor phase growth of silicon and germanium to semiconductor devices (2). Later, to fulfill a specific need of the fledgling semiconductor industry, when thin layers of single-crystal germanium were vapor grown onto germanium single-crystal slices, the process was labeled as "epitaxy" even though the original definition was considerably different. Thus, when experiments to overgrow silicon and germanium onto single-crystal substrates such as sapphire and spinel began, the new label "heteroepitaxy" was introduced to distinguish that process from the by then established use of epitaxy to describe overgrowth on a substrate of the same material.

The original problem to which an epitaxial layer provided the solution was the reduction of transistor collector resistance. The source of this collector resistance was the layer of high-resistivity semiconductor material between the collector-base junction and the collector contact. The grown junction transistor, shown schematically in Fig. 7.1a, had a very long length of high-resistivity material, and its performance at high current was severely degraded. The mesa transistor, also shown schematically in Fig. 7.1a, had a substantially reduced current path length since the mesa chip thickness was much less than half the length of the grown junction bar. However, for ease of manufacture, the distance from the collector-base junction to the collector contact was still much longer than that required by the device itself.<sup>1</sup> The structure shown in Fig. 7.1b reduces the collector resistance even more by having a large portion of the wafer of low resistivity.



Use of low-resistivity material to reduce collector series resistance.



'The required distance is something just in excess of the width of the collectorbase space charge region when the maximum collector-base voltage is applied.

#### 296 CHAPTER 7 EPITAXY

Vapor phase epitaxial growth is the only method currently available for producing layers of high resistivity on low-resistivity substrates such as just described. When doping is done by either solidstate diffusion or ion implantation, the impurities are introduced from the surface so that, of necessity, a higher concentration occurs near the surface than exists deep within the wafer. During growth from the melt, once a dopant is put into the melt, it cannot be easily removed, so the impurity concentration during growth can be abruptly increased but not decreased.<sup>2</sup> However, in vapor phase growth, the dopant, which is deposited from the vapor simultaneously with the semiconductor, can be quickly changed, with respect both to species and to concentration. The time for change is primarily dictated by the time required to change the gas species/concentration in the reactor chamber and can be from a few seconds to a few minutes, depending on reactor design. Thus, in principle, uncompensated sequential layers of widely differing resistivity can be produced, although they are generally not required for IC fabrication.

The mesa transistor structure of Fig. 7.1b can be made by starting with a low-resistivity substrate (slice) and adding a thin higherresistivity layer of the same conductivity type by epitaxial growth (3). The silicon bipolar IC required a different structure since the goal was to provide physical support and electrical (pn junction) isolation between the many components of the IC. The structure used for some time was a high-resistivity n-layer epitaxially overgrown onto a high-resistivity p-substrate. Component isolation was accomplished by local p-diffusions, which reached from the surface of the n-layer to the p-substrate. This configuration, as shown in Fig. 7.2a,

#### FIGURE 7.2

Use of a local diffusion before epitaxy to reduce the highresistivity collector current path length and hence collector series resistance in an IC transistor.



<sup>2</sup>The resistivity can be reduced in either diffused structures or melt-grown crystals by adding a compensating impurity. However, since resistivity depends on the difference between the concentration of n- and p-impurities, its value is difficult to control. Further, the addition of compensating impurities in large quantities can substantially reduce carrier mobility.

#### 7.2 VAPOR PHASE EPITAXY 297

also had high collector resistance since the path from the collectorbase junction to the collector contact on the top of the wafer was long and composed entirely of high-resistivity material. By adding a localized high-concentration n-diffusion before the epitaxy, the structure of Fig. 7.2b can be made, and with it, a substantial portion of the electrical path is now through the low-resistivity diffused layer (4).

The transistors in MOS ICs are self-isolating and therefore can be made in single-resistivity material. However, higher performance and/or smaller chip size can often be achieved by using epitaxy, and beginning in the mid-1980s, high-resistivity layers on low-resistivity substrates of the same type were used in production quantities. Gallium arsenide ICs requiring epitaxy generally use an n-layer on a semi-insulating substrate.

As in the chemical vapor deposition (CVD) described in Chapter 4, no solid material should be formed in the gas above the wafer surface; that is, as before, the solid forming reaction must be surface catalyzed. In addition, successful epitaxy requires that each atom permanently added to the surface be in the exact position required to form a defect-free single-crystal layer. A study of vapor phase epitaxy can be roughly divided into the three categories of surface nucleation, surface reaction kinetics, and gas transport of reactants and reaction products to and from the wafer surface.

Classical homogeneous nucleation theory supposes that molecules will collide and at least temporarily bind together and that at any given time a wide distribution of nuclei ranging up in size from one molecule exists. If the nucleus (cluster of molecules) is above some critical size, it will be stable and continue to grow; otherwise, there is a high probability that it will lose molecules and disappear. The change  $\Delta G$  in the Gibbs free energy of the molecules due to the formation of a spherical cluster (perhaps appropriate for a raindrop forming in air) is given by (5)

$$\Delta G = 4\pi r^2 \sigma + \frac{4}{3}\pi r^3 \Delta G_v \qquad 7.1$$

where r is the cluster radius,  $\sigma$  is the interfacial energy per unit area between the solid and the vapor, and  $\Delta G_v$  is the Gibbs free energy difference per unit volume between the vapor and the cluster.  $\Delta G_v$ is given by

$$\Delta G_{v} = -\left(\frac{kT}{\Omega}\right) \ln\left(\frac{P}{P_{c}}\right)$$
 7.2

7.2 VAPOR PHASE EPITAXY

7.2.1 Nucleation

#### 298 CHAPTER 7 EPITAXY

#### FIGURE 7.3

Plot of  $\Delta G$  versus cluster radius r. (It first increases, and then, if the critical radius r\* is reached, rapidly decreases and becomes negative.)



where  $\Omega$  is the volume per molecule in the crystal, *T* is the absolute temperature, *k* is Boltzmann's constant, *P* is the vapor pressure above the growing cluster, and  $P_c$  is the equilibrium vapor pressure. The plot of Eq. 7.1 in Fig. 7.3 shows that initially  $\Delta G$  increases with *r* but reaches a maximum and then rapidly decreases. As soon as  $\Delta G$  begins to decrease, at  $r = r^{*}$ , the cluster becomes thermodynamically stable.  $r^{*}$  can be calculated from Eq. 7.1 and is

$$r^* = -\frac{2\sigma}{\Delta G_v}$$
 7.3

Thus, if the absolute value of  $\Delta G_v$ , as determined by the vapor pressure of the nutrient, is too small, the critical radius will be so large that little or no nucleation will occur.

The spherical homogeneous nucleation case just discussed is not directly applicable to crystal growth, but it does demonstrate the concept of a critical cluster size necessary for growth and that of the necessity of a higher-than-equilibrium vapor pressure. Somewhat closer to crystal growth is a flat-disk cluster of diameter r and height h laying on a flat surface, in which case (5)

$$\Delta G = 2\pi r \varepsilon + \pi r^2 (\sigma_1 + \sigma_2 - \sigma_3) + \pi r^2 h \Delta G_v \qquad 7.4$$

where  $\varepsilon$  is the edge free energy per unit length and where  $\sigma_1, \sigma_2$ , and  $\sigma_3$  are respectively the interfacial energy per unit area between the cluster and the substrate, the cluster and the gas ambient, and the substrate and the gas ambient.3 Eq. 7.4 has the same general character as Eq. 7.1, and thus the earlier conclusions are still applicable. Note that in this case the cluster can collect molecules not only directly from the gas stream but also, through surface diffusion, from the body of molecules that strike the surface around the cluster. If, instead of a circular cluster forming on a broad flat surface, a semicircular cluster forms next to a surface step, it can be shown that the sum of all of the edge and interfacial energies is less than it is for the circular case. Nucleation at a ledge is then favored over nucleation on a flat surface, and a surface with ledges is expected to grow much more rapidly than a smooth one. In many cases, it appears that the observed growth rates of crystals can be explained only by assuming a continuous supply of ledges. This aspect is dis-

<sup>&</sup>lt;sup>3</sup>The edge free energy of a monolayer high step is usually not equal to  $hs_2$ , where h in this case is the thickness of the monolayer. Hence, in Eq. 7.4, for small h,  $\varepsilon$  instead of  $hs_2$  should be used. The  $\sigma_3$  term must be subtracted since it represents a free energy change due to covering up a portion of the original surface by the disk.

#### 7.2 VAPOR PHASE EPITAXY 299

cussed theoretically in the 1951 classic paper by Burton, Cabrera, and Frank (6).

Screw dislocations in a single-crystal wafer will produce a continuous supply of ledges that will not grow out, regardless of the thickness grown. In addition, from a practical standpoint, with the thin layers normally grown for semiconductor applications, the fact that the surface is almost never exactly on-orientation ensures a supply of atomic steps (ledges) that will not completely disappear during layer growth. This feature can be seen from Fig. 7.4, which shows a series of atomic planes of separation *s* intersecting the surface at an angle  $\theta$ . For this case, the distance *S* along the surface between steps is given by

$$S = \frac{s}{\sin \theta}$$
 7.5

The atomic spacing s is normally considered to be the separation of two adjacent planes of atoms—that is, the spacing of (400) planes when a (100) surface is exposed.<sup>4</sup> However, since evidence exists that in the case of a silicon (100) surface that has been heated above 1000°C, the equilibrium step height is two layers or the spacing be-

FIGURE 7.4

Simultaneous lateral growth from many ledges resulting in a net vertical growth.





T=0

<sup>4</sup>This presumes either a diamond or zinc blende structure. For a discussion of the placement of atoms in these kinds of crystals (typical of silicon and gallium arsenide, respectively), see Appendix A.

#### 300 CHAPTER 7 EPITAXY

tween (200) planes (7), it is possible that multiple-layer step heights also occur on other surfaces.

If growth proceeds by nucleation at the steps, then the steps will each move out laterally as shown in Fig. 7.4a, and the surface of Fig. 7.4b will be transformed into that of Fig. 7.4c at some later time and will have moved upward a distance t. When the thickness t is grown, the length L of the atomically flat surface caused by the steps growing out is given by  $L = t/\sin \Theta$ . If the surface is misoriented by 1° and a 25 µm layer is grown, a flat of about 1.4 mm would develop on one side of a wafer.

When individual atomic bonding is considered, the picture of crystal growth is somewhat changed from that just described. With the values of vapor pressure normally encountered in vapor phase growth, the number of atoms in a critical cluster may be as low as 1. For such small numbers, a classical thermodynamics approach using properties that appear in Eq. 7.4 is really no longer appropriate, and a statistical mechanics approach is necessary (8). However, the concept of a critical cluster size is retained. In the diamond lattice, enough atomic bonds are available at a ledge on any surface, regardless of its orientation, for a single atom that diffuses to it to be properly oriented and stably bonded. Where no ledges exist and nucleation must take place on a crystal plane, fewer atomic bonds per surface atom can reach out of the plane to attach to an atom on the surface.5 However, properly oriented single-atom attachment is still sometimes possible. For example, there are two bonds per atom reaching from one (400) plane to the next, and with isoepitaxy, these two bonds are enough to provide orientation so that growth on terraces can, in principle, proceed by a single atom. In the case of (111) orientation, however, only one bond per atom reaches from one (111) double layer to the next. With only one bond, a single atom on a (111) plane surface will neither be tightly held nor uniquely oriented. A cluster of three atoms can provide orientation, but it may be either the correct one or a twinned one (9). Experimentally, it has been observed that the growth rate on a closely oriented (111) plane is considerably less than when orientation is a few degrees away from the (111) (10, 11). Further, it is much more difficult to grow high-quality epitaxy on surfaces closely oriented to (111). Both of these observations are consistent with a greater difficulty in nucleating on (111) terraces (flats between steps) than at (111) steps. However, in the case of growth on (100) planes, only a slight decrease in

<sup>&</sup>lt;sup>5</sup>It should be remembered that the upper bonds in a surface do not dangle (remain unbonded). They will either distort to connect with other lattice atoms without satisfied bonds or else bond to foreign atoms adsorbed on the surface.

#### 7.2 VAPOR PHASE EPITAXY 301

rate is observed as the (100) plane is approached, and the quality of epitaxial layers grown on-orientation is found to be quite good. This suggests that nucleation and proper orientation are possible on an atom-by-atom basis on (100) planes.

In summary, the possible actions of a single atom as it is adsorbed on a wafer surface, as shown in Fig. 7.5, are as follows:

- The atom may strike the surface and then be desorbed before it can diffuse to a ledge or participate in cluster formation.
- If it has very little energy or if the deposition rate is very high, the atom may be surrounded by other atoms and effectively locked in place without regard to crystallographic location, in which case a polycrystalline or amorphous layer will grow.
- 3. When the energy is high enough for appreciable motion, but the ledges are widely separated, several atoms may join together on a terrace to form a stable and sometimes properly oriented cluster that can then act as a new ledge for future growth.
- The atom may diffuse to a ledge, bond properly, and grow epitaxially.
- 5. On some substrate orientations, the single atom may become epitaxially bonded on a terrace.

Clusters have been observed in silicon-on-silicon molecular beam epitaxy (discussed in a later section), but then such growth is promoted when there are impurities on the surface (12).

FIGURE 7.5

When growth is on a single-crystal substrate of the same lattice structure and only a slightly differing lattice constant, there is ap-

Possible actions of ad-atoms on a crystal surface.



#### 302 CHAPTER 7 EPITAXY

parently little difference in nucleation behavior from that just discussed. The sites will all be of the proper orientation, although the substrate and layer atomic spacings may not exactly fit. The list of semiconductors having the same crystal structure is actually quite small. Silicon and germanium both have the diamond lattice, while the III-V and some of the II-VI compounds have a zinc blende lattice. Table 7.1 gives lattice spacings of some common semiconductor materials. Note that there are different values for high- and low-resistivity Si and different values depending on whether the heavy doping is n- or p-type. This effect also occurs in the other materials but has not been as extensively studied. There is no set amount of lattice mismatch that can be tolerated. It depends both on the elastic properties of the individual materials and on the density of defects that is acceptable to the user. As an experimental guide, the quality of high-resistivity on low-resistivity Si appears quite satisfactory even though some misfit dislocations occur. Considerably more mismatch occurs between Si and GaAs, and many more misfit dislocations, but GaAs of usable quality can be overgrown on Si. When the mismatch is small, the layer accommodates the misfit by having strain in the first few layers. At some point, the mismatch becomes large enough so that the necessary strain exceeds the yield point of the material, and misfit dislocations occur.6 When misfit dislocations occur, there are usually not enough of them to reduce the strain to zero. Apparently, there are only enough to reduce the strain to a level less than that of the yield point. For example, in the case of Ge overgrown on GaAs, with no residual strain the dislocation spacing would be expected to be every  $0.5 \,\mu m$ . whereas a spacing of 4 µm was experimentally observed (13). When the misfit becomes yet larger, as regions nucleated at different ledges grow together, there can be gross crystallographic defects.

#### TABLE 7.1

Lattice Spacings for Cubic Semiconductors

| Material               | Unit Cell Length (Å) | Group | Structure   |
|------------------------|----------------------|-------|-------------|
| 0.001 Ω-cm B-doped Si  | 5.4270               | IV    | Diamond     |
| High-purity Si         | 5.4309               | 1V    | Diamond     |
| 0.001 Ω-cm As-doped Si | 5.4315               | IV    | Diamond     |
| High-purity Ge         | 5,6576               | IV    | Diamond     |
| High-purity GaAs       | 5.6538               | 111-V | Zinc blende |
| GaP                    | 5.4504               | III-V | Zinc blende |
| BSiC                   | 4.3596               | IV-IV | Zinc blende |
| βZnS                   | 5.4039               | II-VI | Zinc blende |

'See Appendix A for a discussion of misfit dislocations.

#### 7.2 VAPOR PHASE EPITAXY 303

Heteroepitaxial nucleation of a III-V compound semiconductor such as GaAs on Si or Ge is complicated by the fact that in III-V zinc blende crystals, the two interpenetrating sublattices are of different composition. In the case of GaAs, one is of gallium and one is of arsenic. Thus, for growth in a (100) direction (which is an orientation preferred by device designers), there are alternate layers of arsenic and gallium. Under most circumstances, the first layer to nucleate on silicon is arsenic, so over the whole wafer surface, even though stepped, the first layer will be arsenic. As more layers are added and the epitaxy thickness increases, unless each ledge is a multiple of two atomic spacings high, an As layer and a Ga layer will meet over the steps and cause an antiphase boundary defect. One solution to the problem is to use (211) oriented wafers. In this orientation, regardless of the step height, the bonding is such that the arsenic and gallium each have preferred sites, and an antiphase boundary cannot develop (14). Another solution is to use (100) oriented wafers cut a few degrees off-orientation with the tilt toward a (110) plane. Then, if the wafers are annealed a few minutes at 1000°C, regardless of the original step heights, the surface reconstructs itself so that all steps are two atomic spacings high (7). With this sort of surface, the arsenic and gallium layers will match up over the whole wafer and thus produce no antiphase boundaries.

In heteroepitaxy involving materials of different crystal structure, such as silicon on sapphire or other single-crystal oxide, nucleation is much more complex. It is generally assumed that the silicon (or other semiconductor) atoms occupy the position of the metal-for example, aluminum in Al<sub>2</sub>O<sub>3</sub> or Al and Mg in spinel-and bond to the oxygen. However, the spacings are almost always very different, and the fit will be very poor. Fig. 7.6 shows that a cluster of five atoms of silicon in a (100) configuration on a (1102) sapphire surface fits moderately well, although from this point on, there is little overlap. It can be theorized that a small stable cluster of the atoms being deposited forms homogeneously on the substrate surface and then has enough surface mobility to become oriented (15, 16). The cluster thus formed will act as a sink for atoms striking the surface nearby and keep the supersaturation so low that no new clusters will form. At some distance away, however, other clusters can form and grow. As shown in Fig. 7.7, at an early stage of growth, a series of single-crystal islands will be oriented mostly in the same direction, which is similar to growth observed on amorphous substrates, except that then the different islands are not oriented, as can be seen in Fig. 7.8. The octahedral equilibrium shape of silicon crystallites is also visible in this figure. The islands of Fig. 7.7 will grow until they coalesce and form a complete layer, at which point nucleation becomes homoepitaxial rather than heteroepitaxial. Un-

#### 304 CHAPTER 7 EPITAXY

#### FIGURE 7.6

Overlay of a (100) layer of silicon atoms onto the aluminum atoms of a sapphire (1 T 02) surface. (The A and B oxygen positions indicate locations respectively below and above the aluminum atoms. Beyond the cluster of five silicon atoms in the center of the figure, correlation with the aluminum atom positions becomes progressively worse.) (*Source:* Adapted from Arnold Miller and Harold M. Manasevit, *J. Vac. Sci. Technol. 3*, p. 68, 1966.)



fortunately, clusters oriented at different places on the substrate, while all having the same azimuthal orientation, will not be properly spaced one from the other. Thus, various crystallographic defects will occur at the boundaries between the original islands. Because of errors in nucleation, some of the original islands may also not be oriented as the others and cause small misoriented domains.

Surface mobility is a key part of providing for the proper placement of atoms, regardless of whether growth is homoepitaxial or heteroepitaxial. Since the mobility decreases as the temperature decreases, it is reasonable to assume that for a given deposition rate

#### 7.2 VAPOR PHASE EPITAXY 305

#### FIGURE 7.7

Silicon single-crystal islands formed on a (100) spinel surface. (Most of the crystallites show (100) symmetry and appear to have the same orientation.) (*Source:* Photograph courtesy of E. Sirtl.)



#### FIGURE 7.8

Small silicon crystallites that have nucleated and grown on  $SiO_2$ . (Note that most of them are octahedral in shape and that the orientation varies from crystallite to crystallite.)



there is a minimum temperature below which epitaxy cannot take place. The surface mobility is proportional to the surface diffusion coefficient  $D_s$ , which has the same form of temperature dependence as the bulk coefficient; that is,

$$D_s = D_{so}e^{-E/kT} 7.6$$

where  $D_{so}$  is a constant, *E* is the activation energy of the diffusion process, *k* is Boltzmann's constant, and *T* is the temperature in kelvins. Based on Eq. 7.6, the curve separating the maximum rate for single growth from amorphous growth would be expected to change with temperature as shown in Fig. 7.9. Experimentally, for temperatures above about 1000°C, it has been reported that the curve separating single-crystal silicon from polycrystalline growth has a slope

#### 306 CHAPTER 7 EPITAXY

#### FIGURE 7.9

Trend of line of demarcation between single-crystal and amorphous deposition. (Between the two regions, there will be a band of polycrystalline material.)





of  $\sim$ 5 eV, which is equal to the activation energy of bulk silicon selfdiffusion (15). This slope can be explained by assuming that, in the case of homoepitaxy, if bulk diffusion is high enough, the lower temperature limit might be set by bulk diffusion since the atoms could then move to the proper position even after being covered.

The previous discussion of nucleation implies that atoms of the species to be grown impinged on the growing surface. However, unless the nutrient is supplied by evaporation, as in MBE, a surface reaction must occur if particle formation above the surface is to be avoided. Such reactions can occur either when the two reactant species are adsorbed at adjacent sites on the surface or when they diffuse to adjacent sites after having been previously adsorbed at remote locations. Under such circumstances, if one component of the gas stream is strongly adsorbed, it may occupy a majority of sites and severely reduce the reaction rate. An example of this is the introduction of phosphine ( $PH_1$ ) to a silane gas stream in order to dope a silicon layer being grown from silane decomposition. Normally, the silane is adsorbed on the wafer surface and then decomposes. However, phosphine appears to occupy a large percentage of the sites normally occupied by silane and hence reduces the silicon deposition rate (16). Similarly, a major gas stream constituent may be adsorbed on very few sites and hence essentially not participate in the final reaction.

In the case of silicon deposition from SiCl<sub>4</sub>, because of simultaneous vapor phase reactions, many compounds other than SiCl<sub>4</sub>

#### 7.2 VAPOR PHASE EPITAXY 307

#### TABLE 7.2

Equilibrium Fractional Coverage of (111) Silicon

| Component            | Coverage $\theta^*$                                              | $P_e(dynes/cm^2)$   |
|----------------------|------------------------------------------------------------------|---------------------|
| н                    | 0.63                                                             | 10                  |
| Cl                   | 0.20                                                             | 0.2                 |
| SiCl <sub>2</sub>    | 0.16                                                             | 700                 |
| Vacant sites         | 0.015                                                            |                     |
| H <sub>2</sub>       | 10-4                                                             | 106                 |
| HCI                  | 10-7                                                             | $2 \times 10^{4}$   |
| SiHCl,               | $1.7 \times 10^{-10}$                                            | 4                   |
| SiCl <sub>4</sub>    | $1.5 \times 10^{-11}$                                            | 30                  |
|                      | m at 1230°C with a Cl/H ratio o<br>sure; total pressure = 1 atm. | f 0.01.             |
| Source: Adapted from | A.A. Chernov, J. Crystal Grow                                    | th 42, p. 55, 1977. |

are present in the gas stream. Table 7.2 lists a number of these compounds, along with the calculated fractional site coverage  $\Theta$  for each of them (17). It can be seen that other constituents keep SiCl<sub>4</sub> from being adsorbed on the surface, and, hence, it cannot take part in the final reaction to produce elemental silicon. SiCl<sub>2</sub>, however, is present and has a high adsorption site density. In studying probable reaction paths for cases where some of the reactions occur on surfaces, it is very important to consider surface site coverage. A later section will note that in most silicon halide reactions, SiCl<sub>2</sub> + H<sub>2</sub>  $\rightarrow$  Si + 2HCl is listed as the final step, which is in keeping with predictions based on the data of Table 7.2.

If surface reaction chemistry is assumed as the rate-limiting mechanism, the Langmuir-Hinschelwood mechanism can be used to describe the behavior (18). The rate of surface adsorption  $d\Theta/dt$  of a given molecular species is proportional to the gas pressure p of the component in question and to the fraction of uncovered sites remaining; that is,

$$\frac{d\Theta}{dt} = k_1 p(1 - \Theta)$$
 7.7

where  $k_1$  is a constant. The rate at which the species leaves is proportional to the number of sites occupied, or

$$\frac{-d\Theta}{dt} = k_2\Theta$$
 7.8

For equilibrium conditions, the number of molecules arriving and leaving must be equal, so  $\Theta$  can be solved from Eqs. 7.7 and 7.8. By assuming that only two species (A and B) take part in the deposition

reaction (SiCl<sub>2</sub> and H, for example), the reaction rate (deposition rate) can be calculated from

Rate = 
$$K\Theta_A\Theta_B$$
 7.9

where  $\Theta_A$  and  $\Theta_B$  are the surface fractions covered by A and B and K is a rate constant. By making further assumptions, Eqs. 7.7–7.9 can be solved to give (19)

Rate = 
$$\frac{Kb_A b_B p_A p_B}{(1 + b_A p_A + b_B p_B)^2}$$
 7.10

where  $b_A$  and  $b_B$  are the adsorption coefficients of species A and B. The partial pressures of A and B are  $p_A$  and  $p_B$ , respectively. Upon extending this approach to include an etching reaction based on HCl, silicon experimental data can be reasonably well fitted.

Growing an epitaxial layer requires a means of transporting the atoms of the growth material to the growth surface. The transport methods can be broadly described as follows:

- A molecular beam—for example, from the evaporation of silicon in a hard (~10<sup>-10</sup> torr) vacuum.
- 2. A vaporized compound(s) transported to the surface via a carrier gas and decomposed at the surface to give the desired material to be grown—for example, silane  $(SiH_4)$  decomposed on the surface by heat or plasma to give  $Si + 2H_2$ .
- A vaporized compound transported to the surface via a carrier gas and reduced at the surface by the carrier—for example, SiCl<sub>4</sub> reduced by a hydrogen carrier gas to give Si + HCl + other compounds such as SiHCl<sub>3</sub>.
- 4. A closed tube system with a temperature differential containing feed material at the cold end, the substrate to be overgrown at the hot end, and, for example, a siliconbearing compound that will disproportionate. The disproportionating compound reacts at the cold (seed) end to deposit material and then diffuses to the hot end, where it reacts with the feedstock material. For example, the reaction  $2SiI_2 \rightarrow Si + SiI_4$  at the cool end depletes silicon, which is replenished at the hot end via  $Si + SiI_4 \rightarrow 2SiI_2$ .

All four methods were investigated very early (1960s), and all are still considered useful. The first, molecular beam epitaxy, is primarily a laboratory process. The bulk of the commercial applications use either the second method or the third. The last method,

7.2.3 Material Transport

disproportionation, has recently been proposed for low-temperature depositions but since the 1960s has not been used commercially.

In the second and third methods, while the reacting species are brought into the reactor in a straightforward manner, the problem of getting the reactants to the growing surface still remains. If there were complete mixing of the gas over the whole reactor chamber volume, the concentration of a species at the surface would be the same as that anywhere in the chamber and would be directly calculable from the input gas concentrations. Unfortunately, such is not the case. As the gas moves downstream, it can become depleted of reactant and have a buildup of reactant products. Further, because the velocity of the gas goes to zero at the wafer surface, a "stagnant" layer will be adjacent to the wafers. There will be minimal gas mixing in the stagnant region (also referred to as a boundary or a depletion layer), so the reacting species must arrive primarily by diffusion. In some cases, this leads to diffusion-limited growth, as was discussed earlier in Chapter 4, section 4.5.1.

Vapor phase epitaxy at either atmospheric pressure or pressures of 10–100 torr and at temperatures of 950°C–1150°C is used for essentially all silicon epitaxial wafer production. However, interest in lower temperatures continues since they minimize slip<sup>7</sup> and impurity diffusion. Depositions in the 800°C–1000°C range can be done with conventional epitaxial reactors, while depositions from 600°C to 800°C are primarily by molecular beam epitaxy (MBE). Perhaps the most severe problem in any temperature range, but particularly below 900°C, is that of providing a clean, damage-free silicon surface on which to make the epitaxial deposition. Other problems are the incorporation of excessive amounts of substrate dopant in the first part of the layer deposited (autodoping) and the shifting and distortion of shallow depressions in the wafer surface as an epitaxial layer is added (pattern shift and distortion).

In order to prevent defects such as stacking faults and spurious polycrystalline growths from originating at the substrate-layer interface, the initial surface must be both clean and damage free. The standard cleanups, such as the RCA procedure discussed in Chapter 3, do a good job of removing most troublesome contaminants. However, the thin layer of oxide remaining after etching and, if deposition is at low temperature, the residual carbon from adsorbed organics

7.3

VAPOR PHASE SILI-CON EPITAXY

7.3.1 Surface Cleaning and Vapor Phase Etching

The mechanics of slip generation are discussed in Chapter 8. The way slip is introduced during the epitaxial process will be described later in this chapter.

must all be removed before epi. High-temperature depositions will allow any carbon that does remain to diffuse into the substrate (20). Heavy metal contaminants adsorbed on the surface must be removed, not because they interfere with epitaxial growth, but because they will dissolve in the silicon during the high-temperature epitaxial operation and then precipitate in subsequent oxidation steps to give haze. Aqueous-based cleanups as described in Chapter 3 can be used just before the epitaxial step to remove any metals present. Damage left from the mechanical polishing operation must also be removed, and for this purpose high-temperature HCl vapor phase etching in the epitaxial reactor was introduced (21-23). However, because of advances in substrate polishing techniques, the problem of removing additional material in the reactor is no longer as important as it once was. As processing temperatures have decreased, the major problem has been that of removing the residual oxide.

Vapor phase reactions that have been studied for in situ silicon etching in an epitaxial reactor are listed in Table 7.3. The first four remove  $SiO_2$ , with the reactions done under high vacuum being designed primarily for MBE. Even with a hydrogen carrier gas, the

#### TABLE 7.3

Silicon Vapor Etchants for Preepitaxy Etching

| Process                                   | Reaction                                                                         | Minimum Temperature (°C) | Reference*   |  |
|-------------------------------------------|----------------------------------------------------------------------------------|--------------------------|--------------|--|
| Thermal etching<br>(high vacuum)          | $SiO_2 + Si \rightarrow 2SiO$                                                    | 800-900                  | (25, 26–28)  |  |
| Ga beam etching<br>(high vacuum)          | $SiO_2 + 4Ga \rightarrow 2Ga_2O + Si$<br>$SiO_2 + 2Ga \rightarrow Ga_2O + SiO_2$ | <800                     | (29)         |  |
| Si beam etching (high<br>vacuum)          | $SiO_2 + Si \rightarrow 2SiO$                                                    | 700                      | (27)         |  |
| Thermal etching<br>(atmospheric pressure) | $SiO_2 + Si \rightarrow 2SiO$                                                    | 1000                     | (24, 30)     |  |
| Hydrogen etching                          | $SiO_2 + H_2 \rightarrow SiO + H_2O$                                             | 1000                     | (24, 31, 32) |  |
| HI + HF in helium                         | $Si + 4H1 \rightarrow Sil_4 + 2H_2$<br>$SiO_7 + 4HF \rightarrow SiF_4 + 2H_2O$   | 900                      | (30)         |  |
| Hydrogen sulfide                          | $Si + H_2S \rightarrow SiS_2 + 2H_2$<br>SiS <sub>2</sub> + Si $\rightarrow 2SiS$ | 950                      | (33, 34)     |  |
| Cl in helium                              | $Si + 2CL_2 \rightarrow SiCl_4$                                                  | 1000                     | (35)         |  |
| H1 in hydrogen                            | $Si + HI + H_2 \rightarrow mSiI_4 + \cdots$                                      | 1000                     | (36)         |  |
| Sulfur hexafluoride                       | $4Si + 2SiF_6 \rightarrow SiS_7 + 3SiF_4$                                        | 1050                     | (37, 38)     |  |
| HCl in hydrogen                           | $Si + 2HCL \rightarrow SiCl_{2} + H_{2}$                                         | 1100                     | (22, 23, 39) |  |
| Water vapor                               | $Si + H_2O \rightarrow SiO_2 + 2H_2$<br>Si + SiO_3 \rightarrow 2SiO              | 1250                     | (40)         |  |
| HBr in hydrogen                           | $Si + H_2 + HBr \rightarrow mSiBr_4 + \cdots$                                    | 1250                     | (41, 42)     |  |
| *See end-of-chapter referenc              | es.                                                                              |                          |              |  |

third reaction is thermodynamically much more likely to proceed than is the fourth reaction, as can be seen from the equilibrium partial pressure  $(p_e)$  of SiO versus temperature shown in Fig. 7.10 for each reaction (24). One difficulty with converting  $SiO_2$  + Si to silicon monoxide at low pressures is that, at the lower temperatures, the reaction progresses so slowly that substantial silicon evaporation and pitting occur in the first regions that become free of oxide (25). If a 1200°C heat cycle is used, then the low-temperature advantage of MBE is partially lost. One approach to speed the reaction at lower temperatures is to deposit a small amount of silicon on top of the residual oxide while the wafer is held at 700°C-900°C (27). Another approach is to deposit a small amount of gallium, which will also react with SiO<sub>2</sub> to give a vaporizable product (29). Sputter etching (not listed in the table) is another possible method for removing oxide, but substantial damage can be introduced that may require extensive high-temperature annealing for removal (43).

The rest of the reactions in Table 7.3 remove silicon and are

### FIGURE 7.10

Equilibrium partial pressure of SiO versus temperature for reactions of SiO<sub>2</sub> with Si and H<sub>2</sub>. (*Source:* P. Rai-Choudhury and D.K. Schroder, J. Electrochem. Soc. 118, p. 106, 1971. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



intended for damage removal. However, when used at temperatures above about 1000°C to 1050°C, any oxide present will also be removed, along with the silicon. Thus, even though Table 7.3 lists some reactions as useful down to 900°C, from a practical standpoint, not one of these reactions is satisfactory below 1000°C-1050°C. The minimum temperatures listed in the table were based on keeping a useful etch rate and providing a polishing etch rather than one that produced pitting. The most commonly used silicon vapor etch is HCl, typically at concentrations of between 2% and 4% at a temperature of about 1150°C. Fig. 7.11 shows the general way in which the etch rate varies with temperature and concentration, although the values will depend on the specific reactor design. Depending on the operating conditions, most of the vapor etchants will have polishing regions and regions in which the surface will be rough and pitted. Fig. 7.12 shows the approximate position of the temperatureconcentration demarcation for HCI-H2.

An alternative etchant useful at lower temperatures when the surface is already essentially damage free is hydrogen. It was the only etchant available when silicon epitaxy was introduced, but because of the imperfect polishing, it seldom removed enough silicon to ensure good-quality epi. It has now been found that if low-pressure ( $\sim 25$  torr) operation is combined with the use of carefully polished wafers with minimal residual oxide, hydrogen etching can be successfully used down to 950°C (44).



#### FIGURE 7.11

Effect of temperature and HCl concentration on etch rate of silicon when HCl in hydrogen is used for the etchant. (Temperatures have not been corrected for emissivity.) (*Source:* Adapted from K.E. Pean and Paul Gleim, late newspaper, Electrochemical Society Meeting. Fall 1963.)







The reactions that have been used to produce silicon at the wafer surface fall into the three general categories of hydrogen reduction, disproportionation, and thermal decomposition. An example of each is as follows:

1. Hydrogen reduction

 $SiCl_4 + 2H_2 \rightarrow Si + 4HCl$  7.11

2. Disproportionation

 $SiI_4 + Si \rightarrow 2SiI_2$  (higher temperature) 7.12

 $2SiI_2 \rightarrow Si + SiI_4$  (lower temperature) 7.13

3. Thermal decomposition

$$SiH_4 \rightarrow Si + 2H_2$$
 7.14

Of these, only hydrogen reduction and thermal decomposition are currently used, with halides and silanes as the most common source compounds. In principle, the fluorine, chlorine, bromine, and iodine series of compounds, listed in Table 7.4, could all be used.<sup>8</sup> However, SiF<sub>4</sub>, for example, can be reduced by hydrogen only above 2000°C (45), SiI<sub>4</sub> requires heated lines since it does not boil until

<sup>\*</sup>This list is not complete. A variety of more complex H-Cl-Si compounds could conceivably be used.

TABLE 7.4

Properties of Some Silicon Halides

| Compound                         | Melting Point (°C) | Boiling Point (°C) | $H_f(kcal/mol)^*$ |
|----------------------------------|--------------------|--------------------|-------------------|
| SiF4                             | Sublimes           |                    | - 370 †           |
| SiHF,                            | - 131              | -80                |                   |
| SiH <sub>2</sub> F <sub>2</sub>  | -122               |                    |                   |
| SiH,F                            |                    | - 99               |                   |
| SiCl4                            | - 70               | 58                 | - 157 ‡           |
| SiCl,                            |                    |                    | -38 ‡             |
| SiHCl <sub>3</sub>               | - 126              | 33                 | -117 ‡            |
| SiH,Cl,                          | -122               | 8                  | -75 #             |
| SiH,Cl                           | -118               | - 30               | -34 ‡             |
| SiBr <sub>4</sub>                | 5                  | 153                | -95 §             |
| SiBr <sub>2</sub>                |                    |                    | -12 §             |
| SiHBr,                           | -73                | 109                | -73 §             |
| SiH <sub>2</sub> Br <sub>2</sub> | -70                | 66                 | -46 §             |
| SiH,Br                           | - 94               | 2                  | -19 §             |
| Sil,                             | 124                | 290                | - 32 §            |
| Sil <sub>2</sub>                 |                    |                    | +18 §             |
| SiHI,                            | 8                  | 220                | -21 §             |
| SiH <sub>2</sub> I <sub>2</sub>  | -1                 | 150                | -11 §             |
| SiH <sub>3</sub> I               | - 57               | 45                 | -2 §              |

\*Heat of formation at 298°C.

†E.A.V. Ebsworth, Volatile Silicon Compounds, The Macmillan Co., New York, 1963.

\$L.P. Hunt and E. Sirtl, J. Electrochem. Soc. 119, p. 1741, 1972.

§L.P. Hunt and E. Sirtl, J. Electrochem. Soc. 120, p. 806, 1973.

Source: Adapted from Comprehensive Inorganic Chemistry, Vol. 7, Sneed and Brasted, D. Van Nostrand Co., New York, 1958.

290°C, and SiI<sub>2</sub> decomposes at room temperature. Silane (SiH<sub>4</sub>), also referred to as monosilane, is widely used, but the next higher silane, Si<sub>2</sub>H<sub>6</sub> (disilane) has also been studied (46). Table 7.5 lists the properties of some of the silanes. SiH<sub>4</sub> is quite stable at room temperature, but as the order increases, the stability decreases, with Si<sub>6</sub>H<sub>14</sub> decomposing at room temperature over a period of a few months (47). The thermal decomposition of carbon–silicon compounds such as CH<sub>3</sub>SiCl<sub>3</sub> can also provide a source of silicon (48), but, in general, the decomposition of carbon-containing compounds produces carbon along with the silicon and leads to polycrystalline growth.

The final choice of a feed material will depend not only on whether growth can be made to take place but also on the relative complexity of the necessary equipment, on the relative safety of the compound, and on which compounds are readily available at reasonable cost in the required purity range. Table 7.6 lists the most common feed materials, along with the probable reaction path for each. In all cases involving hydrogen reduction, the reactions are

| TABLE 7.5             | Gas                            | Melting Point (°C)                            | Boiling Point (°C) |
|-----------------------|--------------------------------|-----------------------------------------------|--------------------|
| Properties of Silanes | SiH                            | - 185                                         | -112               |
|                       | Si <sub>2</sub> H <sub>6</sub> | - 133                                         | -15                |
|                       | Si <sub>3</sub> H <sub>8</sub> | -117                                          | 53                 |
|                       | Si4H10                         | - 94                                          | 80                 |
|                       |                                | and a second state of the second state of the |                    |

Source: Compiled from data in E.A.V. Ebsworth, Volatile Silicon Compounds, The Macmillan Co., New York, 1963.

### TABLE 7.6

Common Source Gases for **Epitaxial Silicon** 

| Source             | Deposition<br>Temperature* (°C)                                 | Possible Main<br>Reaction Path†                |
|--------------------|-----------------------------------------------------------------|------------------------------------------------|
| SiCl               | 1150-1250                                                       | $SiCl_4 + H_2 \rightleftharpoons SiHCl_3 + HC$ |
|                    |                                                                 | $SiHCl_3 \rightleftharpoons SiCl_3 + HCl$      |
|                    |                                                                 | SiCl, + H, ≓ Si + 2HCl                         |
| SiHCl <sub>3</sub> | 1100-1200                                                       | SiHCl <sub>3</sub> ⇒ SiCl <sub>2</sub> + HCl   |
|                    |                                                                 | $SiCl_2 + H_2 \rightleftharpoons Si + 2HCl$    |
| SiH,Cl,            | 1000-1100                                                       | SiH <sub>2</sub> Cl <sub>2</sub> ≓ Si + 2HCl   |
| SiH <sub>4</sub>   | 950-1050                                                        | $SiH_4 \rightarrow Si + 2H_2$                  |
| *Typical.          |                                                                 |                                                |
|                    | and M. Saito, J. Crystal Grow<br>ctrochem. Soc. 121, p. 899, 19 | oth 52, p. 213, 1981, and R.F.C.               |

much more complex than that indicated by Eq. 7.11, and several intermediates are formed, including small quantities of long-chain, oily, explosive compounds. The equilibrium partial pressures of the various components that exist in an H2-Cl-Si system as a function of temperature and Cl/H ratio have been calculated (49). Some of these are shown in Fig. 7.13 and can be used as a guide in judging what reactions are probable during deposition. Also of help is the sampling of the gas stream just above the growing silicon surface in order to see what products are present. The possible equations given in Table 7.6 were based on such data (50, 51).

Silicon epitaxial deposition rates can range from infinitesimally small up to several microns per minute, depending on the temperature and feed material. However, since layer thicknesses are generally from 1 µm to 10µm, a rate of a few tenths up to a few microns per minute is a range that provides good crystallographic quality and thickness control. Slower rates can make the deposition time so long as to be economically unattractive, and substantially higher rates lead to layer defects.

### 7.3.3 Deposition Rates

#### FIGURE 7.13

Equilibrium partial pressure versus temperature for major constituents of the vapor above a free silicon surface when only hydrogen, chlorine, and silicon are present. (Source: E. Sirtl, L.P. Hunt, and D.H. Sawyer, J. Electrochem. Soc. 121, p. 919, 1974. Reprinted by permission of the publisher. The Electrochemical Society, Inc. Note: Recent revisions in values of the thermodynamic data used in calculating these curves will lead to some changes. See L.P. Hunt, J. Electrochem. Soc. 135, p. 206, 1988.)



The rate behavior versus temperature and gas flow is shown in Fig. 7.14. At low temperatures, the rate r versus temperature is closely given by an Arrhenius-type equation (region A of Fig. 7.14a):

$$r = A e^{-E/RT} 7.15$$

where A is a constant, E is an activation energy, R is the gas constant, and T is the absolute temperature. In this temperature range, reaction rates (kinetics) determine the deposition rate. As the temperature increases, however, a point is reached where the nutrient supply rather than reaction rate is the limiting factor. When the reactants reaching the surfac e are limited by diffusion through the stagnant gas film adjacent to the wafer, the rate becomes relatively temperature insensitive, as shown by the flat portions of the curves of Fig. 7.14a (curves B). When this region is encountered, it is, in principle, possible to increase the rate up to the rate predicted by curve A by increasing the flow rate. The increased flow reduces the thickness of the boundary layer discussed in Chapter 4 and increases the diffusion rate until it is no longer the limiting step. This increase of rate is shown in Fig. 7.14b. Based on simple theory of the reduction in layer thickness with flow, a plot of the rate versus square root of flow should give a straight line until the rate approaches that predicted by Eq. 7.15 (52). In fact, this is not necessarily observed ex-

### FIGURE 7.14

Dependency of deposition rate on temperature and flow. (When the temperature becomes high enough for homogeneous nucleation to occur in the gas stream, the rate decreases as indicated by the dashed lines in part a.)



perimentally, but the maximum rate observed does appear to match that predicted by an extrapolation of the Arrhenius curve measured at lower temperatures (53). From a practical standpoint, since it is difficult to maintain uniform temperature over a large multiwafer reactor, diffusion-limited operating conditions are often chosen in order to minimize layer thickness sensitivity to temperature.

The activation energy E is a function of the reaction feed material being used, although the "apparent activation energies" that have been measured vary widely. Some data, based on polycrystalline deposition rates, suggest that there is little difference in E for the three common halide reactants of Table 7.6 and that the value is ~33 kcal/mol (54). Other values for SiCl<sub>4</sub> range from 37–50 kcal/mol (31, 53, 55, 56). There seems to be reasonable agreement of  $\sim 22$ kcal/mol for SiHCl<sub>3</sub> (57). Values reported for SiH<sub>2</sub>Cl<sub>2</sub> range from ~5-13 kcal/mol at atmospheric pressure (58) to ~24 kcal/mol at reduced pressure (59). The lower values apparently arose from making measurements in the diffusion-controlled region. Reported silane activation energy values also vary widely, ranging from 10-37 kcal/mol (51, 60). It has been demonstrated that a high level of boron can reduce the apparent activation energy (51) and that possibly contamination of the surface with other materials such as oxygen or carbon will increase it. Despite the fact that the deposition rate in the kinetically controlled region has been shown to be orientation dependent, no report has been given of differing activation energies depending on orientation. Thus, the orientation effect is contained in the pre-exponential term (A) of Eq. 7.15. The wide variability in activation energy data leads once again to the statement that many of the properties of interest in semiconductor processing are highly dependent on the experimental conditions. Hence, checking the performance of any new reactor configuration rather than assuming its behavior is advisable.

EXAMPLE

Using the definition of the activation energy E given in Eq. 7.15, verify that the activation energy of the curve shown in Fig. 7.14a really is -22 kcal/mol.

Take the log of both sides of Eq. 7.15,  $\ln r = \ln A - E/RT$ . Choosing two values of r,  $(r_1 \text{ and } r_2)$  and determining the corresponding values of  $1/T_1$  and  $1/T_2$  give, after some manipulation,  $E = \ln(r_1/r_2)/[(1/T_2) - (1/T_1)]$ . From the curve, for  $r = 1 \mu m$ ,  $T = 1200^{\circ}$ C and  $1/T = 0.68 \times 10^{-3} \text{ per K}$ . For  $r = 0.3 \mu m$ , 1/T = 0.79. From Appendix C,  $R = 2 \times 10^{-3} \text{ kcal/K} \cdot \text{mol}$ . Doing the arithmetic gives  $E = (2/0.11) \ln(3.33) = 21.6 \text{ kcal/mol}$ .

As the concentration of SiCl<sub>4</sub> or SiHCl<sub>3</sub> increases, the rate initially increases but then peaks and eventually decreases and becomes negative so that etching occurs, as shown in Fig. 7.15. Dichlorosilane behavior up to about 12% is also shown. Presumably, it will also etch silicon at higher concentrations. The silane decomposition reaction appears to be first order in that the rate increases linearly with partial pressure. Thus, unlike the halides, where increased concentration soon leads to a reduced deposition rate, rates

### FIGURE 7.15

Effect of increasing halide concentration on deposition rate. (Quantitative comparisons between halides should not be made based on these curves since the data were obtained from different reactors operating under different conditions.)



can be very high and have been reported in excess of 100  $\mu$ m/minute at 1100°C (15). HCl is sometimes added to silane during deposition, where it reduces the rate as well as gas phase nucleation at higher temperatures (15, 61).

Processing temperatures have gradually decreased since the introduction of the IC because of the desire to minimize high-temperature-induced defects and impurity redistributions due to diffusion. In the case of epitaxy, the specific driving forces have been the desire to reduce substrate up-diffusion because of the trend toward thinner epitaxial layers, the need to minimize wafer slip during epitaxy, and the desire to minimize the effect of deposition on oxygen precipitation behavior. As curve A of Fig. 7.14 predicts, when the temperature is reduced to the 700°C-800°C range, the deposition rates become much less. Fig. 7.16 shows how the low-temperature rates would be expected to drop for activation energies appropriate for the common silicon sources. For these curves, a rate of 1 µm/minute was assumed at 1050°C. Also shown are some experimental points for the atmospheric pressure deposition of SiCl<sub>4</sub>, SiH<sub>2</sub>Cl<sub>2</sub>, and silane, as well as two 775°C rates obtained from lowpressure silane depositions. These data and curves indicate that usable rates can be obtained with some sources down at least to the 750°C range. However, as the temperature is lowered, it becomes progressively harder to initially clean the surface of oxide and to keep it clean during deposition. As will be discussed in a later section, nucleation on an oxide surface is much more difficult than nucleation on silicon. Thus, if oxide either is allowed to form during deposition or is not completely removed beforehand, the crystalline

### FIGURE 7.16

Projections of low-temperature deposition rate behavior based on activation energies. (Individual points represent experimentally observed values from the following sources: SiH,Cl, data from John D. Borland and Clifford 1. Drowley, Solid State Technology, p. 141, 1985: Silane data from Hseuh-Rong Chang, ECS Extended Abst. 85-1, abst. no. 276, L.D. Dyer, AIChE J. 18, p. 728, 1972, J. Bloem, J. Crystal Growth 18, p. 70, 1973, and T.J. Donahue et al., Appl. Phy. Lett. 44, p. 346, 1984; SiCl, data from S. Nakanuma, IEEE Trans. Electron Dev. ED-13, p. 578, 1966.)

### FIGURE 7.17

Deposition rate versus chamber pressure for a barrel reactor. (*Source:* From data in R.B. Herring, Applied Materials Tech. Rpt. HT-010, 1980.)



perfection will be severely degraded and the deposition rate reduced from that predicted from an extrapolation of an Arrhenius plot such as the one in Fig. 7.16.

As the chamber pressure is reduced, the deposition rate for a given feedstock concentration decreases first rather slowly and then quite rapidly when the pressure decreases below about 100 torr. Fig. 7.17 shows the general behavior for the case of SiCl<sub>4</sub> and SiH<sub>2</sub>Cl<sub>2</sub>

(62). The major effect of reducing pressure is to reduce the width of the stagnant layer next to the wafer surface. This reduction can, in turn, reduce autodoping (as discussed in a later section) and often can provide better thickness uniformity.

Under some conditions, the deposition rate depends on the crystallographic orientation. Thus, flats or "facets" perpendicular to the slow-growth directions form on curved surfaces. The study of orientation effects therefore includes both a measurement of deposition rates on various chosen planes and an examination of the orientation of facet faces, their relative size, and their frequency of occurrence. The experimental data are rather sparse, but data based on growing from SiCl<sub>4</sub> at about 1200°C onto wafers cut to given orientations indicate that the deposition rate varies in the order (111) < (100) <(110), with the total variation being about 1:2 (10, 63). Differences in growth rates on variously oriented wafer surfaces would be expected to be a maximum when deposition rates are surface controlled (in region A of Fig. 7.14a). Unfortunately, most of the rate-orientation data was taken during the time period when high temperatures were deemed necessary for good-quality growth, and it is not clear whether depositions were in the diffusion-controlled or kinetically controlled regions. However, a more recent plot of log growth rate versus 1/T for SiH<sub>2</sub>Cl<sub>2</sub> + HCl at reduced pressure in the 900°C-1000°C temperature range indicates the same activation energy for deposition on both (100) and (111) planes, with a rate difference of about 2:1 (59). If the activation energy is indeed independent of the plane of deposition, then the orientation effect is contained in the pre-exponential term of Eq. 7.15.

As an alternative to directly measuring rate, the comparative width of facets that form on round rods or hemispheres can be studied (64–66). During facet formation, the slow-growing faces become the largest. However, after prolonged growth, orientation of the long dimension of the rod or the base of the hemisphere affect which faces remain. For example, if crystal growth is on a rod running in a <111> direction, the rod first becomes roughly hexagonal, with alternate (110) and (211) planes running the length of the rod. With further growth, however, the (110) planes disappear, and the rod becomes bounded completely by (211) planes. If a (211) rod is used, two of the opposing faces will be (111), but to form a closed figure, two pairs of (311) planes are also required.

Facet formation is a commonly studied feature of crystal growth, whether it is from the melt, a solution, or a vapor. Column 1 of Table 7.7 lists facets that have been reported for silicon-melt growth along with the growth rate order in directions perpendicular to the facets. Column 2 lists the facets observed during vapor phase

7.3.4 Effect of Orientation on Deposition Rate

#### TABLE 7.7

Silicon Facet Occurrence and Order of Growth Rate for Various Orientations

|     |     | Refer | ences |     |     |
|-----|-----|-------|-------|-----|-----|
| (1) | (2) | (3)   | (4)   | (5) | (2) |
| 110 | 211 | 110   | 110   | 110 | 100 |
| 311 | 100 | 211   |       |     | 311 |
| 100 | 110 |       | 100   | 100 | 110 |
|     | 311 | 311   | 211   |     |     |
| 111 | 111 | 111   | 111   | 111 | 111 |

Note: Rate increases from bottom to top.

References: (1) G.A. Wolff, Am. Mineralogist 41, p. 60, 1956. (2) C.H.J. van den Brekel, J. Crystal Growth 23, p. 259, 1974. (3) J.E. Allegretti et al., pp. 255– 270, in Ralph O. Grubel, ed., Metallurgy of Elemental and Compound Semiconductors, Interscience Publishers, New York, 1961. (4) S. Mendelson, J. Appl. Phys. 35, p. 1570, 1964. (5) S.K. Tung, J. Electrochem. Soc. 112, p. 436, 1965.

growth onto a silicon hemisphere; column 3, the facets observed for growth onto variously oriented silicon rods. The other columns show data taken after growth onto wafers cut to specific orientations. It can be seen from columns 1–3 that the same facets occur during growth from the melt as during growth from the vapor, and all belong to planes of a (110) zone.<sup>9</sup> The fact that the same facets are found in both melt and vapor phase grown material indicates that it is crystal structure and not a surface reaction that determines which facets are likely. However, it is quite possible for the kinds of atoms adsorbed on the surface to change the relative growth rates of the facets.

Crystal habit changes (a change in the slow-growing planes) can be observed when crystals are grown from water solution if appropriate impurities are added to the growth solution (67). A similar phenomenon is observed when silicon is grown by vapor phase disproportionating reactions containing tellurium in that the slowestgrowing planes become (100) and cubes of silicon grow instead of octahedrons (68). Some studies of faceting of small depressions in (100) oriented wafers have shown that the faceting becomes more pronounced as the temperature decreases and less pronounced as the operating pressure is decreased (69). It also appears that window faceting decreases as the number of chlorine atoms in the feedstock decreases (although deposition from SiH<sub>4</sub> still shows some faceting) (70). Decreased faceting with decreasing pressure is consistent with some reaction species adsorbed on the growing surface enhancing

<sup>&</sup>quot;See Appendix A for a discussion of crystallographic zones.

the growth of a particularly oriented facet, probably a (311) in this case.

### 7.3.5 Layer Doping

To provide doping of the epitaxial layer requires co-depositing a suitable dopant with the silicon. The dopant concentration in the layer necessary for a given resistivity is well defined and is available in graphic form. (See Chapter 8 for a discussion and curve.) The first doping procedure used liquid SiCl<sub>4</sub> as the silicon feed material and mixed a dopant halide such as boron trichloride, antimony pentachloride, or phosphorus trichloride with it in the supply vessel. Thus, as the carrier gas picked up SiCl<sub>4</sub> in the bubbler,<sup>10</sup> it also picked up some of the dopant and simultaneously carried it into the reactor where it too was reduced by the hydrogen and co-deposited with the silicon. Since the two vapor pressures are usually not the same, the relative concentration changes as the volume of SiCl<sub>4</sub> is reduced during deposition.<sup>11</sup> Thus, in turn, the relative concentration in the gas stream changes and thus the resistivity of the layer.

In order to provide more uniformity of doping and to improve the ease of changing resistivity, the dopant and the silicon-bearing material are now added separately to the carrier gas stream. Liquid sources can still be used, but it is much more common to use gaseous sources such as phosphine (PH<sub>3</sub>), diborane ( $B_2H_6$ ), and arsine (AsH<sub>3</sub>). Since these gases are very toxic (see Chapter 4), they are generally supplied diluted with either hydrogen or an inert gas. An alternate method of supplying small quantities of doping hydride (but one that has seldom been used) is to manufacture it on demand from a high-frequency discharge (71).

Over the range of temperatures and doping levels normally encountered in epitaxy, the silicon growth rate can be considered independent of the amount or kind of doping. However, in the 600°C temperature range commonly used for polysilicon deposition, some dopants do affect the silicon deposition rate. Phosphorus, for example, can severely reduce the rate apparently because, at low temperatures, the phosphorus successfully competes for surface sites and reduces silicon compound adsorption (72).

<sup>&</sup>quot;A bubbler is a common contrivance for saturating a gas with vapor from a liquid. The carrier gas is allowed to bubble up through a column of liquid, during which time, if the liquid path is long enough, the bubbles become saturated with vapor from the liquid. By changing the temperature of the liquid, and hence its vapor pressure, the amount of liquid transferred to the carrier gas stream can be changed. For a further discussion, see Chapter 8.

<sup>&</sup>quot;As pointed out by H.C. Theuerer of Bell Laboratories, by blowing the carrier gas over a frit (wick) constantly wet by the liquid, this problem can be eliminated since the wick provides isolation from the reservoir.

For a fixed dopant concentration, the amount of antimony, arsenic, and phosphorus incorporated into the layer decreases with increasing deposition temperature (73, 74), while boron increases (75). When growth is in the diffusion-controlled region, increasing the deposition rate causes the amount of antimony, arsenic, and phosphorus to increase so that the resistivity drops (74).

The behavior of phosphorus has been studied more extensively than that of other dopants, and the phosphorus-bearing species present in the gas stream as a function of temperature has been calculated for the case of a phosphine doping source (76, 77). As long as the doping is low enough for the layer to be intrinsic at the deposition temperature, the doping level is proportional to the concentration of atomic P in the gas stream. Over the lower range of PH<sub>3</sub> concentrations, the atomic P concentration is proportional to that of PH<sub>3</sub> in the input gas stream. Above a point that corresponds to a doping level of about 1018 atoms/cc, the atomic P concentration is proportional to the square root of the PH, concentration. When the conditions are such that the layer is extrinsic at the deposition temperature, the charge of the dopant ion affects its incorporation, and the doping level becomes proportional to the square root of the atomic P concentration (and thus generally to the fourth root of the PH<sub>3</sub> concentration).

### EXAMPLE

If the doping level is  $10^{18}$  atoms/cc, for what deposition temperatures will the layer be extrinsic at deposition temperature? As the criterion for intrinsic material, let the number of ionized dopant atoms be equal to n/2. From the curve of  $n_i$  versus temperature given in Chapter 8, it is found that  $n_i = 2 \times 10^{18}$  carriers/cc at 800°C.

Based on the preceding example, since normal doping levels for epitaxy are below 10<sup>18</sup> atoms/cc and the temperatures are generally above 1000°C, phosphorus doping can be expected to be proportional to the phosphine concentration.

When abrupt concentration profiles, such as a high-resistivity layer on a low-resistivity substrate, are attempted, the actual impurity profile, instead of being abrupt as shown in Fig. 7.18a, appears as shown in Fig. 7.18b. Not only is the transition region more graded than would be expected from diffusion alone, but also the highresistivity layer may not be as high as anticipated. The gradation is due in part to diffusion from the substrate that occurs during epitaxial growth and in part from gas phase transport of dopant from the substrate (autodoping). The procedure for calculating the amount of

7.3.6 Impurity Redistribution during Epitaxy  $\Box$ 



diffusion is discussed in Chapter 8. Ordinarily, the effect of autodoping overshadows that of diffusion. However, when diffusion is dominant, there are some guidelines for reducing its effect.

**Diffusion** To a first approximation, when diffusion is dominant, the distance X from a concentration step to the point where the concentration is reduced by 1/e is given by

$$X = \sqrt{Dt}$$
 7.16

where D is the solid-state diffusion coefficient and t is the diffusion time. X can be used as a measure of diffusion; smaller X's imply less diffusion. The time t is equal to the time of epitaxial growth and is d/r where d is the thickness of the layer and r is the epitaxial growth rate. Thus,

$$X = \sqrt{\frac{Dd}{r}} = \sqrt{d}\sqrt{\frac{D}{r}}$$
 7.17

At a given temperature, X is decreased if the deposition rate is increased. Both D and r are temperature dependent, but, generally, r changes less than D, particularly when deposition is in the diffusion-limited region. Thus, lowering temperature will also reduce X.

EXAMPLE

How much will X decrease for a silicon deposition if the deposition temperature is reduced from 1100°C to 900°C? Assume that the substrate is doped with arsenic.

A typical deposition rate at 1100°C is 1  $\mu$ m per minute. A typical rate at 900°C is 0.1  $\mu$ m per minute. The diffusion coefficients of arsenic in silicon at these two temperatures are  $\sim 2 \times 10^{-14}$  and  $\sim 1 \times 10^{-16}$  cm<sup>2</sup>/s, respectively. Substituting these values in Eq. 7.17 gives a reduction factor of ~4.5.

Autodoping Autodoping occurs because impurities find their way from heavily doped wafer regions into the gas ambient and/or onto undoped surfaces and then co-deposit with the growing film. Initially, it was speculated that the growth mechanism using silicon halide sources would involve some etching as well as growth and by that means the dopant would enter the gas stream. However, it has been demonstrated that the effect is still present when a silane source is used, in which case no co-etching is possible. It now appears that the dopant leaves the surface primarily by evaporation. It may also diffuse from the bulk to the surface and then move laterally without ever leaving the surface. Fig. 7.19 shows possible paths for the two cases of a heavily doped wafer (appropriate for discrete devices or MOS) and a lightly doped wafer with heavily doped local areas (the usual bipolar case). During any time at temperature before deposition begins, dopant can evaporate, enter the



### FIGURE 7.19

gas stream, and can either deposit on adjacent uncovered susceptor surfaces in the case of heavily doped wafers or can deposit adjacent to diffused areas in bipolar wafers. The edges and back of heavily doped wafers may not be totally covered during deposition and may continue to outgas throughout the deposition cycle. In this case, the outgassing dopant appears as extra background doping, as shown in Fig. 7.18b. For growth over small areas of local high doping, not only will autodoping occur above the heavily doped region as indicated in Fig. 7.18b (vertical autodoping), but also lateral doping will take place out from the region boundaries (lateral autodoping).

Autodoping is an effect that can materially affect the performance of devices made in epitaxial material,<sup>12</sup> and consequently numerous studies have been published since the effect was reported in 1961 (78). Early models were concerned with the effect from heavily doped wafers and considered that the dopant was transferred first to the ambient gas and then to the growing layer. One model considered only the front side as a source (79); others considered that the major contribution was from the back and that it was never completely covered and sealed (80–82). A model was next developed that considered front- and back-side sources as well as diffusion (83). Since bipolar wafers with subcollector diffusions have no heavily doped back-side, and since wafers that do have such doping are generally sealed with an oxide or polysilicon, a strictly front-side source model is required in these cases. The salient experimental observations are as follows:

- 1. In a horizontal reactor, the effect increases for downstream wafers (84).
- 2. A doping tail occurs both normal to the surface (axially) (78) and laterally out from individual heavily doped subcollector regions (85).
- 3. On an individual wafer, the effect is more pronounced the larger the fractional area covered by subepitaxial diffusions and varies approximately as the square root of the fraction (86).
- Both chlorine-bearing silicon source and silane source depositions show autodoping (87).
- 5. Of the common silicon dopants, Sb produces the least autodoping (84).

<sup>&</sup>lt;sup>12</sup>As examples, excess lateral doping can reduce packing density, and axial (vertical) autodoping, which has the same effect as excessive substrate up-diffusion, can cause reduced collector-base breakdown voltages unless the layer thickness is increased.

- 6. A high-temperature prebake in  $H_2$  will minimize the effect (84).
- 7. For some reactor configurations, higher deposition rates increase autodoping (88).
- For As and Sb, decreased deposition temperature increases autodoping (89). For B, it decreases autodoping (90). For P, it has been reported to decrease (90) and increase autodoping (44).
- 9. For As and Sb, decreased deposition pressure decreases autodoping (91). For B and P, the reverse is true (90).

Models now assume that the majority of the dopant comes from out-diffusion during preepitaxial heating and that it is temporarily stored by surface adsorption (89, 92-94). For some reactor designs and operating conditions, autodoping profile modeling is now included in the Stanford University computer modeling program SUPREM (95). The reduction in autodoping when Sb is used occurs because of its reduced efficiency in being transferred into the growing silicon (81). However, because the low segregation coefficient of Sb prevents growing heavily Sb-doped Si crystals and because its low solid solubility in Si prevents heavy doping by diffusion, the resistivity of Sb-doped wafers and Sb-diffused regions in high-resistivity wafers cannot be made as low as those of As. Hence, As doping is sometimes used. The increase in autodoping for As and Sb as the temperature is lowered and the decrease for B and P are apparently due to different temperature behaviors for the co-deposition processes of the different dopants. Note that the spread in the reported magnitude of this effect is considerable and that some data show little difference in behavior between dopants (96). The increase in autodoping with growth rate has been explained both in terms of a decreased time for the gas stream to carry away the dopant (88) and in terms of a decreased time for the concentration of adsorbed impurities on the surface to equilibrate with the gas ambient and the growing interface (89). Because it increases the gas diffusion coefficients, a reduction in operating pressure would seem to lead to a more efficient transfer of dopant to the part of the gas stream being swept from the reactor and thus decrease autodoping. Experimentally, autodoping for Sb and As is reported to be reduced at low pressures, while B and P autodoping is increased (90). It is also observed that autodoping behavior as a function of both pressure and growth rate is dependent on the deposition temperature (44, 96).

7.3.7 Pattern Shift and Distortion When a shallow, flat-bottomed depression in a silicon wafer is overgrown by an epitaxial layer on the wafer surface, the edges of the

### FIGURE 7.20

Diagrams showing various ways in which a shallow depression in a silicon surface changes in shape and position after epitaxial deposition.



feature showing after growth will usually not be directly over the boundary of the original depression. The after-epi depression is ordinarily larger (pattern distortion) than the original, and the centers of the two depressions may not lay on top of each other (pattern shift). Examples of pattern distortion and shift are shown in Fig. 7.20. These depressions generally have a depth of about 1000 Å, and the epitaxial film thickness is on the order of 10  $\mu$ m so that the step height to thickness ratio is actually much less than shown in the figure. The pattern shift is defined as the shift of the center of the pattern (C' - C in Fig. 7.20d). The relative pattern shift  $\zeta$  is (C' - C)/t, or

$$\zeta = \frac{(x_1' - x_1) + (x_2' - x_2)}{2t}$$
7.18

where t is the thickness of the epitaxial layer (70). The relative distortion  $\delta$  is given by

$$\delta = \frac{(x_2' - x_1') - (x_2 - x_1)}{t}$$
7.19

Measuring the values  $x_1$ ,  $x_2$ ,  $x_1'$ , and  $x_2'$  and thus determining  $\delta$  is straightforward assuming that the boundaries can be seen. However, under some circumstances, referred to as pattern washout, the depression fills up enough to obliterate one or more sides. In order to determine C' - C and thus  $\zeta$ , cross sectioning and staining (97) or some indirect method such as electrically measuring the resistance of a test pattern (98) or else looking for stacking faults in the epitaxial layer deliberately introduced from the buried layer doping (99) is required.

The depressions just discussed typically arise in the production of a bipolar buried-layer (subcollector) epitaxial wafer, which is a major application of silicon epitaxy. The process flow is shown in Fig. 7.21. During the buried-layer diffusion or ion implant and anneal, the oxidation rate in the window is greater than that over the rest of the wafer since there is no initial oxide in the window to slow oxidation. Thus, more silicon is converted to oxide in the window, and when the oxide is stripped away prior to adding the epitaxial layer, there is a slight depression in the silicon surface. Knowing the location of the edges of the original depression is important since the subsequent isolation mask must be aligned to the buried-layer diffused region. In order to minimize space, the isolation should be very close to the transistor being isolated and hence to the buriedlayer diffusion. However, if it actually contacts the buried layer, the breakdown voltage between the transistor collector and the p-isolation diffusion will be too low. Thus, if appreciable pattern shift and/or distortion occurs and the isolation mask is aligned to the top surface pattern without any compensation, serious misalignment and a subsequent low yield will result.

#### FIGURE 7.21

Process flow for producing buried-layer diffusions. (Note that if there is substantial pattern shift—shift in the position of the depression—the subsequent p-isolation diffusion will not register properly with the buried layer.)



### FIGURE 7.22

Method of orienting offoriented (111) slices for minimum pattern shift. (The direction of tilt corresponding to "toward the nearest (110)" is shown. The orientation of etch pits is also shown, as well as the usual pattern orientation.) The flat is shown as (110), but it, or the (101), or the (011) are all equivalent. (*Source:* Adapted from Duane O. Townley, *Solid State Technology*, p. 43, January 1973.)



Shift and distortion occur because of differences in nucleation and growth on the sides of the depression and on the large, flat areas. These differences depend on crystal orientation as well as on various growth parameters. It was recognized very early that cutting (111) slices a few degrees off the (111) plane would minimize the pattern shift (100, 101). The optimum amount is on the order of 4°, and the slice must be tilted toward the nearest (110) plane13 (102) as shown in Fig. 7.22. For this kind of off-oriented wafer and rectangular depressions oriented with two sides parallel to (110) traces and two sides parallel to (112) traces (the normal orientation) as shown in Fig. 7.22, there will be a small pattern shift and widening of the pattern in the [112] direction and noticeably more widening but no shift in the [110] and [110] directions. An example of this effect is shown in Fig. 7.23. There is pattern distortion, but no pattern shift when growing on (100) surfaces, and deliberately misoriented (100) slices are seldom used. However, when small amounts of pattern shift are a problem, it may be necessary to examine the amount of shift to be expected by virtue of the normal manufacturing orientation tolerance.

The effect of processing conditions on pattern shift and orientation has also been studied extensively (70, 103, 104). Typical values of  $\zeta$  are between 0 and  $\sim 2$ , while  $\delta$  ranges from  $\sim -1$  to  $\sim 1$ . Deposition variables known to affect  $\zeta$  and  $\delta$  are rate, temperature, pressure, layer thickness, and silicon source material. Some general observations concerning the data are as follows:

- There is little difference in the behavior of horizontal, pancake, or barrel reactors (105).
- 2. Increasing growth rate causes off-oriented (111) pattern

<sup>&</sup>lt;sup>13</sup>For a discussion of this terminology, see Appendix A.

#### FIGURE 7.23

Example of pattern distortion. (The cross is the depression, although because of the interference contrast used during photography, it may appear raised. There is also pattern shift in the [112] direction, but it can be seen only by cross sectioning the wafer. The pattern distortion is primarily in the [110] and [110] directions and shows up in this photograph as the vertical stripe being wider than the horizontal one.)

7.3.8 Selective Epitaxial Growth (SEG)



shift to increase and distortion to decrease for both (100) and off-oriented (111) (70, 103, 104).

- Increasing temperature causes pattern shift to decrease and distortion to increase for both orientations (70, 103, 104). These trends for off-oriented (111) pattern shift are shown in Fig. 7.24.
- As pressure is decreased from atmospheric to less than 100 torr, off-oriented (111) pattern shift changes from + to -, while (100) distortion changes from - to + (69, 106).
- Both pattern shift and distortion are functions of thickness, with shift increasing and relative distortion decreasing as thickness is increased (70).
- 6. Changing the silicon source gas from silane (Cl atoms/molecule = 0) to SiH<sub>2</sub>Cl<sub>2</sub> to SiHCl<sub>3</sub> to SiCl<sub>4</sub> (Cl atoms/molecule = 4) causes both  $\zeta$  and  $\delta$  to increase (70).

As has been discussed, to nucleate on an amorphous substrate is generally more difficult than to nucleate on silicon. It is thus, in principle, possible to use an amorphous layer, usually  $SiO_2$ , as a mask to prevent epitaxial deposition in selected regions of a wafer (107). Even if some polycrystalline deposition occurs over the mask, the epitaxial deposition will have been restricted to the openings in the mask, and a discrete device can be made in each of these regions (108). An alternative but seldom used approach to deposition only in the windows is to allow deposition everywhere and then either

### FIGURE 7.24

Relative pattern shift versus deposition temperature for deposition onto a (111) wafer from SiCl<sub>4</sub>. (The wafer is offoriented a nominal 4° toward the nearest (110) plane.) (*Source:* P.H. Lee et al., *J. Electrochem. Soc. 124*, p. 1824, 1977. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)

# 7.3 VAPOR PHASE SILICON EPITAXY 333



strip off the mask material and the overlaying polycrystalline layer with it (109) or else polish away the polycrystalline layer.

The driving force for the use of selective depositions was originally the desire to produce local regions with either uniform or retrograde<sup>14</sup> doping normal to the wafer surface. A major perceived application was in the formation of IC complementary transistors (110). It was also hoped that all-epitaxial transistors, allowing separate tailoring of the base, collector, and emitter, would provide better performance than diffused transistors (111, 112). Fig. 7.25a is the oldest configuration of selective epitaxy and was used to fabricate a variety of devices such as diodes and bipolar transistors that required a thick epitaxial layer. However, except for some dielectrically isolated ICs, it is not clear that any commercial devices using this configuration were ever built.

For MOS transistors and some very shallow-base highfrequency bipolar transistors, the configuration shown in Fig. 7.25b or 7.25c has been proposed (24, 113, 114). Both of these configurations allow sidewall oxide isolation without the birdbeak that occurs in LOCOS-type sidewall isolation (see "Selective Oxidation" in Chapter 3 for details of this type of process). Further, since epitaxial growth generally proceeds more rapidly than diffusion, SEG can sometimes be used in place of diffusions—for example, for CMOS wells—to reduce the time a wafer is held at elevated temperature. The configuration in Fig. 7.25b and selective polycrystalline metal

<sup>&</sup>lt;sup>47</sup>Retrograde doping means a profile that is the reverse of that obtainable by diffusion—that is, a lower impurity concentration at the surface.

### FIGURE 7.25



CVD are used to raise contact surfaces from the bottom of the oxide to the top surface (planarization). The same configuration along with expanded single-crystal growth out over the top of the oxide have been used in silicon imagers (115).

Both thermal silicon dioxide and CVD silicon nitride have been used for selective epitaxial masking of silicon growth (107, 116–118). Nucleation occurs much more easily on CVD silicon dioxide than on either silicon nitride or thermal silicon oxide. Thermal oxide allows the most processing latitude but may degrade during hightemperature (~1200°C) depositions. Therefore, silicon nitride has sometimes been used but, with the move to lower temperatures, is now seldom considered for silicon. Along with the choice of mask material, other variables are mask layout, feedstock, deposition temperature, deposition pressure, and deposition rate.

It was observed by 1963 that even when substantial nucleation occurred on the mask, there was usually a band of oxide around the silicon window where no nucleation occurred (112). An example of this effect is shown in Fig. 7.26. The bare silicon acts as a sink and

335

### FIGURE 7.26

Banding around windows in an  $SiO_2$  mask. (In the band, there is no polycrystalline silicon buildup on the oxide during selective deposition.)



keeps the concentration of any silicon-bearing species adsorbed on the oxide nearby below the critical concentration for nucleation. Thus, if the mask layout is such that there is no wider layer of oxide than the unnucleated strip just discussed, then there will be no nucleation on the oxide. This approach has the disadvantage of supplying extra nutrient to the edges of the silicon window so that a pronounced increased growth rate will occur at the edge ("rabbit ears") and a more gentle dish-shaped depression will extend to the middle (119). The additional silicon source from the oxide also has the effect of producing an overall growth rate that increases as the fraction of area covered by masking increases, as indicated in Fig. 7.27 (120). When the surface is mostly covered by bare silicon, there

### FIGURE 7.27

Trend lines showing effect of ratio of exposed silicon area to masking oxide area. (*Source:* Based on data from Earl G. Alexander and W.R. Runyan, *Trans. Metall. Soc. AIME 236*, p. 284, 1966, and H.M. Liaw et al., p. 463, in *CVD 84*, Electrochemical Society, Pennington, N.J., 1984.)

#### FIGURE 7.28

Curves showing effect of lower deposition temperature on relative rates of deposition over a whole wafer and in small areas. (In this case, the small area is a 100  $\mu$ m wide slit. Deposition is from a mixture of SiH<sub>2</sub>Cl<sub>2</sub> and HCl.) (Source: R. Pagliaro et al., J. Electrochem. Soc. 134, p. 1235, 1987. Reprinted by permission of the publisher, The Electrochemical Society, Inc.)



will, of course, be little increase in the rate from silicon species diffusing in from the surrounding oxide. However, when the oxide begins to occupy more than about 10% of the total area, and if the distance across the oxide is no more the width of the bare band just discussed, the effect becomes noticeable. Lowering the deposition temperature has the effect of both reducing surface diffusion and of moving the deposition reaction from one that is diffusion limited to one that is surface reaction limited. Thus, as the temperature is reduced, the effect of mask coverage on the deposition rate and excess growth at the edges both decrease (121). Fig. 7.28 shows the same diminishing difference between the two rates as temperature is decreased by comparing the deposition rate in a 100  $\mu$ m wide window with that of a full wafer.

Since the early silicon epitaxial work used SiCl<sub>4</sub> as feedstock, the first selective depositions also used it. As previously mentioned, silicon tetrachloride requires high deposition temperatures, and when an SiO<sub>2</sub> mask is used for selective deposition, it will be substantially eroded. Thus, there was a move first to SiHCl<sub>3</sub> and SiH<sub>4</sub> (115, 122, 123) and later to SiH<sub>2</sub>Cl<sub>2</sub> (115, 124, 125). SiCl<sub>4</sub> offers the best selectivity; silane, the least. It was observed, first for SiCl<sub>4</sub> (126) and later for the others, that the addition of HCl improves selectivity. SiHCl, has sufficient selectivity to be used without HCl; however, when silane or SiH2Cl2 is used, the addition of HCl is almost a necessity. The mechanism by which the HCl (or HBr) increases selectivity is not clear, but it does reduce the deposition rate for a given silicon species concentration and move the reaction closer to equilibrium (116). The possibility also exists that the halogen is strongly adsorbed on the mask surface and reduces the number of nucleation sites available. Grooves are sometimes observed

at the boundary between the mask and selective deposition, and it has been reported that if the angle between the mask wall and the horizontal surface is greater than about 85° (difficult to maintain with a wet etch), the grooves are minimized (124). For a fixed set of geometries and mask edge profiles, it is often possible to substantially improve planarity by relatively small variations in processing conditions (127). When equilibrium conditions are approached, faceting of the silicon islands often occurs. Such faceting can be minimized on (100) wafers by orienting the sides of the oxide mask openings along (100) traces rather than along (111) traces as is normal (125, 128). Such reorientation could not, however, be satisfactorily done merely by rotating the printing mask since that would orient the scribe lines in a manner that would make chip separation more difficult. Stacking faults and dislocations tend to originate at the masksilicon interface, and it has been reported that these defects are minimized by orienting the mask along (100) traces (129). The reason for the propensity for more defects when the mask sides are parallel with (111) traces has been ascribed both to the manner in which nucleation of each layer occurs (130) and to stresses at the oxidesilicon interface (131).

The use of reduced pressure generally improves selectivity (59). In the case of  $SiH_2Cl_2$ , without reduced pressure, no region of selectivity is found. Fig. 7.29 shows, at 25 torr, how the region of selectivity varies with HCl concentration and temperature. Note that even at this low pressure, there is no selective region without HCl.

The selectivity discussion has centered about chlorine-silicon feed material compounds, but silicon iodides can also be used for

#### FIGURE 7.29

Effect of HCl in gas stream on selectivity of silicon deposition from dichlorosilane. (*Source:* Adapted from John O. Borland and Clifford I. Drowley, *Solid State Technology*, pp. 141– 148, August 1985.)



## 7.3.9 Equipment for Silicon Vapor Phase Epitaxy

### FIGURE 7.30

Major elements of an epitaxial CVD system. (If the system operates at reduced pressure, the reactor exhaust will include a vacuum pump before the burnoff scrubber.) epitaxial deposition and, based on limited data, appear to afford greater selectivity (132).

The major elements of an epitaxial reactor system are shown in Fig. 7.30. Reactors are generally described by the mode of heating (usually radiant or induction), the operating pressure (low or atmospheric), and the deposition chamber configuration. Many names have been used to describe the chambers, including such descriptive terms as dome, waffle iron, pancake, barrel, and carousel. The reason for such a proliferation of names is that the chamber has been the subject of almost as many redesigns as there have been epitaxial investigators. The original germanium and silicon epitaxial processes used a simple one-wafer vertical flow reactor chamber as shown in Fig. 7.31a. Shortly thereafter, multiwafer horizontal flow chambers as shown in Fig. 7.31b were introduced. Currently, the pancake and barrel configurations, shown schematically in Fig. 7.32, are primarily used. Two recent introductions are the carousel (Fig. 7.32d) for increased capacity and a single-wafer horizontal flow reactor for better parameter control. Silicon CVD epitaxial equipment is now purchased from a few specialized vendors, but during the first decade after the introduction of epitaxy, most of it was constructed in-house. Thus, it is difficult to trace the developmental stages that led to the current configurations. However, Fig. 7.33 lists some of the known designs.

The gas control system is generally quite straightforward, consisting of mass flow meters to monitor and control the inert purge gas (usually  $N_2$ ),  $H_2$ , and the dopant gas. When silane or dichlorosilane is used as the silicon source, it can also be controlled by a mass flow meter, although the SiH<sub>2</sub>Cl<sub>2</sub> cylinder may need tempera-





FIGURE 7.31

reactor.

ture controlling and either external heating or cooling to maintain a satisfactory vapor pressure above the liquid. Since silicon tetrachloride and trichlorosilane are liquids at room temperature, a bubbler is normally used to produce a vapor. With bubblers, the amount of material delivered depends on the vapor pressure of the liquid and the flow of carrier gas. Vapor pressure curves for SiCl<sub>4</sub>, SiHCl<sub>3</sub>, and  $SiH_2Cl_2$  are given in Fig. 7.34. The mole fraction f of halide picked up by the gas through the bubbler is given by

$$f = \frac{p_h}{p_o}$$
 7.20

where  $p_h$  is the vapor pressure of the halide and  $p_o$  is the pressure of the gas mixture flowing through the bubbler (usually approximately atmospheric). If the stream through the bubbler is then diluted by mixing with another flow (bypass gas) before it enters the reaction chamber (as is normal), then the mole percent  $M_{\infty}$  of halide entering is given by

$$M_{\%} = \frac{100 fF}{F_b + F + Ff}$$
 7.21

# Elm Exhibit 2159, Page 356



Schematic of epitaxial reactors: (a) pancake, (b) and (c) barrel, and (d) carousel.



where  $F_b$  is the bypass gas flow rate and F is the flow rate of gas entering the bubbler. Note that since a molecular volume is the same for any gas, the mole percent equals the volume percent.

EXAMPLE

If the flow of hydrogen through the bypass is 60 liters/minute and the bubbler contains  $SiCl_4$  at 0°C, what gas flow through the bubbler is required to give 1  $M_{12}$  of  $SiCl_4$  in the total gas stream?

From Fig. 7.34,  $p_h$  is ~76 mm. Thus, f of Eq. 7.20 equals 0.1. From the problem,  $F_b = 60$ . Substituting these values into Eq. 7.21 gives F = 6.7 liters/minute.



Numerical calculations of deposition rate have largely centered on the diffusion- (transport-) limited region (51, 54, 133–139) where most atmospheric pressure reactors operate. A better understanding of the effect of flow, temperature gradients, and so on has led to the development of processes yielding better thickness uniformity.

Other equipment-related considerations are the maintenance of a low particulate count in the reactor, composition of the susceptors, and disposal of the toxic gases after they leave the reactor. Reactors are almost always operated with the outer walls cool in order to minimize deposition on them. However, where deposition does occur, the repeated temperature cycling causes the material to

#### FIGURE 7.34

Vapor pressure versus temperature for the most commonly used silicon halides. (*Source:* Union Carbide product information.)

7.3.10 Molecular Beam Epitaxy (MBE)



break up and form particulates. Reactor designs that have wafers in a near-vertical position are less susceptible to particulates because less projected area exists to catch falling particles and because thermally induced convection currents can sweep particles away that might otherwise strike the wafer surface. Susceptors are commonly of high-purity graphite that has been coated with a layer of silicon carbide. Careful attention must be given to the coating thickness uniformity in order to minimize cracking. The large volume of explosive and often toxic gaseous effluent is ordinarily first burned and then water-scrubbed before it is discharged into the air. The scrubber residue must be monitored and, if required, chemically neutralized before it is put into a drain.

Silicon molecular beam epitaxy (MBE) provides silicon atoms by evaporating silicon in a high vacuum and allowing the evaporating beam to strike the wafer. It is, in principle, like the evaporation method described in Chapter 4 for metallization. By using evaporation, no surface chemical reactions need to be considered and thus no reduction in rate as the deposition temperature is lowered. However, even though no reduction in the evaporation rate occurs as the wafer temperature decreases, the problem of reduced surface mobility exists. Thus, if single-crystal material is to be produced, the rate used must decrease with decreasing temperature in order to allow time for the atoms striking the surface at random locations to diffuse to proper positions. The same difficulty also exists at low temperatures in providing a clean surface on which to initiate epitaxy as was discussed earlier.

MBE was applied to silicon by 1962<sup>15</sup> (140). With the relatively poor vacuums used initially, the surface rapidly became contaminated, and only a poor-quality film was produced. By increasing the deposition rate so that the silicon impingement rate was large compared to that of contaminating gases in the chamber, the quality could be improved (142) but not enough to match that of CVD epitaxy in a hydrogen atmosphere. During the ensuing 25 years, vacuum systems became much better and MBE equipment much more sophisticated and expensive. However, for ordinary purposes, it still does not compete with conventional epitaxy. Nonetheless, it is capable of producing epitaxy at temperatures of a few hundred degrees and can provide for rapid changes in dopant concentration. It is also well suited for very thin epitaxial layers.

Fig. 7.35 is a schematic of a silicon molecular beam epitaxial

system (143). The source is an electron-beam-heated piece of silicon. For cases where doping is to be constant throughout the layer,

FIGURE 7.35

Schematic of silicon MBE system. (*Source:* Adapted from J.C. Bean and E.A. Sadowiski, *J. Vac. Sci. Technol.* 20, p. 137, 1982.)



<sup>15</sup>At that time, it was simply referred to as "vacuum evaporation." In 1966, a highvacuum deposition system using a silane beam decomposing on the hot silicon surface was referred to as "molecular beam epitaxy" (141). By the mid-1970s, MBE was routinely being used to describe the use of an evaporating silicon source in high vacuum.

the silicon source can be predoped. When doping is to be changed during deposition, separate doping source ovens and an ion implanter are also included. A residual gas analyzer (RGA) is attached to monitor the chamber composition. In order to increase throughput (a very-high-vacuum system usually requires overnight pumping after it has been open to the atmosphere), a loadlock station for loading and unloading wafers without breaking vacuum in the main chamber is included. To check layer crystal quality, a high-energy electron diffraction system is attached, as is an Auger electron scattering (AES) analyzer for examining surface contamination. An ion sputter gun is included in the system to clean the surface before deposition.

Most dopants have a very low probability of sticking to the growing silicon surface. However, the average residency time before desorption is sometimes hundreds of seconds. Therefore, unless special precautions are taken, abrupt changes in doping cannot be made. For example, at 700°C, antimony has an average time of over two hours. The adsorbed atoms on the surface provide the doping, and the doping level cannot be changed until their concentration is changed. When abrupt changes in doping are desired, growth can be stopped until the surface dopant adsorbed layer has re-equilibrated. When the dopant changes are accompanied by changes in growth rate and growth temperature, a more rapid response in the layer is possible (144). An alternative procedure is to incorporate a low-voltage ion implanter into the MBE equipment and implant the dopant as deposition takes place (145, 146).

Table 7.8 lists defects that may be found in silicon epitaxial layers (106). The first column lists defects that appear on the surface and are visible without any surface treatment. The second column lists defects that are more crystallographically oriented and often require etching to delineate. (A few appropriate etchants are given in Chapter 6, but for further details, see reference 97.)

| 7.3.11  | Defects    |  |
|---------|------------|--|
| Introdu | ced during |  |
| Growth  | 1          |  |

TABLE 7.8

Surface-Related Defects

Silicon Epitaxial Layer Defects

Haze Pits Orange peel Spikes Edge crown Faceting **Bulk-Related Defects** 

Dislocations Misfit dislocations Slip Stacking faults Fog (visible after heat treatment)

#### 7.3 VAPOR PHASE SILICON EPITAXY 345

Haze Generally, this high density of fine pits can be caused by oxygen in the incoming gas stream, air leaks, or moisture adsorbed on the reactor walls. It is different from the fog defect described later. As little as 1 ppm of oxygen present during the heatup cycle can cause a noticeable increase in haze. However, up to 10 ppm can be tolerated during the etch and deposition cycles. Further, 30 ppm can cause a gross degradation of surface quality (147). If a low concentration of HCl is present during the heatup and cooldown cycles, as from a leaky valve, a hazy surface will be formed.

**Pits** Widely separated individual pits probably originated during etching of the substrate and remained because enough material was not removed during etching.

**Orange Peel** This rough rippled surface can be caused by too high a growth rate. Orange peel also sometimes occurs during chemical etching of slices and, if it were present on a starting slice, would be visible after deposition.

Spikes The most common cause of spikes is particulates left on the surface. The spikes usually grow much more rapidly than the flat surface since they reach up through the boundary layer to a higher concentration of feed material. As reactor and slice cleanup procedures have improved, the spike density has declined dramatically. At one time, spikes were such a problem that spike crunchers, shavers, and etchers were used to remove them before the first contact print mask was applied. A less common cause of spikes is a localized high metal concentration on the surface leading to VLS (vaporliquid-solid) growth, which is much more rapid than normal vapor phase growth. VLS growth occurs by vapor transport of the silicon to a molten metal-silicon alloy (148). Growth takes place at the alloy-silicon interface and keeps the alloy pushed to the end of the growth such that distinctive spikes with a ball at the end form as shown in Fig. 7.36. The high metal concentrations needed for VLS growth have, for example, occurred when metallic tips were substituted for plastic tips on vacuum slice pickups.

Edge Crown This raised region at the outer periphery of the slice is due to a more rapid growth near the edge than over the rest of the wafer, which is caused by the gas flow over the edge of the slice.

Faceting Two kinds of faceting can be seen. One kind occurs on the surface of (111) slices cut very close to orientation. The other

#### FIGURE 7.36

Example of unintentional VLS growth during silicon epitaxy.



kind of facets sometimes grow at the edge of a slice where chipping occurred after polishing but before epitaxial growth.

**Dislocations** Dislocations in the starting slice that intersect the top surface will propagate on into the epitaxial layer. The incomplete removal of slice surface mechanical damage will cause additional dislocations to form in the layer.

**Misfit Dislocations** When there is a lattice mismatch, as between a lightly doped layer and a very heavily doped substrate, misfit dislocations will be generated (see Chapter 8 and Appendix A). They are also sometimes deliberately introduced by growing a thin layer of germanium-rich silicon before the desired epitaxial layer and are used for gettering (149). They lie at the interface between the two regions of differing lattice spacing and seldom intersect the surface. They do, however, often cause a very slight rippling of the surface that can be seen under interference contrast. Fig. 7.37 (top) is an etched cross section showing misfit dislocations formed by a thin layer of co-deposited Ge and Si. Fig. 7.37 (bottom) is a top view of

#### 7.3 VAPOR PHASE SILICON EPITAXY 347

a similar slice and shows the plaid surface caused by the lattice distortion from the dislocations. Remember, however, that the dislocations themselves are parallel to the surface and lie well below it.

Slip Discussed in considerable detail in Chapter 8, slip occurs during the epitaxial cycle when there is enough temperature differential over the slice to cause stresses large enough to cause plastic flow of the silicon. Rapid nonuniform heating or cooling can cause the yield stress to be exceeded, but since wafers lay on flat, highly conductive susceptors, this is usually not the cause of slip. More often, it is caused either because the wafer is initially bowed, and thus only

#### FIGURE 7.37

(Top): Cross section showing misfit dislocations between Si and a Ge–Si layer (magnified 1650X). (Bottom): Top surface of a layer above a plane of misfit dislocations as viewed with Nomarski interference microscopy (magnified).

Si epitaxial layer Plane of misfit dislocations Ge-Si epitaxial layer Plane of misfit dislocations - Si substrate

touches the susceptor in the middle,<sup>16</sup> or because the thermal differential that normally exists axially through the slice causes it to bow and lose peripheral contact with the susceptor (150–152). The axial thermal gradient can be reduced by recesses in the susceptor if heating is by RF (153–155) or by radiant heating either by itself (106) or in conjunction with an RF-heated susceptor (155). Chipped slice edges will increase the likelihood of slip. The strength of silicon increases as the interstitial oxygen content increases. Thus, slices from float-zoned crystals are particularly prone to high-temperature slip. Precipitated oxygen weakens silicon so that if, for example, the oxygen content in the center of the slice is very high and during previous heat treatments has precipitated, then slip in that region is likely.

**Stacking Faults** Described in Appendix A and shown in Fig. 7.38, stacking faults can originate at the substrate-layer interface because either crystal damage or surface contamination was not completely removed before deposition began. Those that are initiated at the substrate surface will all have grown to the same size at the top surface since their size is a linear function of the layer thickness. For (111), one side of the fault is given by t/0.816, and for (100), by t/0.707, where t is the layer thickness. When many sizes of stacking faults



<sup>&</sup>lt;sup>16</sup>If a bowed wafer is placed with the concave side down (rim touching), the wafer will initially be hotter at the periphery, and thermal expansion will cause it to flatten and make better contact. If it is initially placed so that only the center of the wafer contacts the susceptor, the center will initially be hotter, causing the wafer to bow even more away from the susceptor and further increase the radial thermal gradient.

#### FIGURE 7.38

Silicon stacking faults that nucleated at substrate-layer interfaces. (The tops of the layers were treated with Sirtl etch to delineate the faults.)

#### 7.4 VAPOR PHASE GALLIUM ARSENIDE EPITAXY 349

exist, they have been nucleated at varying times during growth, probably by particulates in the reactor falling on the slice.

Fog Also referred to as haze or S-pits, fog is not seen until a subsequent oxidation, followed by stripping the oxide and giving the silicon surface a light etch. S-pits, as discussed in Chapter 3, are caused, not by errors in epitaxial deposition, but by contamination of the slice by heavy metals. A subsequent oxidation causes precipitates to form, which, in turn, cause the S-pits. A likely source of contamination in an epitaxial reactor is moisture in the HCl line, which causes corrosion of stainless steel tubing.

Vapor phase gallium arsenide epitaxial layers are primarily grown by either halide transport (the oldest method), metalorganic decomposition (MOCVD), or molecular beam epitaxy (MBE). Conventional halide or hydride CVD requires, unlike silicon halide transport, a hot-walled reactor, with its increased chance of particulate contamination from growth on the walls. Further, it is very difficult to transport aluminum, in the event that gallium aluminum arsenide growth is also desired. These shortcomings have led to increasing emphasis on the use of organometallic compounds or MBE. Because of the increased density of gallium vacancies that occur at higher temperatures, satisfactory growth is done only below about 850°C. As for silicon, except for MBE, vapor phase in situ etching of the wafers just before epitaxy is generally used. Also as for silicon, HCl is the common etchant, and it may either be fed into the reactor (156) or be produced in the reactor from the reaction of hydrogen with AsCl<sub>3</sub> (157). Temperatures of around 900°C are required to produce specular surfaces, so the wafer must be heated above deposition temperature for the etching cycle.

By 1959, it was established that crystals of various III–V compounds, including GaAs, could be grown by iodine disproportionation in a closed tube (158). However, the first vapor phase epitaxial growth of GaAs used a chlorine disproportionation reaction (159). A GaAs source, the substrate, and a transport agent such as AlCl<sub>3</sub> were sealed in a tube. The source, which was in one end of the tube, was held in the 1000°C range, while the substrate at the other end of the tube was about 200°C cooler. As in such reactions, at the hightemperature end, the GaAs reacted with the transport gas; at the low-temperature end, GaAs deposited on the substrate. The current halide processes are similar except that they are done in a twotemperature-zoned horizontal open-tube reaction chamber as shown schematically in Fig. 7.39 (160). Typically, AsCl<sub>3</sub> from a bubbler reacts with hydrogen carrier gas in the front section of the heated

7.4 VAPOR PHASE

EPITAXY

GALLIUM ARSENIDE

7.4.1 Halide Transport

#### FIGURE 7.39

GaAs two-temperature-zone epitaxial deposition tube.

| H <sub>2</sub> |         |             |         |
|----------------|---------|-------------|---------|
|                | Gallium | GaAs wafers | Exhaust |
| $H_2 + AsCl_3$ | 850° C  | 750° C      | Exhaust |

chamber to give HCl plus molecular arsenic (As<sub>4</sub>). The HCl then reacts with either a GaAs or an arsenic-saturated gallium source held at about 850°C to give GaCl or some other Ga–Cl species. Downstream, at a lower temperature, such as 750°C, the GaCl, H<sub>2</sub>, and As<sub>4</sub> react at the GaAs slice surface to give GaAs + HCl. The remaining reactants then leave the reactor chamber where they are burned, scrubbed, and exhausted. Because of the very low vapor pressure of the gallium compounds, all parts of the chamber from the point of gallium-compound production past the deposition region must be kept heated.

The exact reactions are somewhat speculative and apparently depend on the amount of AsCl<sub>3</sub> remaining in the gas stream at the gallium or GaAs source position (161). Experimentally, it is observed that the AsCl<sub>3</sub> begins reacting with H<sub>2</sub> near 550°C and, for temperatures in excess of 750°C, appears to be completely dissociated. Instead of using AsCl, to supply both the arsenic and the transport halide, HCl can be used to supply chlorine, and arsine can be used for the arsenic source (162). Regardless of which method is used, the final species in the reactor are quite similar, and the deposition behavior is much the same. Growth rates have been extensively studied, and various models have been proposed (163-168). The deposition rate goes through a peak as temperature or GaCl partial pressure increases, but for As, it first increases and then becomes relatively insensitive to further increases in the As<sub>4</sub> partial pressure. Fig. 7.40 shows the behavior with temperature. The rate dependency on crystal orientation is much more pronounced than it is for silicon, and since the (111) and (111) planes are not equivalent in the zinc blende structure, a pronounced difference occurs in growth on the two. The maximum rate is observed on the  $(111)_{\Lambda}$ face; the minimum, on the (111)<sub>B</sub> face. The (100) face shows an intermediate rate (166).

The halide transport system just described requires a two-zone furnace and does not provide easy transport of materials such as aluminum, which is needed if GaAlAs is to be overgrown on GaAs. The use of organometallic<sup>17</sup> compounds for supplying the gallium (or

7.4.2 Metalorganic Source CVD (MOCVD)

<sup>&</sup>quot;Note that while chemistry books refer to these compounds as "organometallics." the common name used for the semiconductor process is "metalorganic."



Effect of wafer temperature on GaAs deposition rate for a chlorine transport system. (*Source:* Data from Don W. Shaw, J. Electrochem. Soc. 115, p. 406, 1968.) 7.4 VAPOR PHASE GALLIUM ARSENIDE EPITAXY 351



other group III metal) simplifies the process since several organometallic sources have sufficient vapor pressures to be vaporized in a bubbler. They can then be transported to the epitaxial reaction chamber in a carrier gas (normally  $H_2$ ) for reaction with arsine (169– 172). Thus, this process is particularly applicable to GaAs heterojunction growth applications.

 $(C_2H_3)_3$ Ga (triethylgallium) and  $(CH_3)_3$ Ga (trimethylgallium) are commonly used. The overall reaction is, for example,

 $(CH_3)_3Ga + AsH_3 \rightarrow GaAs + 3CH_4$  7.22

Unlike its effect on silicon growth, the presence of a carbon compound in this case does not prevent single-crystal GaAs growth. Group II, IV, and VI compounds are also available that can be used for shallow-level doping (172), and one compound, hexacarbonyl chromium, has been used for deep-level doping (173).

For deposition temperatures in the 600°C–850°C range, the growth rate is insensitive to temperature. As in the case of halide transport, an excess of arsenic is maintained, and the growth rate is proportional to the organometallic concentration. Crystallographic orientation effects also appear minimal. In some reactor and piping configurations, the reactants may form unwanted compounds before the hot reaction zone is reached and thus cause excessive variations in either growth rate or doping level.

7.4.3 GaAs Molecular Beam Epitaxy

Using equipment much like that discussed for silicon, GaAs layers can be grown by MBE. One difficulty is that arsenic generally evaporates as  $As_4$  and then must dissociate into atomic arsenic at the

surface before GaAs can be formed. An alternative that is sometimes used is chemical beam epitaxy. In this case, rather than using the desired elements as the beam constituents, the beam is composed of compounds that will react on the surface to give Ga and As (174). This form of epitaxy is much like the early silicon MBE work, which used a silane beam.

Two of the major uses of GaAs MBE are for the production of lattice strain fields and for the heteroepitaxial growth onto silicon. In the latter case, one major difficulty is in minimizing antiphase defects (see section 7.2.1), and another is in minimizing threading misfit dislocations that travel up into the epitaxial layer.<sup>18</sup> Both difficulties can be minimized by growing on a slightly misoriented (100) wafer (175, 176).

Surface preparation for MBE consists of first removing crystallographic damage by conventional wet etches and then vaporizing any residual Ga and As oxides in the MBE chamber. A potential disadvantage of such a procedure is that any carbon (from hydrocarbons) remaining on the wafer surface from the wet etch will not be removed in the chamber. (In this case, the presence of carbon on the surface does interfere with crystal growth.) However, it has been demonstrated that an HCL-ethanol wafer rinse in an inert atmosphere immediately before the wafer is inserted into the MBE equipment is a very effective cleanup (177).

Thin epitaxial layers can be grown from various melts in which the semiconductor to be grown is soluble. This process is known as liquid phase epitaxy (LPE). Silicon can be grown by LPE but has not been widely used; of the applications proposed, none have been IC oriented. GaAs liquid phase epitaxy is widely used, but mostly for light-emitting applications. LPE entails dissolving some of the material to be grown in a molten metal-for example, silicon in gallium or tin-inserting the substrate wafer, and then cooling the melt until the required amount of material has frozen onto the substrate surface. Growth of various materials from lower-temperature melts (fluxes) has long been used for producing various high-melting-point refractory material crystals, and it was noted in 1953 that silicon single crystals could be grown from gallium, indium, or tin melts (178, 179). The LPE growth of thin layers of GaAs onto a wafer and a method of doing it were proposed in 1963 (180), and LPE silicon was produced by 1965 (84).

7.5 LIQUID PHASE EPITAXY

<sup>&</sup>lt;sup>18</sup>Two kinds of misfit dislocation form in the GaAs layer. When the layer is (100) oriented, one is in the (100) interface plane and is quite immobile. The other can move in (110) planes and thus can travel up into the layer.

#### 7.5 LIQUID PHASE EPITAXY 353

The early silicon applications were primarily directed to those requiring very heavily doped layers on heavily doped substrates such as tunnel diodes. Suggested applications since have included ohmic contacts (181), gallium-doped infrared detectors (182), thinfilm solar cells (183), vertical-junction solar cells (184), field effect transistors (184), and field-controlled thyristors (184). Gallium arsenide (185), gallium phosphide (186), indium phosphide (187), gallium aluminum arsenide (188), indium gallium arsenide on indium phosphide (189), and indium gallium arsenic phosphide (180) have all been grown by LPE, and as mentioned earlier, such material is widely used in light-emitting diodes and other electro-optical devices. LPE has also been applied to the growth of mercury cadmium telluride for infrared applications (181). Some advantages of LPE, when it can be used, are the simplicity and economy of equipment, the rapidity with which thick layers can be grown, and the ease of growing heavily doped layers. A major problem in LPE application to GaAs ICs is a lack of thickness control when thin layers are grown.

#### 7.5.1 Silicon LPE

Silicon has enough solubility in several metals to allow it to be grown from them at rather modest temperatures. As an example, Fig. 7.41 shows silicon solubility in gallium, from which it can be



### Elm Exhibit 2159, Page 370

#### FIGURE 7.42

Solubility of silicon in various molten metals versus temperature. (Source: Plotted from data in C.D. Thurmond and M. Kowalchik, Bell Syst. Tech. J. 39, p. 169, 1960; P.H. Keck and J. Broder, Phys. Rev. 90, p. 521, 1953; and M. Hansen, Constitution of Binary Alloys, McGraw-Hill Book Co., New York, 1958.)



seen that if a gallium melt at 700°C were saturated with silicon and then reduced in temperature to 600°C, approximately 1.5 atomic percent (0.015 atomic fraction) of silicon could deposit (freeze) onto wafers immersed in the gallium. Fig. 7.42 gives solubilities for other materials that have been used as silicon solvents. The curves in this figure, rather than being plotted in the conventional phase diagram manner, are plotted as solubility versus 1/T (K) because, for small percentages dissolved, theory predicts (and it is experimentally observed in many cases) that such curves should be straight lines (192). Regardless of which solvent is used, the grown silicon layer will be saturated with it. At the lower temperatures, solubility may be reduced, and, except for tin, all of the metals shown produce p-doping. Tin is a group IV element and apparently does not affect the electrical properties of silicon. Therefore, it has been extensively studied as a silicon solvent (184, 193-195), although gallium (84, 182) and gallium aluminum (196) have also been considered. (A figure showing the solid solubility versus temperature for a variety of materials is given in Chapter 8.)

Potential advantages of silicon LPE are relatively rapid growth at temperatures in the 600°C or less range, fewer carbon-induced growth defects (197), reduced concentration of heavy metals because of limited solubility at the reduced temperature, and more abrupt junctions through reduced autodoping (195). Some difficulties are that, except for aluminum, none of the solvents listed will remove residual SiO<sub>2</sub>; it is difficult to produce thin layers; and the surfaces are often rough.

#### 7.5 LIQUID PHASE EPITAXY 355

#### FIGURE 7.43

Solubility of GaAs in molten gallium and tin versus temperature. (*Source:* Plotted from data in B.R. Pamplin, *Crystal Growth*, Pergamon Press, New York, 1980.)





Two common solvents for GaAs LPE are gallium and tin. Fig. 7.43 gives the solubility of GaAs in these two solvents. Tin is incorporated into the GaAs layer as an n-type dopant. Gallium, however, being a constituent of GaAs, does no doping, and further, near-stoichiometric GaAs can be grown from a low-temperature gallium melt. When they are grown from a GaAs melt during conventional crystal pulling, GaAs crystals have a gallium deficiency (vacancies on gallium sublattice sites).

The growth rate of LPE layers is determined either by the rate at which the solute can diffuse to the surface or by the nucleation rate on the surface. Unlike vapor phase epitaxy, chemical reaction rates are not involved. Since GaAs LPE has been so widely used, various growth rate modeling studies have been made (198–201). It appears that solute transport to the surface (arsenic when growing in gallium or both gallium and arsenic when growing in tin) is generally the limiting step. If the possibility of transport by convection is neglected, and only diffusion from the main body of solvent through a boundary layer to the surface is assumed, an equation for the growth rate R of the form

$$R \cong \frac{D(C_l - C_e)}{\delta}$$
7.23

results, where D is the diffusion coefficient of the solute in the melt,  $C_l$  is the concentration of solute—for example, arsenic—in the main body of liquid,  $C_e$  is the concentration in the liquid adjacent to the growing surface, and  $\delta$  is the width of the solute-depleted layer. The

ered stepwise once and growth is allowed to proceed isothermally. Instead of cooling the whole melt and wafer, it is also possible to pass a current between the wafer and melt and produce Peltier cooling at the wafer-melt interface. This technique, still experimental, is referred to as liquid phase electroepitaxy and has been used to grow layers of several III–V compounds (202).

LPE apparatus can be broadly broken into tipping (the earliest method) (180), rotating (203, 204, 186), direct immersion, and sliding (205), depending on the manner in which the melt is transferred to and from the wafers. These variations are shown schematically in Fig. 7.44. The sliding system has the advantage of wiping off excess solvent that would otherwise solidify on the wafer surface. The joints need only be reasonably tight to prevent leakage since the melts used have high surface tensions and the material of construction will generally be graphite and not wet by the molten metal.

Single-crystal regions can be enlarged at temperatures well below the melting point as is demonstrable by the grain growth during annealing that is often observed. In some cases, it is postulated that the driving force is just the reduction in interfacial energy that occurs when some grains enlarge and others disappear. In other cases, the internal stresses in the grains left after cold-working apparently provide the force. By stressing materials and then annealing, large metal crystals can be grown. The first intentional growth of crystals by this method was apparently in 1912 (206). More directly related to IC fabrication are the presumption that the damaged surfaces left after slice polishing regrow as single crystal in this manner and the observation that silicon amorphized by ion implant will regrow from the solid phase as single crystal. The latter is a very important feature since the ion implant doses often used produce an amorphized layer on the surface. Afterward, if the layer had to be regrown by surface melting, substantial redistribution of dopant would occur during the time the surface was molten. The rate of regrowth of silicon layers amorphized by silicon implantation has been measured as varying from  $\sim 100$  Å/s near 600°C to  $\sim 1.4 \mu$ m/s at 745°C (207). It has also been reported that amorphous silicon layers evaporated onto well-cleaned (100) silicon surfaces can be regrown by annealing at temperatures above 550°C (208). However, the use of a series of subsequent ion implants to first ensure a completely amorphous layer and then to produce ion mixing<sup>19</sup> at the substrate-layer interface appears to be more reproducible (209). The use of such a combination of low-temperature amorphous or near-amorphous deposition, followed by an implant and relatively low-temperature

7.6

EPITAXIAL CRYSTAL GROWTH FROM A SOLID

<sup>&</sup>quot;For a discussion of ion mixing, see Chapter 9.



#### 7.7 CRYSTAL GROWTH OVER INSULATING SUBSTRATES 357

boundary conditions, and hence the final form of R, will depend on whether the temperature is continually lowered or whether it is lowsolid-state epitaxial regrowth (anneal), is an alternative to direct low-temperature chemical vapor phase or molecular beam epitaxy.

7.7

CRYSTAL GROWTH OVER INSULATING SUBSTRATES\* Almost since the inception of the silicon IC, true epitaxy over singlecrystal insulating substrates such as sapphire, spinel, and quartz has been studied (211–213). In addition, germanium epitaxy on high-

'See reference (210).

resistivity (semi-insulating) GaAs (13) was considered as a material for germanium ICs. None of the silicon on foreign substrates has the perfection of homoepitaxial silicon, although silicon on sapphire (SOS) is good enough for some classes of devices to be made. Further, most other single-crystal substrates such as sapphire are quite expensive relative to silicon wafers. Thus, alternatives to heteroepitaxy have also been pursued. One of the oldest is the suggestion to use amorphous-film replicas of the steps occurring on singlecrystal surfaces for nucleation. Since nucleation begins most easily at steps, growth should begin at the replicated steps, and since the steps are all separated by integral multiples of lattice spacings, the separately nucleated regions should grow together with a minimum of defects (214). This concept has been expanded to use artificially created steps (grooves) on amorphous substrates (215, 216) and, depending on the investigators, has been called artificial epitaxy, diataxy, and graphoepitaxy. In some cases, recrystallization following deposition gives better results apparently because even if there is no ordering over the total area, enough material is oriented adjacent to the ledges to act as seeds during recrystallization (217). In a somewhat related (and largely unsuccessful) approach (rheotaxy), the film was deposited on a liquid surface that was subsequently allowed to freeze. The concept was that small single-crystal nuclei would form on the molten surface and, upon contact with one another, would have enough mobility to shift to the proper orientation for large-area single-crystal growth (218).

Several methods have also been proposed for producing singlecrystal layers on insulating films that do not depend on the initial growth of a single-crystal layer. A polycrystalline or amorphous layer is first deposited and then melted and allowed to freeze in such a manner that single-crystal regrowth occurs from a seed. The seed may be either the first part of the remelted layer that freezes or a region of single crystal initially in contact with the polycrystalline layer. The second approach has the advantage of providing a seed of known and reproducible orientation, while the first has the advantage of not requiring external seeding.

If there is no initial seed, the area melted at any time must be small, and either a spot traveling over the film in raster fashion or a line moving laterally over the film is generally used. Spot heating was originally by electron beam (219) but may also be by laser beam. When a narrow line is to be melted, strip heaters can be used (220). An alternative approach has been to restrict the deposition to a small area at any one time and to sweep it over the surface (221). A more successful nonseed method for silicon is to pattern the polycrystalline layer so that only small areas remain. During laser annealing,

#### 7.7 CRYSTAL GROWTH OVER INSULATING SUBSTRATES 359

islands of up to 2  $\mu$ m  $\times$  20  $\mu$ m have been observed to regrow as single crystals (222).

When seeds are used, they are most often exposed regions of a single-crystal wafer otherwise covered with an insulating layer as shown in Fig. 7.45 (223). Not shown in this figure is a coating often deposited over the silicon layer. Such a coating is usually desirable. for any of the melt regrowth methods, whether external seeding is used or not. Surface tension can cause the molten surface to be undulating and sometimes to ball up and loose contact with the remainder of the film unless the film is first covered with a coating such as SiO<sub>2</sub>. The technique of growth shown in Fig. 7.45, along with the one described in the next paragraph, is referred to as lateral overgrowth. Melting has been by E-beam (224), laser (223), strip heater (220), or halogen lamps (225). Scanning can be used, or the film over a whole wafer can be melted from the top, after which, if cooling is from the bottom of the wafer, the single-crystal seed areas will freeze first and seed the lateral growth over the oxide. This occurs because of the higher thermal conductivity of the semiconductor in the windows relative to the oxide over the rest of the wafer.

There are conditions for selectively growing GaAs (226) or silicon that produces some overgrowth of single-crystal material onto the mask. Typical overhangs for silicon are about equal to the thickness grown above the mask, as is shown in the cross section of Fig. 7.46. However, by changing growth conditions. larger lateral-tovertical growth ratios have been reported (227). For GaAs, the lateral spread is typically much more than for silicon. An early use for

FIGURE 7.45

Lateral overgrowth process for preparing single-crystal films over amorphous layers.

CVD polycrystalline layer added Oxide mint Ш Single-crystal silicon Polycrystalline layer melted and regrown as a single-crystal film

Exposed regions act as seeds during regrowth

#### FIGURE 7.46

Cross section of a silicon pillar selectively grown through a hole in thermal oxide. (The thermal oxide has been etched away so that the amount of lateral spreading that occurred after the hole was filled can be seen.)

SAFETY



this spreading was to increase the collection area of photodiodes (115), but more recently it has been proposed as an alternative to the seeded recrystallization discussed previously (227). In order to obtain reasonable lateral spreading distance and good crystal quality, the film thickness may be excessive, in which case it has been proposed to thin it by a series of oxidation–etch steps (228).

7 Safety is probably more of an issue in the epitaxial operation than in any other, and very careful procedures must be followed to prevent leaks in the epitaxial reactor or any of the supply lines feeding it. Outward leaks can bring highly toxic or flammable materials into the atmosphere, and inward leaks may allow explosions. The explosion and safety risks in epitaxial deposition are similar to those encountered during thin-film CVD and discussed in Chapter 4, although the gas volumes and flow rates are usually much higher in epitaxial operations. The increased volumes of explosive gases found in most large-scale production epitaxial reactors offer the potential of much more damaging explosions than might be expected of the typical LPCVD reactor. Because of the higher flows of hazardous gases encountered, more care must be taken in the design of ventilation systems and in ensuring that there are no stagnant pockets where either a poisonous or an explosive gas can collect.

KEY IDEAS 7

The original concept of epitaxy was that of oriented overgrowth of some material onto a naturally occurring crystallographic face of a mineral.

CHAPTER

The first semiconductor industry application of oriented overgrowth, the vapor phase growth of a germanium layer onto a germanium wafer, was also referred to as epitaxy.

PROBLEMS 361

- □ Growth onto a foreign substrate—for example, silicon on sapphire—is called heteroepitaxy to distinguish it from growth onto a substrate of the same material (epitaxy, isoepitaxy, autoepitaxy).
- □ Epitaxial layers can be doped by co-depositing a dopant with the semiconductor.
- □ The epitaxial process allows a layer to be overgrown onto a substrate of radically different impurity concentration.
- Silicon epitaxial growth is almost always from the vapor. GaAs epitaxy for discrete devices is sometimes from the liquid, but GaAs for ICs is from the vapor phase.
- The hydrogen reduction of SiH<sub>2</sub>Cl<sub>2</sub>, SiHCl<sub>3</sub>, and SiCl<sub>4</sub> and the thermal decomposition of SiH<sub>4</sub> are the common sources of epitaxial silicon.
- AsH<sub>3</sub> + GaCl manufactured in the deposition chamber or AsH<sub>3</sub> + gallium from the decomposition of an organometallic such as trimethylgallium is commonly used for GaAs epitaxial layers.

CHAPTER PROBLEMS 7

- 1. If  $SiCl_4 + H_2$  is to be used for etching silicon, give a set of conditions that might work, and explain your choice.
- By a series of at least three sketches, indicate where a single-crystal, round rod of silicon with the long dimension in a <110> direction first shows (110) and (211) facets, and then demonstrate that the faster-growing (110) facets (planes) will eventually disappear.
- 3. If a 1% by volume SiHCl<sub>3</sub> concentration in hydrogen is desired, what should the temperature of the bubbler be if all of the hydrogen gas goes through it? What should the temperature be if 1 liter/minute goes through the bubbler and a total of 50 liters/minute goes through the reactor.
- If a 1% by volume of SiHCl, concentration in hydrogen flows into a reactor at a rate of 50 li-

- Molecular beam epitaxy can be used for either silicon or GaAs, but it is too slow for most applications.
- In situ etching of wafer surfaces before epitaxial deposition is widely used to remove native grown surface oxide and residual polish damage.
- Because of autodoping and, to a lesser degree, because of solid-state diffusion, the doping concentration transition from the substrate to the epitaxial layer is not as abrupt as the substrate-layer transition itself.
- Any shallow depressions originally in the substrate wafer surface are usually transferred to the epitaxial layer surface, but the boundaries may be displaced laterally. The effect is referred to as pattern shift and distortion.
- By growing a layer heavily doped with an impurity with a large misfit ratio, misfit dislocations (useful for gettering) can be deliberately produced in the layer.

ters/minute and if the deposition rate is 1  $\mu$ m/ minute onto twenty 100 mm diameter wafers, what is the deposition efficiency in grams of silicon onto a wafer divided by grams of silicon in inlet gas stream?

- 5. If a 20  $\mu$ m thick silicon epitaxial layer over a buried layer is grown from SiCl<sub>4</sub> at 1125°C at a rate of 1  $\mu$ m/minute, how much and in which direction should the isolation diffusion mask be offset from the surface pattern in order to center the isolation diffusion over the buried collector diffusion?
- 6. Sketch a top view of a stacking fault occurring in a (100) silicon epitaxial layer, and calculate the thickness of the layer if the length of one side of the fault is 3  $\mu$ m. If the length of one side of

a (111) stacking fault is also 3  $\mu$ m, how thick is the layer?

7. If misfit dislocations are to be minimized when

# REFERENCES 7

- M.L. Royer, "Recherches Experimentales Sur L'epitaxie ou Orientation Mutuelle de Cristaux D'especes Differentes," *Bull. Soc. Franc. Mineral* 51, pp. 7–159, 1928.
- H. Christiansen and G.K. Teal, U.S. Patent 2,692,839, October 26, 1954 (filed April 7, 1951).
- H.C. Theuerer et al., "Epitaxial Diffused Transistor," Proc. IRE 48, pp. 1642–1643 (L), 1960.
- B.T. Murphy, "Monolithic Semiconductor Devices," U.S. Patent 3,237,026, February 22, 1966 (filed October 20, 1961).
- J.P. Hirth and G.M. Pound, *Condensation and Evaporation*, The Macmillan Co., New York, 1963.
- W.K. Burton, N. Cabrera, and F.C. Frank, "The Growth of Crystals and the Equilibrium Structure of Their Surfaces," *Phil. Trans. Royal Soc.* 243, pp. 299–358, 1951.
- Tsunenori Sakamoto and Gen Hashiguchi, "Si(001)—2×1 Single Domain Structure Obtained by High Temperature Annealing," Jap. J. Appl. Phys. 25, pp. L78–L80, 1986.
- D. Walton, "Nucleation of Vapor Deposits," J. Chem. Phys. 37, pp. 2182–2188, 1962.
- H.P. Zeindl et al., "Influence of Substrate Misorientation and Temperature on MBE-Grown Si," J. Crystal Growth 81, pp. 231–236, 1987.
- S.K. Tung, "The Effects of Substrate Orientation on Epitaxial Growth," J. Electrochem. Soc. 112, pp. 436–438, 1965.
- C.H.J. van den Brekel, "Growth Rate Anisotropy and Morphology of Autoepitaxial Silicon Films from SiCl<sub>4</sub>," J. Crystal Growth 23, pp. 259–266, 1974.
- B.A. Joyce et al., "The Influence of Substrate Surface Conditions on the Nucleation and Growth of Epitaxial Silicon Films," *Surface Science* 15, pp. 1–13, 1969.
- 13. G.O. Krause and E.C. Teague, "Observations of

growing a lightly doped silicon layer onto a heavily n-doped silicon substrate, what would be the best substrate dopant to use?

Misfit Dislocations in GaAs-Ge Heterojunctions," Appl. Phys. Lett. 10, pp. 251-253, 1967.

- Herbert Kroemer, "Polar-on-Nonpolar Epitaxy," J. Crystal Growth 81, pp. 193–204, 1987.
- References in J. Bloem, "Nucleation and Growth of Silicon by CVD," J. Crystal Growth 50, pp. 581–604, 1980.
- B.S. Meyerson and M.L. Yu, "In-Situ Doped Silicon Via LPCVD; Interactions of the Dopant/ Silane/Silicon Surface System," pp. 287–294, in *CVD 84*, Electrochemical Society, Pennington, N.J., 1984.
- A.A. Chernov and M.P. Rusaikin, "Theoretical Analysis of Equilibrium Adsorption Layers in CVD Systems (Si-H-Cl, Ga-As-H-Cl)," J. Crystal Growth 45, pp. 73-81, 1978.
- See, for example, Georg-Maria Schwab, Hugh S. Taylor, and R. Spence, *Catalysis*, D. Van Nostrand Co., New York, 1937.
- E.G. Alexander, "A Surface Reaction Approach to the Growth Kinetics of Epitaxial Silicon from SiCl<sub>4</sub>," abst. no. 105, Electrochemical Society Meeting, Spring 1967.
- John C. Bean, "Silicon Molecular Beam Epitaxy: 1984–1986," J. Crystal Growth 81, pp. 411–420, 1987.
- Wilford J. Corrigan, "Gas Etching," U.S. Patent 3,243,323, March 29, 1966 (continuation of a patent filed June 11, 1962).
- Kenneth Bean and Paul Gleim, "Vapor Etching Prior to Epitaxial Deposition of Silicon," late news paper, Electrochemical Society Meeting, Fall 1963.
- G.A. Lang and T. Stavish, "Chemical Polishing of Silicon with Anhydrous Hydrogen Chloride," *RCA Review 24*, pp. 488–499, 1963.
- 24. P. Rai-Choudhury and D.K. Schroder, "Selective Growth of Epitaxial Silicon and Gallium Arse-

REFERENCES 363

nide," J. Electrochem. Soc. 118, pp. 106-110, 1971.

- F. Jona, "Preparation of Atomically Clean Surfaces of Si and Ge by Heating in Vacuum," *Appl. Phys. Lett.* 6, pp. 205–206, 1965.
- E. Kasper and K. Worner, "Applications of Si-MBE for Integrated Circuits," pp. 429–447, in K.E. Bean and G.A. Rozgonyi, eds., VLSI Science and Technology/84, Electrochemical Society, Pennington, N.J., 1984.
- Koichi Kugimiya et al., "Si-Beam Radiation Cleaning in Mole-Beam Epitaxy," Jap. J. Appl. Phys. 24, pp. 564–567, 1985.
- Yusuke Ota, "Si Molecular Beam Epitaxy (n on n<sup>+</sup>) with Wide Range Doping Control," J. Electrochem. Soc. 124, pp. 1795–1802, 1977.
- Steve Wright and Herbert Kroemer, "Reduction of Oxides on Silicon by Heating in a Gallium Molecular Beam at 800°C," *Appl. Phys. Lett.* 36, pp. 210–211, 1980.
- E.R. Levin, J.P. Dismukes, and M.D. Coutts, "Electron Microscope Characterization of Defects on Gaseous-Etched Silicon Surfaces," J. Electrochem. Soc. 118, pp. 1171–1175, 1971.
- H.C. Theuerer, "Epitaxial Silicon Films by Hydrogen Reduction of SiCl<sub>4</sub>," J. Electrochem. Soc. 108, pp. 649–653, 1961.
- S.B. Kulkarni and A.K. Gaind, "'Hydrogen Etching' of Silicon at High Temperatures," abst. no. 546, Electrochemical Society Pub. 79–2, 1979.
- T.L. Chu, "A Low-Temperature Nonpreferential Gaseous Etchant for Silicon," J. Electrochem. Soc. 115, p. 1207, 1968.
- P. Rai-Choudhury and A.J. Noreika, "Hydrogen Sulfide as an Etchant for Silicon," J. Electrochem. Soc. 116, pp. 539–541, 1969.
- J.P. Dismukes and R. Ulmer, "Gas-Phase Etching of Silicon with Chlorine," J. Electrochem. Soc. 118, pp. 634–636, 1971.
- L.D. Dyer, HI–H, Vapor Etch for Low Temperature Silicon Epitaxial Manufacturing," *AIChE* J. 18, pp. 728–734, 1972.
- P. Rai-Choudhury, "Sulfur Hexafluoride as an Etchant for Silicon," J. Electrochem. Soc. 118, pp. 266–269, 1971.
- L.J. Stinson et al., "Sulfur Hexafluoride Etching Effects in Silicon," J. Electrochem. Soc. 123, pp. 551–555, 1976.

- P. Van Der Putte et al., "Surface Morphology of HCl Etched Silicon Wafers," J. Crystal Growth 41, pp. 133–145, 1977.
- T.L. Chu and R.L. Tallman, "Water Vapor as an Etchant for Silicon," J. Electrochem. Soc. 111, pp. 1306–1307, 1964.
- J.G. Gualtieri, M.J. Katz, and G.A. Wolff, "Gas Etching and Its Effect on Semiconductor Surfaces," Z. Fur Krist. 114, pp. 9–22, 1960.
- L.V. Gregor, P. Balk, and F.J. Campagna, "Vapor-Phase Polishing of Silicon with H<sub>2</sub>-HBr Gas Mixtures," *IBM J. Res. Develop.* 9, pp. 327– 332, 1965.
- J.C. Bean et al., "Dependence of Residual Damage on Temperature during Ar<sup>+</sup> Sputter Cleaning of Silicon," J. Appl. Phys. 48, pp. 907–913, 1977.
- John Borland et al., "Silicon Epitaxial Growth for Advanced Device Structures," Solid State Technology, pp. 111–119, January 1988.
- 45. E.A.V. Ebsworth, Volatile Silicon Compounds, The Macmillan Co., New York, 1963.
- F. Mieno et al., "Low Temperature Silicon Epitaxy Using Si<sub>2</sub>H<sub>6</sub>," J. Electrochem. Soc. 134, pp. 2320–2323, 1987.
- Harold P. Klug and Robert C. Brasted, Comprehensive Inorganic Chemistry, Vol. 7, D. Van Nostrand Co., New York, 1958.
- E. Sirtl and H. Seiter, "Vapor Deposited Microcrystalline Silicon," J. Electrochem. Soc. 113, pp. 506–508, 1966.
- Lee P. Hunt, "Thermodynamic Equilibrium in the Si-H-Cl and Si-H-Br Systems," J. Electrochem. Soc. 135, pp. 206–209 and references therein, 1988.
- J. Nishizawa and M. Saito, "Mechanism of Chemical Vapor Deposition of Silicon," J. Crystal Growth 52, pp. 213–218, 1981.
- R.F.C. Farrow, "The Kinetics of Silicon Deposition on Silicon by Pyrolysis of Silane," J. Electrochem. Soc. 121, pp. 899–907, 1974.
- 52. S.E. Bradshaw, "The Effects of Gas Pressure and Velocity on Epitaxial Silicon Deposition by the Hydrogen Reduction of Chlorosilanes," *Int. J. Electronics 23*, pp. 381–391, 1967.
- E.G. Bylander, "Kinetics of Silicon Crystal Growth from SiCl<sub>4</sub> Decomposition," J. Electrochem. Soc. 109, pp. 1171–1175, 1962.
- 54. J. Bloem, "Trends in the Chemical Vapor Depo-

sition of Silicon," pp. 180–190, in Howard R. Huff and Ronald R. Burgess, eds., *Semiconductor Silicon*/73, Electrochemical Society, Princeton, N.J., 1973.

- W.H. Shepherd, "Vapor Phase Deposition and Etching of Silicon," J. Electrochem. Soc. 112, pp. 988–994, 1962.
- J. Nishizawa et al., "Silicon Epitaxial Growth," J. Crystal Growth 17, pp. 241–248, 1972.
- 57. J.M. Charig and B.A. Joyce, "Epitaxial Growth of Silicon by Hydrogen Reduction of SiHCl<sub>3</sub> onto Silicon Substrates," J. Electrochem. Soc. 109, pp. 957–962, 1962.
- Anders Lekholm, "Epitaxial Growth of Silicon from Dichlorosilane," J. Electrochem. Soc. 119, pp. 1122–1123, 1972.
- Kohetsu Tanno et al., "Selective Silicon Epitaxy Using Reduced Pressure Techniques," Jap. J. Appl. Phys. 21, pp. L564–L566, 1982.
- B.A. Joyce and R.R. Bradley, "Epitaxial Deposition of Silicon from the Pyrolysis of Monosilane on Silicon Substrates," J. Electrochem. Soc. 110, pp. 1235–1240, 1963.
- J. Bloem and W.A.P. Claassen, "Rate-Determining Reactions and Surface Species in CVD of Silicon," J. Crystal Growth 49, pp. 435–444, 1980.
- R.B. Herring, "Silicon Epitaxy at Reduced Pressure," Applied Materials Tech. Rpt. HT-010, 1980.
- S. Mendelson, "Stacking Fault Nucleation in Epitaxial Silicon on Variously Oriented Silicon Substrates," J. Appl. Phys. 35, pp. 1570–1581, 1964.
- 64. C.H.J. van den Brekel, "Growth Rate Anisotropy and Morphology of Autoepitaxial Silicon Films from SiCl<sub>4</sub>," J. Crystal Growth 23, pp. 259–266, 1974.
- 65. J.E. Allegretti et al., pp. 255–270, in Ralph O. Grubel, ed., *Metallurgy of Elemental and Compound Semiconductors*, Interscience Publishers, New York, 1961.
- 66. J. Nishizawa et al., "Layer Growth in Silicon Epitaxy," J. Crystal Growth 13/14, pp. 297–301, 1972.
- H.E. Buckley, Crystal Growth, John Wiley & Sons, New York, 1951.
- 68. C. Holm and E. Sirtl, "Large-Area Epitaxial

Growth of Chalcogen-Doped Silicon," pp. 303– 309, in *CVD 84*, Electrochemical Society, Pennington, N.J., 1984.

- J.F. Carboy et al., "Pattern Distortion during CVD Epitaxy on (100) Silicon at Atmospheric and Reduced Pressure," pp. 434–453, in CVD 84, Electrochemical Society, Pennington, N.J., 1984.
- S.P. Weeks, "Pattern Shift and Pattern Distortion during CVD Epitaxy on (111) and (100) Silicon," *Solid State Technology*, pp. 111–117, November 1981.
- J.H. Alexander and H.F. Sterling, "Semiconductor Epitaxy–Gas Phase Doping by Electric Discharge," *Solid-State Electronics* 10, pp. 485– 490, 1967.
- B.S. Meyerson and M.L. Yu, "In-Situ Doped Silicon Via LPCVD; Interactions of the Dopant/ Silane/Silicon Surface System," pp. 287–294, in, *CVD 84*, Electrochemical Society, Pennington, N.J., 1984.
- R. Nuttall, "The Dependence on Deposition Conditions of the Dopant Concentration of Epitaxial Layers," J. Electrochem. Soc. 111, pp. 317-323, 1964.
- W.H. Shepherd, "Doping of Epitaxial Silicon," J. Electrochem. Soc. 115, pp. 541–545, 1968.
- P. Rai-Choudhury and E.I. Salkovitz, "Doping of Epitaxial Silicon," J. Crystal Growth 7, pp. 353– 360, 1970.
- J. Bloem et al., "The Incorporation of Phosphorus in Silicon Epitaxial Layer Growth," J. Electrochem. Soc. 121, pp. 1354–1357, 1974.
- L.J. Giling and J. Bloem, "The Incorporation of Phosphorus in Silicon: The Temperature Dependence of the Segregation Coefficient," *J. Crystal Growth 31*, pp. 317–322, 1975.
- H. Basseches et al., "Factors Affecting the Resistivity of Epitaxial Silicon Layers," pp. 69–86, in John B. Schroeder, ed., *Metallurgy of Semiconductor Materials*, Interscience Publishers, New York, 1962 (proc. of meeting held in 1961).
- C.O. Thomas et al., "Impurity Distribution in Epitaxial Silicon Films," J. Electrochem. Soc. 109, pp. 1055-1061, 1962.
- B.A. Joyce et al., "Impurity Redistribution Processes in Epitaxial Silicon Layers," J. Electrochem Soc. 112, pp. 1100–1106, 1965.

- W.H. Shepherd, "Autodoping of Epitaxial Silicon," J. Electrochem. Soc. 115, pp. 652–656, 1968.
- G. Skelly and A.C. Adams, "Impurity Atom Transfer during Epitaxial Deposition of Silicon," J. Electrochem. Soc. 120, pp. 116–122, 1973.
- Paul H. Langer and Joseph I. Goldstein, "Impurity Redistribution during Silicon Epitaxial Growth and Semiconductor Device Processing," J. Electrochem. Soc. 121, pp. 563–571, 1974.
- W.R. Runyan, Silicon Semiconductor Technology, McGraw-Hill Book Co., New York, 1965.
- H.B. Poage et al., in Fifth International Conference on Chemical Vapor Deposition, Electrochemical Society, Princeton, N.J., 1970.
- 86. G.R. Srinivasan, "A Flow Model for Autodoping in VLSI Substrates," J. Electrochem. Soc. 127, pp. 2305–2306, 1980.
- S.E. Meyer and D.E. Shea, "Epitaxial Deposition of Silicon Layers by Pyrolysis of Silane," J. Electrochem. Soc. 111, pp. 550–556, 1964.
- Carl O. Bozler, "Reduction of Autodoping," J. Electrochem. Soc. 122, pp. 1705–1709, 1975.
- G.R. Srinivasan, "Kinetics of Autodoping in Silicon Epitaxy," J. Electrochem. Soc. 125, pp. 146–151, 1978.
- M.W.M. Graef and B.J.H. Leunissen, "Autodoping in Silicon Epitaxy," *Electrochem. Soc. Ext. Abst.* 84-2, abst. no. 456, pp. 655-656, 1984.
- J.L. Deines and A. Spiro, "Low Pressure Silicon Epitaxy," *Electrochem. Soc. Ext. Abst.* 74-1, abst. no. 62, 1974.
- Michiharu Tabe and Hiroaki Nakamura, "Adsorbed Layer Model for Autodoping Mechanism in Silicon Epitaxial Growth," J. Electrochem. Soc. 126, pp. 822–826, 1979.
- G.R. Srinivasan, "Autodoping Effects in Silicon Epitaxy," J. Electrochem. Soc. 127, pp. 1334– 1342, 1980.
- G.K. Ackermann and E. Ebert, "Autodoping Phenomena in Epitaxial Silicon," J. Electrochem. Soc. 130, pp. 1910–1915, 1983.
- A. Reif and R.W. Dutton, "Computer Simulation in Silicon Epitaxy," J. Electrochem. Soc. 128, pp. 909–918, 1981.
- 96. Hseuh-Rong Chang, "Autodoping in Silicon Ep-

itaxy," J. Electrochem. Soc. 132, pp. 219–224, 1985.

- For lapping and staining techniques, see, for example, W.R. Runyan, Semiconductor Measurements and Instrumentation, McGraw-Hill Book Co., New York, 1975.
- Hiroyasu Kubota and Shigeo Kotani, "A New Method of Pattern Shift Measurement during Silicon Epitaxy," *Electrochem. Soc. Ext. Abst.* 84–2, abst. no. 458, pp. 659–660, 1984.
- 99. J. Wang and P. Howell, "Pattern Distortion and Shift Measurement Technique for 1 Micron Epitaxial Layers," *Electrochem. Soc. Ext. Abst.* 84-2, abst. no. 500, pp. 728-729, 1984.
- 100. Kenneth E. Bean and Walter R. Runyan, "Semiconductor Wafer with at Least One Epitaxial Layer and Methods of Making Same," U.S. Patent 3,379,584, April 23, 1968 (filed September 4, 1964).
- 101. C.E. Benjamin and E.J. Patzner, "(111) Faceting in Epitaxy Grown over Stepped Surfaces," recent news paper, Electrochemical Society Meeting, May 1965.
- 102. D.W. Boss and V.Y. Doo, "Pattern Washout and Displacement in (111) Epitaxy," *Electrochem. Soc. Electronics Div. Ext. Abst. 15-2*, abst. no. 186, pp. 59-65, 1966 (Philadelphia meeting).
- 103. C.M. Drum and C.A. Clark, "Geometric Stability of Shallow Surface Depressions during Growth of (111) and (100) Epitaxial Silicon," J. Electrochem. Soc. 115, pp. 664–669, 1968.
- 104. C.M. Drum and C.A. Clark, "Anisotropy of Macrostep Motion and Pattern Edge-Displacement during Growth of Epitaxial Silicon on Silicon near {100}," J. Electrochem. Soc. 117, pp. 1401–1405, 1970.
- 105. P.H. Lee et al., "Epitaxial Pattern Shift Comparison in Vertical, Horizontal, and Cylindrical Reactor Geometries," J. Electrochem. Soc. 124, pp. 1824–1826, 1977.
- Walter Benzing et al., Applied Materials Seminar, Dallas, 1982.
- 107. B.D. Joyce and J.A. Baldrey, "Selective Epitaxial Deposition of Silicon," *Nature 195*, pp. 485– 486, 1962.
- Jack M. Hirshon, "Silicon Epitaxial Junctions with Compatible Masking," *Electrochem. Soc.*

*Electronics Div. Ext. Abst.*, abst. no. 101, pp. 242–243, Spring 1962.

- R. Glang and E.S. Wajda, "Status of Vapor Growth in Semiconductor Technology," pp. 27– 47, in John B. Schroeder, ed., *Metallurgy of Semiconductor Materials*. Interscience Publishers, New York, 1962.
- Hiroshi Katsumura et al., "Isolation in Integrated Circuits Using Preferential Epitaxial Growth Technique," *Electronics and Communications in Japan 49*, pp. 180–186, 1966.
- 111. "Localized Epitaxial Growth for Wristwatch, Space MOS," *Electronics*, January 4, 1971.
- 112. G.L. Schnable et al., "Preferential Silicon Epitaxy with Oxide Masking," *Electrochem. Soc. Electronics Div. Ext. Abst.*, abst. no. 129, pp. 36–39, Fall 1963; expanded in *Electrochem. Technol. 4*, pp. 485–491, 1966.
- 113. Andrew F. McKelvey, "The Use of Preferential Etching and Preferential Epitaxy in Microelectronics," pp. 11-B-1 to II-B-6, in *Proc. 2nd Annual Microelectronics Symposium*, St. Louis, April 1964.
- Juliana Manoliu, "CMOS Device Isolation and Latch-Up Prevention," *Semiconductor International*, pp. 90–92 and references therein, April 1988.
- 115. Kenneth E. Bean, "Chemical Vapor Deposition Applications in Microelectronics Processing," *Thin Solid Films* 83, pp. 173–186, 1981.
- 116. Erhard Sirtl and Hartmut Seiter, "Selective Epitaxy of Silicon under Quasi-Equilibrium Conditions," pp. 189–199, in Rolf R. Haberecht and Edward L. Kern, eds., Semiconductor Silicon, Electrochemical Society, New York, 1969.
- M. Nomura, "Silicon Refill Epitaxial Growth with Si<sub>3</sub>N<sub>4</sub> Masking," abst. no. 181, Electrochemical Society Meeting, Fall 1969.
- 118. Hiroshi Ogawa et al., "The Selective Epitaxial Growth of Silicon by Using Silicon Nitride Films as a Mask." Jap. J. Appl. Phys. 10, pp. 1675– 1679, 1971.
- 119. W.G. Oldham and R. Holmstrom, "The Growth and Etching of Si through Windows in SiO<sub>2</sub>," J. Electrochem. Soc. 114, pp. 381–388, 1967.
- 120. Earl. G. Alexander and W.R. Runyan, "A Study of Factors Affecting Silicon Growth on Amorphous SiO<sub>2</sub> Surfaces," *Trans. Metall. Soc. AIME 236*, pp. 284–290, 1966.

- R. Pagliaro et al., "Uniformly Thick Selective Epitaxial Silicon," J. Electrochem. Soc. 134, pp. 1235–1238, 1987.
- D.J. Dumin, "Selective Epitaxy Using Silane and Germane," J. Crystal Growth 8, pp. 33–36, 1971.
- 123. Manfred Druminski and Roland Gessner, "Selective Etching and Epitaxial Refilling of Silicon Wells in the System SiH<sub>4</sub>/HCl/H<sub>2</sub>," J. Crystal Growth 31, pp. 312–316, 1975.
- 124. H.M. Liaw et al., "Effect of Substrate Preparation and Growth Ambient on Silicon Selective Epitaxy," pp. 463–475, in CVD 84, Electrochemical Society, Pennington, N.J., 1984.
- 125. John O. Boreland and Clifford I. Drowley, "Advanced Dielectric Isolation through Selective Epitaxial Growth Techniques," *Solid State Technology*, pp. 141–148, 1985.
- Don M. Jackson, Jr., "Advanced Epitaxial Processes for Monolithic Integrated-Circuit Applications," *Trans. Metall. Soc. AIME 233*, pp. 596–602, 1965.
- 127. Alan R. Stivers et al., "Growth Condition Dependence of SEG Planarity and Electrical Characteristics," pp. 389–395, in CVD 87, Electrochemical Society, Pennington, N.J., 1987.
- Akihiko Ishitani et al., "Facet Formation in Selective Silicon Epitaxial Growth," Jap. J. Appl. Phys. 24, pp. 1267–1269, 1985.
- Alfred C. Ipri et al., "Selective Epitaxial Growth for the Fabrication of CMOS Integrated Circuits," *IEEE Trans. on Electron Dev. ED-31*, pp. 1741–1748 and earlier references, 1984.
- 130. A. Ishitani et al., "Silicon Selective Epitaxial Growth for CMOS Technology," pp. 355–365, in *CVD* 87, Electrochemical Society, Pennington, N.J., 1987.
- L. Jasterzebeski et al., "Issues and Problems Involved in Selective Epitaxial Growth of Silicon for SOI Fabrication," pp. 334–353, in CVD 87, Electrochemical Society, Pennington, N.J., 1987.
- "Epitaxial Process Is Highly Selective in Depositing Silicon," *Electronics*, January 31, 1980.
- P.C. Rundle, "The Epitaxial Growth of Silicon in Horizontal Reactors," Int. J. Electronics 24, pp. 405–413, 1968.
- 134. F.C. Eversteyn et al., "A Stagnant Layer Model for the Epitaxial Growth of Silicon from Silane

in a Horizontal Reactor," J. Electrochem. Soc. 117, pp. 925–931, 1970.

- 135. Vladimir S. Ban and Stephen L. Gilbert, "The Chemistry and Transport Phenomena of Chemical Vapor Deposition of Silicon from SiCl<sub>4</sub>," J. Crystal Growth 31, pp. 284–289, 1975.
- 136. J.C. Gillis et al., "Fluid Mechanical Model for CVD in a Horizontal RF Reactor," pp. 21–29, in CVD 84, Electrochemical Society, Pennington, N.J., 1984.
- 137. E. Fujii et al., "A Quantitative Calculation of the Growth Rate of Epitaxial Silicon from SiCl<sub>4</sub> in a Barrel Reactor," *J. Electrochem. Soc. 119*, pp. 1106–1113, 1972.
- 138. C.W. Manke and L.F. Donaghey, "Analysis of Transport Processes in Vertical Cylinder Epitaxy Reactors," J. Electrochem. Soc. 124, pp. 561–569, 1977.
- H.A. Lord, "Convective Transport in Silicon Epitaxial Deposition in a Barrel Reactor," J. Electrochem. Soc. 134, pp. 1227–1235, 1987.
- 140. B.A. Unvala, "Epitaxial Growth of Silicon by Vacuum Evaporation," *Nature 194*, pp. 966– 967, 1962.
- 141. B.A. Joyce and R.R. Bradley, "A Study of Nucleation in Chemically Grown Epitaxial Silicon Films Using Molecular Beam Techniques. I: Experimental Methods," *Phil. Mag. Series 8, Vol.* 14, pp. 289–299, 1966.
- 142. G.R. Booker and B.A. Unvala, "Vacuum Evaporated Silicon Layers Free from Stacking Faults," *Phil. Mag. Series* 8, Vol. 8, pp. 1597– 1598, 1963.
- 143. J.C. Bean and E.A. Sadowiski, "Silicon MBE Apparatus for Uniform High-Rate Deposition on Standard Format Wafers," J. Vac. Sci. Technol. 20, pp. 137–142, 1982.
- 144. Subramanian S. Iyer et al., "Dopant Incorporation Processes in Silicon Grown by Molecular Beam Epitaxy," pp. 473–488, in K.E. Bean and G.A. Rozgonyi, eds., VSLI Science and Technology/84, Electrochemical Society, Pennington, N.J., 1984.
- 145. Yusuke Ota, "n-Type Doping Techniques in Silicon Molecular Beam Epitaxy by Simultaneous Arsenic Ion Implantation and by Antimony Evaporation," J. Electrochem. Soc. 126, pp. 1761–1765, 1979.
- 146. References in John C. Bean and Sigrid R.

McAfee, "Silicon Molecular Beam Epitaxy: A Comprehensive Bibliography 1962–82," Proc. Int. Meeting on the Relationship between Epitaxial Growth Conditions and the Properties of Semiconducting Epitaxial Layers, Perpignan, France, August 1982.

- Raymond P. Roberge et al., "Gaseous Impurity Effects in Silicon Epitaxy," Semiconductor International, pp. 77–81, 1987.
- 148. R.S. Wagner and W.C. Ellis, "Vapor-Liquid-Solid Mechanism of Single Crystal Growth," *Appl. Phys. Lett.* 4, pp. 89–90, 1964.
- 149. G.A. Rozgonyi et al., "Defect Engineering for VLSI Epitaxial Silicon," J. Crystal Growth 85, pp. 300–307, 1987.
- 150. H.R. Huff et al., "Influence of Silicon Slice Curvature on Thermally Induced Stresses," J. Electrochem. Soc. 118, pp. 143–145, 1971.
- 151. L.D. Dyer et al., "Plastic Deformation in Central Regions of Epitaxial Silicon Slices," J. Appl. Phys. 42, pp. 5680–5688, 1971.
- 152. J. Bloem and A.H. Goemans, "Slip in Silicon Epitaxy," J. Appl. Phys. 43, pp. 1281–1283, 1972.
- 153. A.H. Goemans and L.J. Van Ruyven, "Control of Slip in Horizontal Silicon Epitaxy with Profiled Susceptors," J. Crystal Growth 31, pp. 308–311, 1975.
- McD. Robinson et al., "Low Dislocation Density RF-Heated Epitaxial Silicon Depositions," *J. Electrochem. Soc.* 129, pp. 2858–2860, 1982.
- 155. J. McDiarmid et al., "Slip-Free Epitaxial Silicon Deposition in an Induction-Heated Vertical Reactor," pp. 351–355, in CVD 84, Electrochemical Society, Pennington, N.J., 1984.
- 156. Rajaram Bhat et al., "Vapor-Phase Etching and Polishing of Gallium Arsenide Using Hydrogen Chloride Gas," J. Electrochem. Soc. 122, pp. 1378–1382, 1975.
- 157. Rajaram Bhat and Sorab K. Ghandhi, "Vapor-Phase Etching and Polishing of GaAs Using Arsenic Trichloride," J. Electrochem. Soc. 124, pp. 1447–1448, 1977.
- 158. N. Holonyak et al., "Halogen Vapor Transport and Growth of Epitaxial Layers of Intermetallic Compounds and Compound Mixtures," pp. 49– 59, in John B. Schroeder, ed., Metallurgy of Semiconductor Materials, Interscience Publish-

ers, New York, 1962 (proc. of meeting held in 1961).

- 159. G.R. Antell and D. Effer, "Preparation of Crystals of InAs, InP, GaAs, and GaP by a Vapor Phase Reaction," J. Electrochem. Soc. 106, pp. 509–511, 1959.
- 160. J.R. Knight et al., "The Preparation of High Purity Gallium Arsenide by Vapor Phase Epitaxial Growth," *Solid-State Electronics* 8, pp. 178– 180, 1965.
- 161. J. Nishizawa et al., "Reaction Mechanism of GaAs Vapor-Phase Epitaxy," J. Electrochem. Soc. 133, pp. 2567–2575, 1986.
- 162. James J. Tietjen and James A. Amick, "The Preparation and Properties of Vapor Deposited Epitaxial GaAs<sub>1-x</sub>P<sub>x</sub> Using Arsine and Phosphine," J. Electrochem. Soc. 113, pp. 724–728, 1966.
- 163. Don W. Shaw, "Epitaxial GaAs Kinetic Studies: {001} Orientation," J. Electrochem. Soc. 117, pp. 683–687, 1970.
- 164. A. Boucher and L. Hollan, "Thermodynamic and Experimental Aspects of Gallium Arsenide Vapor Growth," J. Electrochem, Soc. 117, pp. 932–936, 1970.
- 165. D.J. Kirwan, "Reaction Equilibria in the Growth of GaAs and GaP by the Chloride Transport Process," J. Electrochem. Soc. 117, pp. 1572–1577, 1970.
- 166. Don W. Shaw, "Influence of Substrate Temperature on GaAs Epitaxial Deposition Rates," J. Electrochem. Soc. 115, pp. 405–408, 1968.
- 167. Don W. Shaw, "Kinetic Aspects in the Vapor Phase Epitaxy of III-V Compounds," J. Crystal Growth 31, pp. 130–141, 1975.
- 168. J.C. Hong and H.H. Lee, "Epitaxial Growth Rate of GaAs: Chloride Transport Process," J. Electrochem. Soc. 132, pp. 427–432, 1985.
- 169. H.M. Manasevit and W.I. Simpson, "The Use of Metal-Organics in the Preparation of Semiconductor Materials," J. Electrochem. Soc. 116, pp. 1725–1732, 1969.
- 170. P. Rai-Choudhury, "Epitaxial Gallium Arsenide from Trimethyl Gallium and Arsine," J. Electrochem. Soc. 116, pp. 1745–1746, 1969.
- 171. S.J. Bass, "Device Quality Epitaxial Gallium Arsenide Grown by the Metal Alkyl-Hydride Technique." J. Crystal Growth 31, pp. 172–178, 1975.
- 172. M.J. Ludowise, "Metalorganic Chemical Vapor

Deposition of III-V Semiconductors," J. Appl. Phys. 58, pp. R31-R55 and extensive list of included references, October 1985.

- 173. S.J. Bass, "Growth of Semi-Insulating Epitaxial Gallium Arsenide by Chromium Doping in the Metal-Alkyl + Hydride System," J. Crystal Growth 44, pp. 29–33, 1978.
- 174. W.T. Tsang, "Chemical Beam Epitaxy of InP and GaAs," Appl. Phys. Lett. 45, pp. 1234– 1236, 1984.
- 175. R. Fischer et al., "Growth and Properties of GaAs/AlGaAs on Nonpolar Substrates Using Molecular Beam Epitaxy," J. Appl. Phys. 58, pp. 374–383, 1985.
- N. Otsuka et al., "Study of Heteroepitaxial Interfaces by Atomic Resolution Electron Microscopy," J. Vac. Sci. Technol. B4, pp. 896–899, 1986.
- 177. R.P. Vasquez, "Cleaning Chemistry of GaAs(100) and InSb(100) Substrates for Molecular Beam Epitaxy," J. Vac. Sci. Technol. B1, pp. 791–794, 1983.
- 178. Paul H. Keck and Jacob Broder, "The Solubility of Silicon and Germanium in Gallium and Indium," *Phys. Rev.* 90, pp. 521–522, 1953.
- 179. A.J. Goss, "Crystallization of Silicon from Solution in Tin," J. of Metals 5, part 2, p. 1085, 1953.
- H. Nelson, "Epitaxial Growth from the Liquid State and Its Applications to the Fabrication of Tunnel and Laser Diodes," *RCA Review 24*, pp. 603–615, 1963.
- 181. H.J. Kim, "Liquid Phase Epitaxial Growth of Silicon in Selected Areas," J. Electrochem. Soc. 119, pp. 1394–1398, 1972.
- 182. B.E. Sumner and R.T. Foley, "Liquid-Phase Epitaxial Growth of Gallium-Doped Silicon," J. Electrochem. Soc. 125, pp. 1817–1824, 1978.
- 183. W.R. Runyan, "Melt and Solution Growth of Silicon," pp. 26–39 in Low Cost Polycrystalline Silicon Solar Cells, Southern Methodist University, Dallas, 1976.
- B. Jayant Baliga, "Silicon Liquid Phase Epitaxy—A Review," J. Electrochem. Soc. 133, pp. 5C-14C, January 1986.
- J. Crystal Growth 27, pp. 1–332, 1974. (This complete issue is devoted to liquid phase epitaxy.)
- 186. Tatsuhiko Niini, "GaP Red Light Emitting Diodes Produced by a Rotating Boat System of

REFERENCES 369

Liquid Phase Epitaxy," J. Electrochem. Soc. 124, pp. 1285–1289, 1977.

- 187. R.B. Wilson et al., "Investigation of Melt Carry-Over during Liquid Phase Epitaxy: I. Growth of Indium Phosphide," J. Electrochem. Soc. 132, pp. 172–176, 1985.
- 188. J.M. Woodall et al., "Liquid Phase Epitaxial Growth of Ga<sub>(1-0)</sub>Al<sub>\*</sub>As," J. Electrochem. Soc. 116, pp. 899–903, 1969.
- 189. Kazuo Nakajima and Jiro Okazaki, "Substrate Orientation Dependence of the In-Ga-As Phase Diagram for Liquid Phase Epitaxial Growth of In<sub>0.53</sub>Ga<sub>0.47</sub>As on InP," J. Electrochem. Soc. 132, pp. 1424–1432 and references therein, 1985.
- 190. P. Besome et al., "Investigation of Melt Carry-Over during Liquid Phase Epitaxy: II. Growth of Indium Gallium Arsenic Phosphide Double Heterostructure Material Lattice Matched to Indium Phosphide," J. Electrochem. Soc. 132, pp. 176–179, 1985.
- 191. J.G. Fleming and D.A. Stevenson, "Isothermal Liquid Phase Epitaxial Growth of Mercury Cadmium Telluride," J. Electrochem. Soc. 134, pp. 1225–1227, 1987.
- 192. C.D. Thurmond and M. Kowalchik, "Germanium and Silicon Liquidus Curves," Bell Syst. Tech. J. 39, pp. 169–204, 1960.
- 193. B. Jayant Baliga, "Kinetics of the Epitaxial Growth of Silicon from a Tin Melt," J. Electrochem, Soc. 124, pp. 1627–1631, 1977.
- 194. B. Jayant Baliga, "Isothermal Silicon Liquid Phase Epitaxy from Supersaturated Tin," J. Electrochem. Soc. 125, pp. 598-600, 1978.
- 195. B. Jayant Baliga, "Boron Autodoping during Silicon Liquid Phase Epitaxy," J. Electrochem. Soc. 128, pp. 161–165, 1981.
- 196. B. Girault et al., "Liquid Phase Epitaxy of Silicon at Very Low Temperatures," J. Crystal Growth 37, pp. 169–177, 1977.
- 197. L.A. D'Asaro et al., "Low Defect Silicon by Liquid Phase Epitaxy," pp. 233–242, in Rolf R. Haberecht and Edward L. Kern, eds., Semiconductor Silicon, Electrochemical Society, New York, 1969.
- 198. I. Crossley and M.B. Small, "Computer Simulations of Liquid Phase Epitaxy of GaAs in Ga Solutions," J. Crystal Growth 11, pp. 157–165, 1971.
- 199. R, Ghez, "An Exact Solution of Crystal Growth Rates under Conditions of Constant Cooling

Rate," J. Crystal Growth 19, p. 153, 1973.

- 200. D.L. Rode, "Isothermal Diffusion Theory of LPE: GaAs, GaP, Bubble Garnet," J. Crystal Growth 20, pp. 13-23, 1973.
- T. Bryskiewicz, "Investigation of the Mechanism and Kinetics of Growth of LPE GaAs," J. Crystal Growth 43, pp. 101–114, 1978.
- 202. T. Bryskiewicz et al., "Bulk GaAs Crystal Growth by Liquid Phase Electroepitaxy," J. Crystal Growth 82, pp. 279–288 and references therein, 1987.
- 203. J.A. Donahue and H.T. Minden, "A New Technique for Liquid Phase Epitaxy," J. Crystal Growth 7, pp. 221–226, 1970.
- 204. O.G. Lorimor et al., "High Capacity Liquid Phase Epitaxy Apparatus Utilizing Thin Melts," Solid-State Electronics 16, pp. 1289–1295, 1973.
- 205. L.R. Dawson, "Near-Equilibrium LPE Growth of GaAs-Gat-,Al,As Double Heterostructures," J. Crystal Growth 27, pp. 86–96, 1974.
- 206. H.E. Buckley, p. 93, in *Crystal Growth*, John Wiley & Sons, New York, 1956.
- 207. A. Lietoila et al., "The Rate of CW Laser Induced Solid Phase Epitaxial Regrowth of Amorphous Silicon," *Appl. Phys. Lett.* 39, pp. 810– 812, 1981.
- 208. M. von Allmen et al., "Solid-State Epitaxial Growth of Deposited Si Films," Appl. Phys. Lett. 35, p. 280, 1979.
- R.C. Cole et al., "Thin Epitaxial Silicon Regrowth Using Ion Implantation Amorphization Techniques," J. Electrochem. Soc. 135, pp. 974– 979, 1988.
- 210. For background reading on this topic, see H.W. Lam et al., Chap. 1, "Silicon-on-Insulator for VLSI and VHSIC," in VLSI Electronics: Microstructure Science 4, Academic Press, New York, 1982; and complete issue of J. Crystal Growth, October 1983.
- 211. H.M. Manasevit and W.I. Simpson, "Single-Crystal Silicon on a Sapphire Substrate," American Physical Society Meeting, August 1963; and J. Appl. Phys. 35, pp. 1349–1351, 1964.
- 212. H.M. Manasevit and D.H. Forbes, "Single-Crystal Silicon on Spinel," American Physical Society Meeting, January 1965; and H. Seiter and C. Zaminer, "Epitaktische Siliziumschichten auf Mg-Al-Spinell," Zeit. angew. Phys. 20, pp. 158–161, 1965.
- 213. B.A. Joyce et al., "Epitaxial Deposition of Sili-

con on Quartz," *Solid State Comm. 1*, pp. 107– 108, 1963; and R.W. Bicknell et al., "The Epitaxial Deposition of Silicon on Quartz," *Phil. Mag. Series 8, Vol. 9*, pp. 965–978, 1964.

- 214. J.D. Filby and S. Nielsen, "Single Crystal Films of Silicon on Insulators," *Brit. J. Appl. Phys. 18*, pp. 1357–1382, 1967.
- 215. Henry I. Smith et al., "Silicon-on-Insulator by Graphoepitaxy and Zone-Melting Recrystallization of Patterned Films," J. Crystal Growth 63, pp. 527–546, 1983.
- 216. V.I. Klykov and N.N. Sheftal, "Diataxial Growth of Silicon and Germanium," J. Crystal Growth 52, pp. 687–691, 1981.
- 217. Henry I. Smith et al., "The Mechanism of Orientation in Si Graphoepitaxy by Laser or Strip Heater Recrystallization," J. Electrochem. Soc. 130, pp. 2050–2053, 1983.
- E. Rasmanis, "Thin Film p-n Junction Silicon Devices," Semiconductor Products, pp. 30-33, July 1963.
- J. Maserjian, "Single Crystal Germanium Films by Micro Zone-Melting," Solid-State Electronics 6, pp. 477–484, 1963.
- 220. M.W. Geis et al., "Zone-Melting Recrystallization of Si Films with a Moveable-Strip-Heater Oven," J. Electrochem. Soc. 129, pp. 2812– 2818, 1982.
- 221. See H.F. Matare, "Heteroepitaxy of Silicon on Insulator Crystal Substrates," pp. 249–290, in Rolf R. Haberecht and Edward L. Kern, eds.,

Semiconductor Silicon, Electrochemical Society, New York and included references 52–54, 1969.

- 222. J.F. Gibbons et al., "CW Laser Recrystallization of <100> Silicon on Amorphous Substrates," Appl. Phys. Lett. 34, pp. 831–833, 1979.
- 223. M. Tamura et al., "Si Bridging Epitaxy from Si Windows onto SiO<sub>2</sub> by Q-Switched Ruby Laser," Jap. J. Appl. Phys. 19, pp. L23–L26, 1980.
- 224. J.R. Davis et al., "Characterization of the Dual E-Beam Technique for Recrystallizing Polysilicon Films," J. Electrochem. Soc. 132, pp. 1919– 1924, 1985.
- 225. G.K. Celler et al., "Dielectrically Isolated Thick Si Films by Lateral Epitaxy from the Melt," J. Electrochem. Soc. 132, pp. 211–219, 1985.
- 226. A.G. Lapierre et al., "Anomalous Unconstrained Crystal Growth of GaAs," pp. 301–304, in H. Steffen Peiser, ed., Crystal Growth, Pergamon Press, London, 1967.
- 227. References in L. Jastrzebski et al., "Issues and Growth of Silicon for SOI Fabrication," pp. 334–353, in CVD 87, Electrochemical Society, Pennington, N.J., 1987.
- 228. L. Jastrzebski et al., "Preparation of Thin (0.6 μm) Continuous Monocrystalline Silicon over SiO<sub>2</sub>," J. Electrochem. Soc. 132, pp. 3056–3057, 1985.



## Impurity Diffusion

### 8.1 INTRODUCTION

Thermal diffusion is a thermally activated process in which atoms or molecules in a material (solid, liquid, or gas) move from high- to low-concentration regions. The phenomenon is encountered in a number of semiconductor processes and has already been alluded to in several chapters. In silicon thermal oxidation, the oxygen travels through the growing oxide to the silicon surface by diffusion. In the CVD of thin films and in epitaxial growth, the gaseous reactants must diffuse through a relatively stagnant gas layer adjacent to the growing surface. Similarly, in liquid etching, the reacting species must generally diffuse through stagnant liquid layers. However, when diffusion is mentioned in the context of semiconductor processing, the motion of doping impurities in the semiconductor itself is generally assumed. Such diffusion will occur whenever there is an impurity gradient and the temperature is high enough to allow appreciable motion. Thus, while a high-temperature diffusion is widely used intentionally to move doping impurities in the semiconductor lattice, motion by diffusion can also take place during hightemperature processing steps not related to the diffusion step.

While appreciable diffusion in gases and liquids can occur at room temperature, the solid-state diffusion required for semiconductor device fabrication generally requires temperatures in the 1000°C range. For silicon, two primary groups of diffusing impurities are to be considered. One is the group IIIA and VA substitutional shallow-level impurities that diffuse rather slowly. The other is comprised mainly of gold and transition elements such as Cu, Ni, and Fe, which are deep-level lifetime killers and which diffuse orders of magnitude faster than the group IIIA and VA atoms. The groupings are similar for GaAs, except that the normal dopant atoms of interest are groups II and VI.

After diffusing for some time from an initially abrupt concentration step, the concentration-distance profile will have the general characteristics shown in Fig. 8.1. N(x), the impurity concentration,

#### 372 CHAPTER 8 IMPURITY DIFFUSION



Typical normalized impurity profiles.



is shown plotted both on a linear and on a semilogarithmic scale. The latter scale is more useful since several decades of concentration must generally be considered. Increasing the temperature or extending the time will cause the curve to spread out along the *x* axis. In a typical diffusion step, impurities are desired in localized areas, such as MOS sources and drains or bipolar bases and emitters. To accomplish this diffusion, a wafer is coated with a masking material (something relatively impervious to the desired diffusant) and openings are cut in the mask where diffusions are required. The wafer is then heated to the required temperature in an atmosphere containing the impurities to be diffused and held there long enough for the necessary diffusion to take place.

Ideal device impurity profiles are usually abrupt,<sup>1</sup> as shown in Fig. 8.2a for a simple bipolar transistor. One of the process engineering challenges is to use a mechanism that inherently provides a sloped profile (Fig. 8.1) to approximate the ideal abrupt profiles closely enough for acceptable device operation. Fig. 8.2b shows the general manner in which this is done. In the illustration and in most practical cases, all diffusions are from the front surface of the wafer. Thus, the wafer background doping is present in the base, along with

<sup>&#</sup>x27;Sometimes, a concentration gradient with its attendant electric field is desirable in, for example, a base region in order to decrease the carrier transit time.





doping from the base diffusion. In the emitter region, the wafer background doping and the base doping are both present in the emitter, along with the emitter doping. The profile of most interest, and one also shown in Fig. 8.2b, is not that of the individual impurities, but rather that of the net concentration  $|N_A - N_D|$ , where  $N_A$  is the concentration of acceptors and  $N_p$  is the concentration of donors. Efforts to match diffusion profiles to the desired ideal impurity profiles of various device structures have required a variety of diffusion boundary conditions that will be discussed in the following sections. Fig. 8.3 illustrates, again using a bipolar transistor as an example, how interrelated the various properties become when diffusion is used to place impurities. If it is desired to reduce the base width, diffusing the emitter in deeper is a logical approach.<sup>2</sup> However, as the sequence of parts a, b, and c shows, while the base width is reduced by 35%, the peak base impurity concentration is reduced by a factor of 10. Thus, maintaining an appropriately high Gummel number3 while producing very thin bipolar bases is a very real problem indeed.

<sup>&</sup>lt;sup>3</sup>There is also a phenomenon referred to as emitter push, which causes the base impurities to rapidly diffuse ahead of the emitter and maintain a much wider base than expected. This phenomenon will be discussed in more detail later. <sup>3</sup>The Gummel number is the net number of impurities per cm<sup>2</sup> in the base.

#### 374 CHAPTER 8 IMPURITY DIFFUSION

#### FIGURE 8.3

Effect of narrowing a diffused transistor base on the peak base concentration. (Asterisks mark the peak concentrations and show that while the base is narrowing by 35%, the peak concentration has fallen an order of magnitude.)



Diffusion can be discussed in terms of solutions to a set of diffusion equations that will predict impurity profiles such as those just shown for a wide variety of boundary conditions. It can also be discussed in terms of the atomic motion necessary to provide the observed transport. The first view is more useful in developing semiconductor processes, but the second is required in order to offer explanations of the many deviations from simple theory that are actually observed and to provide the necessary background for computer modeling of the diffusion process.

8.2 ATOMIC DIFFUSION MECHANISMS

Even though there are relatively fixed locations for each atom of a solid, the thermal vibration of the atoms will occasionally be of sufficient magnitude to allow an atom to surmount its potential barrier and move to an adjacent location. This frequency  $v_b$  is given by

$$v_h = v_e^{-E_h/k}$$

8.1

#### 8.2 ATOMIC DIFFUSION MECHANISMS 375

where  $E_b$  is the energy of the barrier, k is Boltzmann's constant, and  $\nu$  is the frequency with which the atom is vibrating in the direction of the jump.  $\nu$  is normally assumed to be the same as the Debye frequency.<sup>4</sup> The frequency with which a jump will actually occur will also depend on the availability of empty sites and the number of directions in which a given atom can jump.

In order to relate the atomic jumps to a net transfer of impurities, consider first the one-dimensional case shown in Fig. 8.4 (1). Let two adjacent atomic planes be located at  $x_1$  and  $x_2$ , and suppose that the density of diffusing atoms residing in plane 1 is  $n_1$  atoms/ cm<sup>2</sup> and in plane 2 is  $n_2$  atoms/cm<sup>2</sup>. Atoms can jump only in the +xor the -x direction and have equal probability of going in either direction. If the jump frequency is  $v_b$ , then there will be  $v_b n_1/2$  atoms/ cm<sup>2</sup> jumping to the right per unit time from  $x_1$  and  $v_b n_2/2$  atoms/cm<sup>2</sup> jumping to the left per unit time from  $x_2$ . The net flux is then given by

$$J = -\frac{\nu_h (n_1 - n_2)}{2}$$
 8.2

However, it is customary to express the concentration in terms of N atoms/cm<sup>3</sup>, not n atoms/cm<sup>2</sup>. If, as in an actual crystal lattice, all atoms are located on planes separated by a distance  $\Delta x$ , then N is given by  $n/\Delta x$  atoms/cm<sup>3</sup>. The concentration gradient  $\partial N/\partial x$  is given by  $\Delta N/\Delta x$  where  $\Delta N = N_1 - N_2 = (n_1 - n_2)/\Delta x$  and  $\Delta x$  (the jump distance) =  $x_1 - x_2$ . Substituting these values into Eq. 8.2 gives

$$I = -\left(\frac{\nu_b}{2}\right) (\Delta x)^2 \frac{\partial N}{\partial x}$$
 8.3

Defining the diffusion coefficient D as  $\Delta x^2(v_b/2)$  gives

$$I = -D\frac{\partial N}{\partial x}$$
 8.4

which is often referred to as Fick's first law of diffusion.

To relate *D* to the cubic diamond lattice constant, consider jumps from two adjacent (400) planes, for which  $\Delta x = a_0/4$  where  $a_0$ is the lattice constant.<sup>5</sup> An examination of the bond directions shows that for this choice of planes, all jumps must be to adjacent planes and none to sites within the same plane. Since there are 4 bonds per atom, there are 2 possible jumps in each direction. However, even

'For a discussion of crystallographic planes, see Appendix A.

#### FIGURE 8.4

Jumping of atoms from plane to plane. (With atoms jumping in both the -x and the +x directions with equal probability, net flow will only result when there is a difference in atomic densities between planes 1 and 2.)



<sup>\*</sup>The Debye frequency is given by  $(k/h)\Theta_p$  where h is Planck's constant and  $\Theta_p$  is the Debye temperature,  $\nu$  for silicon is approximately 10<sup>13</sup>/s.

#### 376 CHAPTER 8 IMPURITY DIFFUSION

if an atom has the energy to surmount the barrier, there is no assurance that the site will be vacant. Therefore, a probability P of a site availability must be assigned. Eq. 8.3 then becomes

$$J = -P\nu_b \left(\frac{a_0^2}{8}\right) \frac{\partial N}{\partial x}$$
 8.5

*P* may, for example, be the probability of a vacancy being adjacent to the diffusing atom and will be given by the density of vacancies divided by the number of atomic sites. The general form of *P* is  $P = P_0 e^{-E_p/kT}$ . If this expression is combined with the value of  $\nu_b$  from Eq. 8.1,

$$D = v e^{-E_b/kT} P_0 e^{-E_b/kt} = D_0 e^{-E_d/kT}$$
 8.6

where  $D_0$  is a constant and  $E_d$  is the diffusion activation energy. In practice, the *D* values used are experimentally measured quantities. To calculate them accurately from first principles requires a more elegant treatment than that used to arrive at the general form given by Eq. 8.6.

If jumps between adjacent (220) planes are considered,  $\Delta x = a_0/2\sqrt{2}$ , but only half of the jumps will be to adjacent planes. Thus, again,

$$J = -P\nu_{h}\left(\frac{a_{0}^{2}}{8}\right)\frac{\partial N}{\partial x}$$
8.7

Regardless of the set of planes chosen (equivalent to direction of diffusion), the value for D will remain the same.

If a vacancy is diffusing, it can go to any filled site. Therefore, if the number of vacancies is much less than the number of possible sites, the probability of a jump direction being toward an acceptable site is 1 and  $D_V = (a_0/8)ve^{-E/kT}$  where E is the free energy required for a vacancy to surmount its barrier.

In the case of atoms, the problem is more complex. Examples of the ways in which motion can occur are shown in Fig. 8.5 (1, 2). Part a shows a simple exchange of sites by a rotation of adjacent atoms. The energy required for this process is substantial and is not believed<sup>6</sup> to occur in either Si or GaAs. Part b shows motion by an atom moving into an empty adjacent site (a vacancy). Several dopants apparently move by this mechanism. For those that do, any part of the diffusion process that generates more than an equilibrium

Elm Exhibit 2159, Page 393

<sup>\*</sup>Even though diffusion mechanisms in semiconductors have now been studied for about 40 years, no experiments have been devised that conclusively establish the mode of motion of most of the impurities in common use.

8.2 ATOMIC DIFFUSION MECHANISMS 377



(d) Conversion from interstitial diffusion site to fixed substitutional site

number of vacancies<sup>7</sup> will enhance the jump rate. Similarly, if the vacancy density is depressed, the jump rate will decrease. Part c shows motion in which an interstitial atom kicks a substitutional atom out of place and occupies its space. The kicked-out atom then kicks out an adjacent atom, and the process repeats itself. Note that in this case the interstitial atom does not move through the lattice via interstices but only via the kick-out mechanism. If both of the atoms involved belong to the host. then self-diffusion occurs; if the kicked-out atom is a substitutional impurity, then impurity diffusion occurs. In this mechanism, no impurity atom will move until an interstitial moves adjacent to it. Thus, any process that increases interstitials over equilibrium will enhance the jump rate, and a reduction will reduce it. Some impurities ordinarily occupy substitutional sites but can move into an interstitial site, travel rapidly via the interstices, and then return to a substitutional site. This motion is shown in part d. The return can be via a vacancy (Frank-Turnbull mechanism) or via a kick-out as discussed previously. Such impur-

FIGURE 8.5

Atomic motion involved in various types of diffusion.

<sup>&</sup>lt;sup>7</sup>Both vacancies and interstitials are present in small quantities and are thermodynamically stable.

#### 378 CHAPTER 8 IMPURITY DIFFUSION

| TABLE 8.1                                   | Lattice Location and Kind of Motion                                                                                | Impurity                                   |
|---------------------------------------------|--------------------------------------------------------------------------------------------------------------------|--------------------------------------------|
| Atomic Motion in Silicon<br>Versus Impurity | Substitutional; motion by vacancy<br>Substitutional; motion by interstitialcy<br>Interstitial; interstitial motion | P, Sb, Si, Al, Ga, As<br>Si, B, P, As<br>O |
|                                             | Substitutional; interstitial motion                                                                                | Au                                         |

ities generally have a very low interstitial solubility relative to their substitutional solubility. Table 8.1 lists various mechanisms by which atoms are assumed to move in silicon. As indicated, a given atomic species may move by more than one mechanism.

In gallium arsenide, group II impurities are substitutional in the gallium sublattice, and group VI are substitutional in the arsenic sublattice. Since any substitutional atom's four nearest neighbors are on the other sublattice, simple atomic motion by moving into an adjacent vacancy is not possible, and a rotational exchange would be highly disruptive. However, a substitutional-interstitial step followed by interstitial motion would allow the longer travel distance required for an atom to reach a proper site and is the mechanism thought to be active in gallium arsenide.

Diffusion by vacancy must include terms for each of the vacancy charge states (neutrals, -, =, +) so that the diffusion coefficient is (3)

$$D_{V}^{*} = D^{*0} + D^{*-} + D^{*-} + D^{*+}$$
 8.8

where an \* indicates the value when a component is at its equilibrium value. When the doping level increases to the point where the semiconductor is no longer intrinsic at diffusion temperature, the distribution of charged vacancies will change. Thus, the relative contribution to diffusion from each of the vacancy charge states will change, and Dy becomes doping-level dependent. A further implication is that since the concentration decreases from the wafer surface inward,  $D_v$  will change with position. The D value will vary linearly with the number of vacancies, and the variation of vacancy concentration with doping level will obey the mass action law:

 $V^0 + h \rightarrow V^+, V^0 + e \rightarrow V^-, \ldots$ 

so that

$$K_e = \frac{N_V^+}{pN_V^0} \cdot \cdot$$

8.2.1 Diffusion by Vacancies

#### 8.2 ATOMIC DIFFUSION MECHANISMS 379

where  $V^0$  is a neutral vacancy,  $V^+$  is a vacancy with a single plus charge,  $V^-$  is a vacancy with a single minus charge,  $N_V$  is the concentration of vacancies, h is a hole, p is the hole concentration, e is an electron, and  $K_e$  is the mass action equilibrium constant.  $K_e$  may be evaluated under intrinsic conditions when  $p = n = n_i$  and  $N_V^+ = N_{Vi}^+$ ; n is the electron concentration:

$$K_{e} = \frac{N_{Vi}^{+}}{n_{i}N_{V}^{0}} = \frac{N_{V}^{+}}{pN_{V}^{0}}$$

Substituting  $p = n_i^2/n$  gives

$$\frac{N_{Vi}^+}{N_V^+} = \frac{n}{n_i}$$

Performing similar arithmetic on the other terms gives

$$D_V = D^{*0} + \left(\frac{n}{n_i}\right) D^{*-} + \left(\frac{n}{n_i}\right)^2 D^{*-} + \left(\frac{n_i}{n}\right) D^{*+} \qquad 8.9$$

The expression for  $D_v$  just derived is in terms of electron concentration but, for p-type diffusions, could have been derived with hole concentration used instead. Remember that, for intrinsic conditions,  $n = n_i$ , and thus there is no reduction in the diffusion constant for low carrier concentrations. Depending on the experimental conditions, different terms in Eq. 8.9 will dominate. Therefore, it is necessary to know the values of all of the  $D^*$ 's in order to adequately describe a material that diffuses in this manner.

Apparently, interstitialcy diffusion is never observed by itself, and most interstitials are neutral so that their concentration will be unaffected by doping level. They are, however, produced in nonequilibrium numbers by the thermal oxidation of silicon, and hence any diffusion that proceeds partially by interstitialcy will be enhanced near an oxidizing surface. When an impurity diffuses by both vacancy and interstitialcy, the total diffusivity  $D^*$  is the sum of the two components. That is,

$$D^* = D_V^* + D_V^*$$

and the fractional vacancy diffusivities are

$$f_V = \frac{D_V^*}{D^*}$$

and

$$f_l = \frac{D_l^*}{D^*}$$

Interstitialcy

8.2.2 Diffusion by

#### 380 CHAPTER 8 IMPURITY DIFFUSION

When the concentration  $N_v$  or  $N_l$  is changed from its equilibrium value, each of the fractional diffusivities is assumed to change by the same amount so that (4)

$$D = D^* \left( \frac{f_V N_V}{N_V^*} + \frac{f_I N_I}{N_I^*} \right)$$
 8.10

Some materials, such as gold in silicon, have a very low interstitial solubility and a reasonably high substitutional solubility. However, they have a high interstitial diffusivity and a low substitutional diffusivity. Hence, appreciable motion can occur only by moving from a substitutional to an interstitial site, diffusing, and then becoming substitutional again. For the general case, with combined vacancy and interstitial motion, the effective diffusion coefficient  $D_{\text{eff}}$  is given by

$$D_{\rm eff} = \frac{D_I N_I}{N_I + N_S} + \frac{D_S N_S}{N_I + N_S}$$
 8.11

where the subscripts I and S refer to the interstitial and substitutional components, respectively. When  $D_I >> D_S$  and  $N_S >> N_I$ and there are no limitations by either vacancies or self-interstitials,

$$D_{\rm eff} \equiv D_1 \approx \frac{D_I N_I}{N_{\rm s}}$$
8.12

When the Frank-Turnbull interstitial-vacancy reaction occurs,

Interstitials + Vacancies 
$$\rightarrow$$
 Substitutionals 8.13

and a restricted source of vacancies—for example, dislocation-free material—may limit diffusion. As an example, consider that before an interstitial–substitutional conversion can occur, a vacancy has to diffuse to the site. Further, suppose that  $D_t$  is much greater that  $D_v$  where  $D_v$  is the vacancy diffusivity.  $D_{eff}$  then becomes

$$D_{\rm eff} = \frac{D_V N_V}{N_V + N_S} + \frac{D_S N_S}{N_V + N_S}$$
 8.14a

When  $N_V$  is much less than  $N_S$  and  $D_V$  is less than  $D_S$ ,

$$D_{\rm eff} \equiv D_2 \cong \frac{D_V N_V}{N_S}$$
 8.14b

If the kick-out mechanism is dominant,

Interstitials → Substitutionals + Silicon Interstitials 8.15

8.2.3 Interstitial Diffusion

#### 8.2 ATOMIC DIFFUSION MECHANISMS 381

and a buildup of silicon interstitials can be the limiting factor. In this case, the diffusion profile will be determined by silicon selfdiffusivity. Early studies of gold diffusion in silicon indicated that the vacancy mechanism played the dominant role (5), but later work has favored Eq. 8.15 as being more likely (6, 7).

As another example, consider the specific case of diffusion of zinc in gallium arsenide, which is assumed to be covered by the conditions of Eq. 8.12 (8). Zinc will move from an interstitial position to a gallium site vacancy. Substitutional zinc has a negative charge; interstitial zinc, a positive charge; and the gallium vacancy is neutral. The concentration of gallium vacancies is assumed to be independent of the doping level  $N_s$ . Thus, in equilibrium,

$$[Zn]_{s}^{-} = [Zn]_{l}^{+} + 2e \text{ or } N_{s} = N_{l} + 2n$$

where  $N_s$  is the concentration of substitutional zinc and  $N_i$  is the concentration of interstitial zinc. The hole density p arises from the negatively charged substitutional zinc ions and equals  $N_s$ . n can be calculated from  $pn = n_i^2$ . Then, by applying the mass action law,

$$K = \frac{n^2 [\text{Zn}]_I}{[\text{Zn}]_S}$$

where K is an equilibrium constant. Since  $n = n_i^2/p = n_i^2 N_s$ ,

$$K = \frac{N_I}{N_S^3} \quad \text{or} \quad N_I = K N_S^3$$

Substituting this value into Eq. 8.12 gives  $D_1 = KD_I N_3^2$ ; that is, the diffusion coefficient depends on the concentration of substitutional zinc. Expressed somewhat differently,

$$D_1(N_{S2}) = D_1(N_{S1}) \times \left(\frac{N_{S2}}{N_{S1}}\right)^2$$

where the subscripts 1 and 2 denote two different values of  $N_s$ .

If an electric field is applied to the semiconductor, a velocity v will be imparted to any mobile ions present. When the field is parallel to the direction of diffusion, Eq. 8.4, Fick's first law, becomes

$$J = -D\left(\frac{\partial N}{\partial x}\right) + vN \qquad 8.16$$

v is given by  $\mu E$  where  $\mu$  is the ionic mobility and E is the electric field. D and  $\mu$  are related as  $D/\mu = kT/qZ$ . Thus, Eq. 8.16 becomes

8.2.4 Electric-Field-Aided Diffusion 382 CHAPTER 8 IMPURITY DIFFUSION

$$J = -D\left(\frac{\partial N}{\partial x}\right) + \frac{qZEN}{kT}$$
8.17

where qZ is the ionic charge.<sup>8</sup> Some ions, such as lithium, have a high enough mobility that they can be moved at room temperature (drifted) by the field developed in a space charge region. At the high temperatures required for most diffusing species, a small electric field can be produced by the IR drop of an electric current flow. An internal field will also exist due to the diffusion gradient itself if the concentration is heavy enough for the semiconductor to be extrinsic at diffusion temperature (9). This field is given by

$$E = -\left(\frac{kT}{q}\right)\left(\frac{1}{n}\right)\left(\frac{dn}{dx}\right) = -\left(\frac{kT}{q}\right)\left(\frac{1}{n}\right)\left(\frac{dn}{dN}\right)\left(\frac{\partial N}{\partial x}\right) \qquad 8.18$$

where *n* is the free-carrier concentration. Only when  $N >> n_i$  does n = N, and this condition is often not met during diffusion. For the general case,

$$n = \left(\frac{N^2}{4} + n_i^2\right)^{1/2} + \frac{N}{2}$$
 8.19

A combination of Eqs. 8.16-8.19 gives9

$$J = -gD\frac{\partial N}{\partial x}$$

$$8.20$$

where g is the enhancement factor due the field:

$$g = 1 + \frac{ZN}{(N^2 + 4n_1^2)^{1/2}}$$
8.21

From Eq. 8.21, it can be seen that the maximum internal field enhancement for a singly charged ion occurs when N is considerably greater than  $n_i$  and is 2x. If a p-dopant is being diffused into a region where a gradient due to an n-diffusion already exists, then a retardation rather than an enhancement is to be expected (10) and can cause a dip in base profiles that will be discussed later.

<sup>\*</sup>Remember that some species diffuse as neutrals and thus will not be affected by the field. This could occur either because the atom is un-ionized or because it diffuses as a neutral complex.

<sup>&</sup>quot;For refinements to this simple derivation, see reference 10.