Generative Deep Learning (Record no. 359829)
[ view plain ]
| 000 -LEADER | |
|---|---|
| fixed length control field | 09347 a2200181 4500 |
| 005 - DATE AND TIME OF LATEST TRANSACTION | |
| control field | 20241018161810.0 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
| fixed length control field | 241018b |||||||| |||| 00| 0 eng d |
| 020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
| International Standard Book Number | 9789355429988 |
| 041 ## - LANGUAGE CODE | |
| Language code of text/sound track or separate title | English |
| 100 ## - MAIN ENTRY--PERSONAL NAME | |
| Author | Foster D. |
| 245 ## - TITLE STATEMENT | |
| Title | Generative Deep Learning |
| Remainder of title | :Teaching Machines To Paint, Write, Compose And Play |
| 250 ## - EDITION STATEMENT | |
| Edition statement | 2nd |
| 260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
| Name of publisher, distributor, etc. | SPD |
| Date of publication, distribution, etc. | 2023 |
| 300 ## - PHYSICAL DESCRIPTION | |
| Extent | 426 |
| 520 ## - SUMMARY, ETC. | |
| Summary, etc. | Table of Contents<br/>Foreword. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv<br/>Preface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii<br/>Part I. Introduction to Generative Deep Learning<br/>1. Generative Modeling. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3<br/>What Is Generative Modeling? 4<br/>Generative Versus Discriminative Modeling 5<br/>The Rise of Generative Modeling 6<br/>Generative Modeling and AI 8<br/>Our First Generative Model 9<br/>Hello World! 9<br/>The Generative Modeling Framework 10<br/>Representation Learning 12<br/>Core Probability Theory 15<br/>Generative Model Taxonomy 18<br/>The Generative Deep Learning Codebase 20<br/>Cloning the Repository 20<br/>Using Docker 21<br/>Running on a GPU 21<br/>Summary 21<br/>2. Deep Learning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23<br/>Data for Deep Learning 24<br/>Deep Neural Networks 25<br/>vii<br/>What Is a Neural Network? 25<br/>Learning High-Level Features 26<br/>TensorFlow and Keras 27<br/>Multilayer Perceptron (MLP) 28<br/>Preparing the Data 28<br/>Building the Model 30<br/>Compiling the Model 35<br/>Training the Model 37<br/>Evaluating the Model 38<br/>Convolutional Neural Network (CNN) 40<br/>Convolutional Layers 41<br/>Batch Normalization 46<br/>Dropout 49<br/>Building the CNN 51<br/>Training and Evaluating the CNN 53<br/>Summary 54<br/>Part II. Methods<br/>3. Variational Autoencoders. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59<br/>Introduction 60<br/>Autoencoders 61<br/>The Fashion-MNIST Dataset 62<br/>The Autoencoder Architecture 63<br/>The Encoder 64<br/>The Decoder 65<br/>Joining the Encoder to the Decoder 67<br/>Reconstructing Images 69<br/>Visualizing the Latent Space 70<br/>Generating New Images 71<br/>Variational Autoencoders 74<br/>The Encoder 75<br/>The Loss Function 80<br/>Training the Variational Autoencoder 82<br/>Analysis of the Variational Autoencoder 84<br/>Exploring the Latent Space 85<br/>The CelebA Dataset 85<br/>Training the Variational Autoencoder 87<br/>Analysis of the Variational Autoencoder 89<br/>Generating New Faces 90<br/>viii | Table of Contents<br/>Latent Space Arithmetic 91<br/>Morphing Between Faces 92<br/>Summary 93<br/>4. Generative Adversarial Networks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95<br/>Introduction 96<br/>Deep Convolutional GAN (DCGAN) 97<br/>The Bricks Dataset 98<br/>The Discriminator 99<br/>The Generator 101<br/>Training the DCGAN 104<br/>Analysis of the DCGAN 109<br/>GAN Training: Tips and Tricks 110<br/>Wasserstein GAN with Gradient Penalty (WGAN-GP) 113<br/>Wasserstein Loss 114<br/>The Lipschitz Constraint 115<br/>Enforcing the Lipschitz Constraint 116<br/>The Gradient Penalty Loss 117<br/>Training the WGAN-GP 119<br/>Analysis of the WGAN-GP 121<br/>Conditional GAN (CGAN) 122<br/>CGAN Architecture 123<br/>Training the CGAN 124<br/>Analysis of the CGAN 126<br/>Summary 127<br/>5. Autoregressive Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129<br/>Introduction 130<br/>Long Short-Term Memory Network (LSTM) 131<br/>The Recipes Dataset 132<br/>Working with Text Data 133<br/>Tokenization 134<br/>Creating the Training Set 137<br/>The LSTM Architecture 138<br/>The Embedding Layer 138<br/>The LSTM Layer 140<br/>The LSTM Cell 142<br/>Training the LSTM 144<br/>Analysis of the LSTM 146<br/>Recurrent Neural Network (RNN) Extensions 149<br/>Stacked Recurrent Networks 149<br/>Table of Contents | ix<br/>Gated Recurrent Units 151<br/>Bidirectional Cells 153<br/>PixelCNN 153<br/>Masked Convolutional Layers 154<br/>Residual Blocks 156<br/>Training the PixelCNN 158<br/>Analysis of the PixelCNN 159<br/>Mixture Distributions 162<br/>Summary 164<br/>6. Normalizing Flow Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167<br/>Introduction 168<br/>Normalizing Flows 169<br/>Change of Variables 170<br/>The Jacobian Determinant 172<br/>The Change of Variables Equation 173<br/>RealNVP 174<br/>The Two Moons Dataset 174<br/>Coupling Layers 175<br/>Training the RealNVP Model 181<br/>Analysis of the RealNVP Model 184<br/>Other Normalizing Flow Models 186<br/>GLOW 186<br/>FFJORD 187<br/>Summary 188<br/>7. Energy-Based Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189<br/>Introduction 189<br/>Energy-Based Models 191<br/>The MNIST Dataset 192<br/>The Energy Function 193<br/>Sampling Using Langevin Dynamics 194<br/>Training with Contrastive Divergence 197<br/>Analysis of the Energy-Based Model 201<br/>Other Energy-Based Models 202<br/>Summary 203<br/>8. Diusion Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205<br/>Introduction 206<br/>Denoising Diffusion Models (DDM) 208<br/>The Flowers Dataset 208<br/>x | Table of Contents<br/>The Forward Diffusion Process 209<br/>The Reparameterization Trick 210<br/>Diffusion Schedules 211<br/>The Reverse Diffusion Process 214<br/>The U-Net Denoising Model 217<br/>Training the Diffusion Model 224<br/>Sampling from the Denoising Diffusion Model 225<br/>Analysis of the Diffusion Model 228<br/>Summary 231<br/>Part III. Applications<br/>9. Transformers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235<br/>Introduction 236<br/>GPT 236<br/>The Wine Reviews Dataset 237<br/>Attention 238<br/>Queries, Keys, and Values 239<br/>Multihead Attention 241<br/>Causal Masking 242<br/>The Transformer Block 245<br/>Positional Encoding 248<br/>Training GPT 250<br/>Analysis of GPT 252<br/>Other Transformers 255<br/>T5 256<br/>GPT-3 and GPT-4 259<br/>ChatGPT 260<br/>Summary 264<br/>10. Advanced GANs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267<br/>Introduction 268<br/>ProGAN 269<br/>Progressive Training 269<br/>Outputs 276<br/>StyleGAN 277<br/>The Mapping Network 278<br/>The Synthesis Network 279<br/>Outputs from StyleGAN 280<br/>StyleGAN2 281<br/>Table of Contents | xi<br/>Weight Modulation and Demodulation 282<br/>Path Length Regularization 283<br/>No Progressive Growing 284<br/>Outputs from StyleGAN2 286<br/>Other Important GANs 286<br/>Self-Attention GAN (SAGAN) 286<br/>BigGAN 288<br/>VQ-GAN 289<br/>ViT VQ-GAN 292<br/>Summary 294<br/>11. Music Generation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297<br/>Introduction 298<br/>Transformers for Music Generation 299<br/>The Bach Cello Suite Dataset 300<br/>Parsing MIDI Files 300<br/>Tokenization 303<br/>Creating the Training Set 304<br/>Sine Position Encoding 305<br/>Multiple Inputs and Outputs 307<br/>Analysis of the Music-Generating Transformer 309<br/>Tokenization of Polyphonic Music 313<br/>MuseGAN 317<br/>The Bach Chorale Dataset 317<br/>The MuseGAN Generator 320<br/>The MuseGAN Critic 326<br/>Analysis of the MuseGAN 327<br/>Summary 329<br/>12. World Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 331<br/>Introduction 331<br/>Reinforcement Learning 332<br/>The CarRacing Environment 334<br/>World Model Overview 336<br/>Architecture 336<br/>Training 338<br/>Collecting Random Rollout Data 339<br/>Training the VAE 341<br/>The VAE Architecture 341<br/>Exploring the VAE 343<br/>Collecting Data to Train the MDN-RNN 346<br/>xii | Table of Contents<br/>Training the MDN-RNN 346<br/>The MDN-RNN Architecture 347<br/>Sampling from the MDN-RNN 348<br/>Training the Controller 348<br/>The Controller Architecture 349<br/>CMA-ES 349<br/>Parallelizing CMA-ES 351<br/>In-Dream Training 353<br/>Summary 356<br/>13. Multimodal Models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 359<br/>Introduction 360<br/>DALL.E 2 361<br/>Architecture 362<br/>The Text Encoder 362<br/>CLIP 362<br/>The Prior 367<br/>The Decoder 369<br/>Examples from DALL.E 2 373<br/>Imagen 377<br/>Architecture 377<br/>DrawBench 378<br/>Examples from Imagen 379<br/>Stable Diffusion 380<br/>Architecture 380<br/>Examples from Stable Diffusion 381<br/>Flamingo 381<br/>Architecture 382<br/>The Vision Encoder 382<br/>The Perceiver Resampler 383<br/>The Language Model 385<br/>Examples from Flamingo 388<br/>Summary 389<br/>14. Conclusion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 391<br/>Timeline of Generative AI 392<br/>2014–2017: The VAE and GAN Era 394<br/>2018–2019: The Transformer Era 394<br/>2020–2022: The Big Model Era 395<br/>The Current State of Generative AI 396<br/>Large Language Models 396<br/>Table of Contents | xiii<br/>Text-to-Code Models 400<br/>Text-to-Image Models 402<br/>Other Applications 405<br/>The Future of Generative AI 407<br/>Generative AI in Everyday Life 407<br/>Generative AI in the Workplace 409<br/>Generative AI in Education 410<br/>Generative AI Ethics and Challenges 411<br/>Final Thoughts 413<br/>Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 417<br/>xiv | Table of Contents |
| 700 ## - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Friston K. |
| 942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
| Koha item type | Books |
| Withdrawn status | Lost status | Source of classification or shelving scheme | Damaged status | Not for loan | Home library | Current library | Date acquired | Vendor | Net Price | Total Checkouts | Full call number | Barcode | Date last seen | Actual Price | Bill Date | Koha item type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dewey Decimal Classification | Cummins College of Engineering for Women Pune | Cummins College of Engineering for Women Pune | 14/10/2024 | 115 | 1350.00 | 006.32 FOS | CCEP-BK-67505 | 14/10/2024 | 1800.00 | 14/10/2024 | Books |