Using various converter softwares, I turned WEBM videos into JPG images. The initial software I used successfully turned the video into 100 photos, but the problem was that it couldn’t resize the photos. I tried the NCH Photo Resize software instead, but it didn’t successfully output consistent dimensions. Then, I used BatchPhoto, which successfully resized, but requires a subscription to get rid of the watermark. Since BatchPhoto works with Windows, I sent the sample photos from my Windows desktop to my Mac laptop (where I will train the model) so I can see that it works before I purchase. The final, working solution was to use VLC following this tutorial: https://www.raymond.cc/blog/extract-video-frames-to-images-using-vlc-media-player/. I retrieved 551 images from one person’s video. The next step is to figure out a consistent naming convention for student images vs. non student images. A possibility is “s123450”.
I also read:
- Chapter 12 of Hands-On Machine Learning (Using TensorFlow like NumPy)
- Similarities in TensorFlow and NumPy tenro operations (ex. math)
- You can apply TensorFlow operations to NumPy and vice versa
- tf.Variable is used when you want to be able to modify tf.Tensor
- Custom Loss Functions
- Train a regression model but the training set is noisy
- Remove outliers but dataset is still noisy
- Use Huber loss
- Train a regression model but the training set is noisy
- Saving models with custom components
- Need to map the names to the objects
- Losses are used to train the model. Metrics are used to evaluate the accuracy of the model (need to be easier to interpret)
- Keep track of the number of true positives and false positives → keras.metrics.Precision