/r/MachinesLearn
This is a subreddit for machine learning professionals. We share content on practical artificial intelligence: machine learning tutorials, DIY projects, educational videos, new tools, demos, papers, and everything else that can help a machine learning practitioner build modern AI systems. r/MachinesLearn is a machine learning community you'll enjoy belonging to.
Baidu researchers propose ERNIE-ViLG, a 10-billion-parameter pretraining framework for bidirectional text-image generation. Pretrained on 145 million (Chinese) image-text pairs, ERNIE-ViLG achieves state-of-the-art performance on both text-to-image and image-to-text generation tasks.
Here is a quick read: Baidu’s 10-Billion Scale ERNIE-ViLG Unified Generative Pretraining Framework Achieves SOTA Performance on Bidirectional Vision-Language Generation Tasks.
The paper ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation is on arXiv.
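For intuition only, here is a toy sketch of the unified-sequence idea behind bidirectional generation (not the authors' code): both directions are cast as autoregressive generation over a shared vocabulary, assuming text tokens from a standard tokenizer and discrete image tokens from a VQ-VAE-style image tokenizer. All token IDs and helper names below are made up.

```python
# Toy illustration of casting both generation directions as one
# autoregressive sequence task (hypothetical token IDs and helpers).

TEXT_VOCAB_SIZE = 30_000        # assumed text tokenizer vocabulary
IMAGE_CODEBOOK_SIZE = 8_192     # assumed VQ-VAE-style image codebook
BOS, SEP, EOS = 0, 1, 2         # hypothetical special tokens

def text_token_to_id(tok: int) -> int:
    return 3 + tok

def image_token_to_id(code: int) -> int:
    """Map a discrete image code into the shared vocabulary after the text range."""
    return 3 + TEXT_VOCAB_SIZE + code

def build_sequence(text_tokens, image_codes, direction: str):
    """Lay out one training sequence; the model learns to predict the
    target segment (everything after SEP) autoregressively."""
    text_ids = [text_token_to_id(t) for t in text_tokens]
    image_ids = [image_token_to_id(c) for c in image_codes]
    if direction == "text2image":
        return [BOS] + text_ids + [SEP] + image_ids + [EOS]
    elif direction == "image2text":
        return [BOS] + image_ids + [SEP] + text_ids + [EOS]
    raise ValueError(direction)

# The same (text, image) pair yields training sequences for both directions.
seq_t2i = build_sequence([10, 42, 7], [100, 2048, 511], "text2image")
seq_i2t = build_sequence([10, 42, 7], [100, 2048, 511], "image2text")
print(len(seq_t2i), len(seq_i2t))
```

A single transformer trained on sequences laid out this way can then serve both text-to-image and image-to-text generation with one set of weights.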
In this work, we perform the first study on the security of MSF-based localization in AV settings. We find that the state-of-the-art MSF-based AD localization algorithm can indeed generally enhance security, but it has a take-over vulnerability that can fundamentally defeat the design principle of MSF; the vulnerability appears only dynamically and non-deterministically. Leveraging this insight, we design FusionRipper, a novel and general attack that opportunistically captures and exploits take-over vulnerabilities. We perform both trace-based and simulation-based evaluations, and find that FusionRipper can achieve at least 97% and 91.3% success rates in all traces for off-road and wrong-way attacks, respectively, with high robustness to practical factors such as spoofing inaccuracies.
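To get an intuition for what a "take-over" looks like, here is a minimal toy sketch (mine, not the paper's code): a 1-D Kalman filter standing in for the MSF localizer is fed GPS-like measurements whose spoofing offset grows exponentially once the attack starts, and the fused estimate is gradually dragged away from the true position. All parameters below are illustrative.

```python
import numpy as np

# Toy 1-D Kalman filter standing in for an MSF localizer.
# All parameters are illustrative; this is not the FusionRipper implementation.

def kalman_step(x, P, z, R=1.0, Q=0.01):
    """One predict/update step: state estimate x, variance P, measurement z."""
    P = P + Q                      # predict (static motion model)
    K = P / (P + R)                # Kalman gain
    x = x + K * (z - x)            # update with (possibly spoofed) measurement
    P = (1 - K) * P
    return x, P

rng = np.random.default_rng(0)
true_pos = 0.0
x, P = 0.0, 1.0
attack_start, d0, growth = 50, 0.1, 1.1   # hypothetical attack schedule

for t in range(100):
    offset = d0 * growth ** (t - attack_start) if t >= attack_start else 0.0
    z = true_pos + rng.normal(0, 1.0) + offset   # spoofed GPS-like measurement
    x, P = kalman_step(x, P, z)

print(f"final fused estimate deviates from truth by {x - true_pos:.2f} m")
```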
Hey. I'm thrilled to announce that my new book, Machine Learning Engineering, was just released and is now available on Amazon and Leanpub, as both a paperback edition and an e-book!
I've been working on the book for the last eleven months and I'm happy (and relieved!) that the work is now over. Just like my previous book, The Hundred-Page Machine Learning Book, this new one is distributed on the "read-first, buy-later" principle. That means you can freely download the book, read it, and share it with your friends and colleagues before buying.
The new book can be bought on Leanpub as a PDF file and on Amazon as a paperback and Kindle. The hardcover edition will be released later this week.
Here's the book's wiki with the drafts of all chapters. You can read them before buying the book: http://www.mlebook.com/wiki/doku.php
I will be here to answer your questions. Or just read the awesome Foreword by Cassie Kozyrkov!
To alleviate transformers' quadratic dependency on sequence length, a team of researchers from Google Research recently proposed a new sparse attention mechanism dubbed BigBird. In their paper Big Bird: Transformers for Longer Sequences, the team demonstrates that despite being a sparse attention mechanism, BigBird preserves all known theoretical properties of quadratic full-attention models. In experiments, BigBird dramatically improves performance across long-context NLP tasks, producing SOTA results in question answering and summarization.
Here is a quick read: Google ‘BigBird’ Achieves SOTA Performance on Long-Context NLP Tasks
The paper Big Bird: Transformers for Longer Sequences is on arXiv.
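As a rough illustration of the sparsity pattern described in the paper (a sketch, not Google's block-sparse implementation): each token attends to a local sliding window, a few random tokens, and a handful of global tokens, so each row of the attention mask has a constant number of nonzeros instead of n.

```python
import numpy as np

def bigbird_style_mask(n, window=3, n_global=2, n_random=2, seed=0):
    """Boolean attention mask combining local window, random, and global links.
    Parameter values are arbitrary; the real model uses block-sparse ops."""
    rng = np.random.default_rng(seed)
    mask = np.zeros((n, n), dtype=bool)
    for i in range(n):
        lo, hi = max(0, i - window), min(n, i + window + 1)
        mask[i, lo:hi] = True                         # sliding-window attention
        mask[i, rng.choice(n, size=n_random)] = True  # random attention
    mask[:, :n_global] = True   # global tokens are attended to by everyone
    mask[:n_global, :] = True   # and attend to everyone
    return mask

m = bigbird_style_mask(16)
print(f"density: {m.mean():.2f} (vs 1.00 for full attention)")
```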
Very impressive results:
The research team says their proposed method’s 88.4 percent accuracy on ImageNet is 2.0 percent better than the SOTA model that requires 3.5B weakly labelled Instagram images. And that’s not all: “On robustness test sets, it improves ImageNet-A top-1 accuracy from 61.0% to 83.7%, reduces ImageNet-C mean corruption error from 45.7 to 28.3, and reduces ImageNet-P mean flip rate from 27.8 to 12.2.”
A quick read: Google Brain & CMU Semi-Supervised ‘Noisy Student’ Achieves 88.4% Top-1 Accuracy on ImageNet
The paper: Self-training with Noisy Student improves ImageNet classification
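The self-training loop itself is short; below is a hedged sketch of the procedure as described in the paper, with a trivial stand-in model (the real work trains large EfficientNets and uses RandAugment, dropout, and stochastic depth as the "noise"):

```python
# Minimal sketch of the Noisy Student self-training loop (my paraphrase of
# the paper's procedure, not the authors' code). `train` and the data are
# placeholders you would replace with a real model and dataset.
import numpy as np

def train(x, y, add_noise=False):
    """Placeholder trainer: with add_noise=True, perturb the inputs as a
    stand-in for augmentation (dropout / stochastic depth live in the model)."""
    if add_noise:
        x = x + np.random.normal(0, 0.1, x.shape)
    # Stand-in "model": nearest-centroid classifier.
    centroids = np.stack([x[y == c].mean(axis=0) for c in np.unique(y)])
    return lambda q: np.argmin(((q[:, None, :] - centroids) ** 2).sum(-1), axis=1)

x_l = np.random.randn(100, 8); y_l = (x_l[:, 0] > 0).astype(int)   # labeled
x_u = np.random.randn(500, 8)                                      # unlabeled

teacher = train(x_l, y_l)                 # 1. train teacher (no noise)
for _ in range(3):                        # iterate the teacher/student swap
    y_pseudo = teacher(x_u)               # 2. pseudo-label unlabeled data
    x_all = np.concatenate([x_l, x_u])
    y_all = np.concatenate([y_l, y_pseudo])
    student = train(x_all, y_all, add_noise=True)  # 3. noisy (and, in practice, larger) student
    teacher = student                     # 4. student becomes the new teacher
```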
The trio of researchers have made deep neural networks a critical component of computing, and in individual talks and a panel discussion they shared their views on the current challenges facing deep learning and where it should be heading.
Google’s Meena was presented in a recent preprint claiming that it can create its own jokes, but the risk of racist outputs and its logical inconsistencies mean it isn’t ready to be deployed in a corporate environment. Change my mind
In this tutorial, author and teacher Ahmed Fawzy Gad gives a thorough introduction to autoencoders and shows how to use them for image compression in Keras.
Article link: https://blog.paperspace.com/autoencoder-image-compression-keras/
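If you want the core idea before diving into the full article, here is a minimal Keras autoencoder sketch (my own, not code from the tutorial): an encoder compresses flattened 28×28 MNIST images into a small latent vector and a decoder reconstructs them; the layer sizes are arbitrary.

```python
from tensorflow import keras
from tensorflow.keras import layers

# Minimal dense autoencoder for 28x28 images (sizes chosen arbitrarily).
latent_dim = 32

encoder = keras.Sequential([
    layers.Input(shape=(784,)),
    layers.Dense(128, activation="relu"),
    layers.Dense(latent_dim, activation="relu"),   # compressed representation
])
decoder = keras.Sequential([
    layers.Input(shape=(latent_dim,)),
    layers.Dense(128, activation="relu"),
    layers.Dense(784, activation="sigmoid"),       # reconstruct pixels in [0, 1]
])
autoencoder = keras.Sequential([encoder, decoder])
autoencoder.compile(optimizer="adam", loss="binary_crossentropy")

(x_train, _), (x_test, _) = keras.datasets.mnist.load_data()
x_train = x_train.reshape(-1, 784).astype("float32") / 255.0
x_test = x_test.reshape(-1, 784).astype("float32") / 255.0

autoencoder.fit(x_train, x_train, epochs=5, batch_size=256,
                validation_data=(x_test, x_test))
codes = encoder.predict(x_test[:10])     # 32-dim compressed codes
recons = decoder.predict(codes)          # reconstructed images
```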
How do you analyze the distribution of scores produced by a binary classification model to make sure it makes sense?
I am using a decision tree to predict how likely an individual is to vote. One idea is to analyze the splits of the tree to see why an individual was given a particular score: for example, people who got a score below 25% had these characteristics, people who got a score between 25% and 50% had these characteristics, and so on. Is there a better way to do it?
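One way to make the bucketing idea concrete (a sketch with synthetic, hypothetical data and feature names, using pandas and scikit-learn): bin the predicted probabilities, summarize feature averages per bin, and also check calibration, i.e., whether the observed vote rate in each bin roughly matches the predicted probability.

```python
import numpy as np
import pandas as pd
from sklearn.tree import DecisionTreeClassifier
from sklearn.calibration import calibration_curve

# Synthetic stand-in for the voter data (hypothetical feature names).
rng = np.random.default_rng(0)
X = pd.DataFrame({"age": rng.integers(18, 90, 2000),
                  "past_elections_voted": rng.integers(0, 5, 2000)})
y = (X["past_elections_voted"] + rng.normal(0, 1, 2000) > 2).astype(int)

clf = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X, y)
scores = clf.predict_proba(X)[:, 1]            # predicted probability of voting

df = X.copy()
df["score"] = scores
df["bucket"] = pd.cut(df["score"], bins=[0, 0.25, 0.5, 0.75, 1.0],
                      include_lowest=True)

# How the scores themselves are distributed across buckets.
print(df["bucket"].value_counts().sort_index())

# Average feature values per score bucket: do the low-score buckets look
# like people you would expect to be unlikely to vote?
print(df.groupby("bucket", observed=True).mean(numeric_only=True))

# Reliability check: does the observed vote rate track the predicted score?
frac_pos, mean_pred = calibration_curve(y, scores, n_bins=10)
print(pd.DataFrame({"mean_predicted": mean_pred, "observed_vote_rate": frac_pos}))
```

If the per-bucket summaries match domain expectations and the calibration table stays close to the diagonal, the score distribution is behaving sensibly; large gaps flag buckets worth inspecting via the tree's decision paths.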