MetaQNN

In our paper, Designing Neural Network Architectures Using Reinforcement Learning (arxiv, openreview), we propose a meta-modeling approach based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using Q-learning with an ε-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types, and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing network design meta-modeling approaches on image classification.

CIFAR10

6.92% test error

network solver

(128, 5, 1)

(512, 3, 1)

12.5%

(2, 2)

(128, 1, 1)

25.0%

(128, 5, 1)

(3, 2)

37.5%

(512, 3, 1)

(10)

C(128, 5, 1) - C(512, 3, 1) - P(2, 2) - C(128, 1, 1) - C(128, 5, 1) - P(3, 2) - C(512, 3, 1) - SM(10)

8.88% test error

network solver

(256, 3, 1)

(128, 1, 1)

16.7%

(128, 3, 1)

33.3%

(5, 3)

(128, 3, 1)

50.0%

(10)

C(256, 3, 1) - C(128, 1, 1) - C(128, 3, 1) - C(128, 3, 1) - P(5, 3) - C(128, 3, 1) - SM(10)

11.63% test error

network solver

(64, 3, 1)

(128, 3, 1)

10.0%

(5, 3)

(256, 3, 1)

20.0%

(3, 2)

(512)

30.0%

(128)

40.0%

(10)

C(64, 3, 1) - C(128, 3, 1) - P(5, 3) - C(256, 3, 1) - P(3, 2) - FC(512) - FC(128) - SM(10)

9.24% test error

network solver

(64, 5, 1)

(512, 3, 1)

10.0%

(512, 1, 1)

(64, 1, 1)

20.0%

(128, 5, 1)

(3, 2)

30.0%

(512, 5, 1)

(512, 3, 1)

40.0%

(5, 3)

(10)

C(64, 5, 1) - C(512, 3, 1) - C(512, 1, 1) - C(64, 1, 1) - C(128, 5, 1) - P(3, 2) - C(512, 5, 1) - C(512, 3, 1) - P(5, 3) - SM(10)

8.78% test error

network solver

(128, 5, 1)

(512, 3, 1)

16.7%

(2, 2)

(128, 1, 1)

33.3%

(128, 5, 1)

(3, 2)

50.0%

(10)

C(128, 5, 1) - C(512, 3, 1) - P(2, 2) - C(128, 1, 1) - C(128, 5, 1) - P(3, 2) - SM(10)

SVHN

2.29% test error

network solver

(64, 1, 1)

(128, 3, 1)

8.3%

(64, 5, 1)

(512, 5, 1)

16.7%

(256, 1, 1)

(256, 5, 1)

25.0%

(128, 1, 1)

(256, 5, 1)

33.3%

(3, 2)

(512, 5, 1)

41.7%

(256, 3, 1)

(128, 3, 1)

50.0%

(10)

C(64, 1, 1) - C(128, 3, 1) - C(64, 5, 1) - C(512, 5, 1) - C(256, 1, 1) - C(256, 5, 1) - C(128, 1, 1) - C(256, 5, 1) - P(3, 2) - C(512, 5, 1) - C(256, 3, 1) - C(128, 3, 1) - SM(10)

2.33% test error

network solver

(128, 1, 1)

(256, 5, 1)

8.3%

(128, 5, 1)

(2, 2)

16.7%

(256, 5, 1)

(256, 1, 1)

25.0%

(256, 3, 1)

33.3%

(256, 5, 1)

(512, 5, 1)

41.7%

(256, 3, 1)

(128, 3, 1)

50.0%

(10)

C(128, 1, 1) - C(256, 5, 1) - C(128, 5, 1) - P(2, 2) - C(256, 5, 1) - C(256, 1, 1) - C(256, 3, 1) - C(256, 3, 1) - C(256, 5, 1) - C(512, 5, 1) - C(256, 3, 1) - C(128, 3, 1) - SM(10)

2.35% test error

network solver

(128, 5, 1)

(128, 3, 1)

10.0%

(64, 5, 1)

(5, 3)

20.0%

(128, 3, 1)

(512, 5, 1)

30.0%

(256, 5, 1)

(128, 5, 1)

40.0%

(128, 5, 1)

(128, 3, 1)

50.0%

(10)

C(128, 5, 1) - C(128, 3, 1) - C(64, 5, 1) - P(5, 3) - C(128, 3, 1) - C(512, 5, 1) - C(256, 5, 1) - C(128, 5, 1) - C(128, 5, 1) - C(128, 3, 1) - SM(10)

2.24% test error

network solver

(128, 3, 1)

(2, 2)

8.3%

(64, 1, 1)

(256, 1, 1)

16.7%

(256, 5, 1)

(128, 1, 1)

25.0%

(128, 5, 1)

(512, 3, 1)

33.3%

(256, 5, 1)

(256, 1, 1)

41.7%

(128, 3, 1)

(64, 1, 1)

50.0%

(10)

C(128, 3, 1) - P(2, 2) - C(64, 1, 1) - C(256, 1, 1) - C(256, 5, 1) - C(128, 1, 1) - C(128, 5, 1) - C(512, 3, 1) - C(256, 5, 1) - C(256, 1, 1) - C(128, 3, 1) - C(64, 1, 1) - SM(10)

2.36% test error

network solver

(128, 1, 1)

(256, 5, 1)

8.3%

(128, 5, 1)

(512, 5, 1)

16.7%

(256, 1, 1)

(256, 5, 1)

25.0%

(5, 3)

(128, 5, 1)

33.3%

(128, 5, 1)

41.7%

(64, 1, 1)

(128, 5, 1)

50.0%

(10)

C(128, 1, 1) - C(256, 5, 1) - C(128, 5, 1) - C(512, 5, 1) - C(256, 1, 1) - C(256, 5, 1) - P(5, 3) - C(128, 5, 1) - C(128, 5, 1) - C(128, 5, 1) - C(64, 1, 1) - C(128, 5, 1) - SM(10)

MNIST

0.44% test error

network solver

(512, 5, 1)

(128, 5, 1)

12.5%

(128, 5, 1)

(128, 3, 1)

25.0%

(256, 3, 1)

(512, 5, 1)

37.5%

(256, 3, 1)

(128, 3, 1)

50.0%

(10)

C(512, 5, 1) - C(128, 5, 1) - C(128, 5, 1) - C(128, 3, 1) - C(256, 3, 1) - C(512, 5, 1) - C(256, 3, 1) - C(128, 3, 1) - SM(10)

0.44% test error

network solver

(64, 1, 1)

(256, 3, 1)

8.3%

(2, 2)

(512, 3, 1)

16.7%

(256, 1, 1)

(5, 3)

25.0%

(256, 3, 1)

(512, 3, 1)

33.3%

(512)

41.7%

(10)

C(64, 1, 1) - C(256, 3, 1) - P(2, 2) - C(512, 3, 1) - C(256, 1, 1) - P(5, 3) - C(256, 3, 1) - C(512, 3, 1) - FC(512) - SM(10)

0.38% test error

network solver

(64, 1, 1)

(256, 5, 1)

8.3%

(256, 5, 1)

(512, 1, 1)

16.7%

(64, 3, 1)

(5, 3)

25.0%

(256, 5, 1)

33.3%

(512, 5, 1)

(64, 1, 1)

41.7%

(128, 5, 1)

(512, 5, 1)

50.0%

(10)

C(64, 1, 1) - C(256, 5, 1) - C(256, 5, 1) - C(512, 1, 1) - C(64, 3, 1) - P(5, 3) - C(256, 5, 1) - C(256, 5, 1) - C(512, 5, 1) - C(64, 1, 1) - C(128, 5, 1) - C(512, 5, 1) - SM(10)

0.46% test error

network solver

(512, 5, 1)

(128, 5, 1)

12.5%

(128, 5, 1)

(128, 1, 1)

25.0%

(2, 2)

(512, 5, 1)

37.5%

(256, 3, 1)

(128, 3, 1)

50.0%

(10)

C(512, 5, 1) - C(128, 5, 1) - C(128, 5, 1) - C(128, 1, 1) - P(2, 2) - C(512, 5, 1) - C(256, 3, 1) - C(128, 3, 1) - SM(10)

0.55% test error

network solver

(256, 3, 1)

(256, 5, 1)

8.3%

(512, 3, 1)

(256, 5, 1)

16.7%

(512, 1, 1)

(5, 3)

25.0%

(256, 3, 1)

(64, 3, 1)

33.3%

(256, 5, 1)

(512, 3, 1)

41.7%

(128, 5, 1)

(512, 5, 1)

50.0%

(10)

C(256, 3, 1) - C(256, 5, 1) - C(512, 3, 1) - C(256, 5, 1) - C(512, 1, 1) - P(5, 3) - C(256, 3, 1) - C(64, 3, 1) - C(256, 5, 1) - C(512, 3, 1) - C(128, 5, 1) - C(512, 5, 1) - SM(10)

0.43% test error

network solver

(64, 3, 1)

(128, 3, 1)

10.0%

(512, 1, 1)

(256, 1, 1)

20.0%

(256, 5, 1)

(128, 3, 1)

30.0%

(5, 3)

(512, 1, 1)

40.0%

(512, 3, 1)

(128, 5, 1)

50.0%

(10)

C(64, 3, 1) - C(128, 3, 1) - C(512, 1, 1) - C(256, 1, 1) - C(256, 5, 1) - C(128, 3, 1) - P(5, 3) - C(512, 1, 1) - C(512, 3, 1) - C(128, 5, 1) - SM(10)

0.41% test error

network solver

(128, 3, 1)

(64, 1, 1)

7.1%

(64, 3, 1)

(64, 5, 1)

14.3%

(2, 2)

(128, 3, 1)

21.4%

(3, 2)

(512, 3, 1)

28.6%

(512)

35.7%

(128)

42.9%

(10)

C(128, 3, 1) - C(64, 1, 1) - C(64, 3, 1) - C(64, 5, 1) - P(2, 2) - C(128, 3, 1) - P(3, 2) - C(512, 3, 1) - FC(512) - FC(128) - SM(10)

0.35% test error

network solver

(128, 3, 1)

(512, 3, 1)

12.5%

(2, 2)

(256, 3, 1)

25.0%

(128, 5, 1)

(64, 1, 1)

37.5%

(64, 5, 1)

(512, 5, 1)

50.0%

GAP

(10)

C(128, 3, 1) - C(512, 3, 1) - P(2, 2) - C(256, 3, 1) - C(128, 5, 1) - C(64, 1, 1) - C(64, 5, 1) - C(512, 5, 1) - GAP(10) - SM(10)

0.40% test error

network solver

(64, 5, 1)

(512, 5, 1)

8.3%

(3, 2)

(256, 5, 1)

16.7%

(256, 3, 1)

25.0%

(128, 1, 1)

(256, 3, 1)

33.3%

(256, 5, 1)

(64, 1, 1)

41.7%

(256, 3, 1)

(64, 3, 1)

50.0%

(10)

C(64, 5, 1) - C(512, 5, 1) - P(3, 2) - C(256, 5, 1) - C(256, 3, 1) - C(256, 3, 1) - C(128, 1, 1) - C(256, 3, 1) - C(256, 5, 1) - C(64, 1, 1) - C(256, 3, 1) - C(64, 3, 1) - SM(10)

0.56% test error

network solver

(512, 1, 1)

(128, 3, 1)

8.3%

(128, 5, 1)

(64, 1, 1)

16.7%

(256, 5, 1)

(64, 1, 1)

25.0%

(5, 3)

(512, 1, 1)

33.3%

(512, 3, 1)

(256, 3, 1)

41.7%

(256, 5, 1)

50.0%

(10)

C(512, 1, 1) - C(128, 3, 1) - C(128, 5, 1) - C(64, 1, 1) - C(256, 5, 1) - C(64, 1, 1) - P(5, 3) - C(512, 1, 1) - C(512, 3, 1) - C(256, 3, 1) - C(256, 5, 1) - C(256, 5, 1) - SM(10)

MetaQNN

About

CIFAR10

SVHN

MNIST

Team