Remote sensing image scene classification based on stackable attention structure