
Meaning of the training log parameters

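  1. Learning rate

    • base_lr: the base learning rate, i.e. the value initially set in the optimizer config
    • lr: the learning rate actually in use, usually base_lr after the learning-rate schedule has been applied. During linear warmup, for example, lr starts at base_lr * start_factor and rises linearly to base_lr. If paramwise_cfg sets an lr_mult, the logged lr may also reflect that multiplier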
  2. Timing

    • time: total time per iteration, in seconds
    • data_time: data-loading time per iteration, in seconds
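  3. Performance metrics

    • loss: total loss
    • loss_cls: classification loss
    • top1_acc: top-1 accuracy on the current batch (the highest-scoring class matches the label)
    • top5_acc: top-5 accuracy on the current batch (the label is among the five highest-scoring classes)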
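  4. Training progress

    • epoch: current training epoch
    • iter: current iteration, i.e. how many batches the model has processed so far. One epoch contains roughly data_num/batch_size iterations
    • memory: GPU memory usage in MB. As a rule of thumb, aim for about 50% of the card's capacity: when the logged value is around 50%, total usage is often around 70% of the GPU's memory
    • step: the number of optimizer parameter updates. In ordinary training every iteration updates the parameters once, so step equals iter. The two can differ in special cases: with gradient accumulation, one step is taken only after several iterations, and in distributed training step may be out of sync with iter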
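
The relationship between iter, step, and the number of iterations per epoch can be sketched in a few lines of Python. This is a minimal illustration rather than MMEngine's own bookkeeping: the function names, the sample count of 1000, and the `drop_last` flag are assumptions made for the example, and `accumulative_counts` is used here simply as the number of iterations accumulated before one parameter update.

```python
import math


def iters_per_epoch(num_samples, batch_size, drop_last=False):
    """Number of batches (iterations) needed to see the dataset once."""
    if drop_last:
        # An incomplete final batch is discarded.
        return num_samples // batch_size
    # An incomplete final batch still counts as one iteration.
    return math.ceil(num_samples / batch_size)


def optimizer_steps(num_iters, accumulative_counts=1):
    """Parameter updates after num_iters iterations.

    Simplification: only complete accumulation groups are counted.
    """
    return num_iters // accumulative_counts


# Hypothetical dataset of 1000 clips with batch_size=4.
n_iters = iters_per_epoch(1000, 4)
print(n_iters)                      # 250
print(optimizer_steps(n_iters))     # 250, step equals iter
print(optimizer_steps(n_iters, 4))  # 62, one step per 4 iterations
```

With no accumulation the two counters agree; once several iterations share one update, step falls behind iter by the accumulation factor.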

For early stopping, patience is typically set between max_epochs/10 and max_epochs/10 + 5.
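
As a rough sketch of that rule of thumb, the helper below turns max_epochs into a patience range. The function name is hypothetical, and rounding down with a floor of 1 is an assumption, since the rule itself does not say how to round. The dict reuses the `EarlyStoppingHook` keys that appear in the config in this post.

```python
def patience_range(max_epochs):
    """Return (low, high) patience following max_epochs/10 .. max_epochs/10 + 5.

    Assumption: round down, and never go below 1 epoch.
    """
    low = max(1, max_epochs // 10)
    return low, low + 5


low, high = patience_range(26)
print(low, high)  # 2 7

# Early-stopping hook entry using the lower bound of the range.
early_stopping = dict(
    type='EarlyStoppingHook',
    monitor='acc/top1',
    rule='greater',
    min_delta=0.001,
    patience=low,
)
```

For max_epochs=26 this gives a range of 2 to 7 epochs, whereas the config in this post uses patience=1, which stops training after a single epoch without improvement.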

ann_file_test = '/hy-tmp/test_split1.txt'
ann_file_train = '/hy-tmp/train_split1.txt'
ann_file_val = '/hy-tmp/val_split1.txt'
auto_scale_lr = dict(base_batch_size=64, enable=False)
data_root = '/hy-tmp/hy-tmp/hmdb51_sta/hmdb51_sta'
data_root_val = '/hy-tmp/hy-tmp/hmdb51_sta/hmdb51_sta'
dataset_type = 'VideoDataset'
default_hooks = dict(checkpoint=dict(interval=1, max_keep_ckpts=10, save_best='auto',type='CheckpointHook'),early_stopping=dict(min_delta=0.001,monitor='acc/top1',patience=1,rule='greater',type='EarlyStoppingHook'),logger=dict(ignore_last=False, interval=100, type='LoggerHook'),param_scheduler=dict(type='ParamSchedulerHook'),runtime_info=dict(type='RuntimeInfoHook'),sampler_seed=dict(type='DistSamplerSeedHook'),sync_buffers=dict(type='SyncBuffersHook'),timer=dict(type='IterTimerHook'))
default_scope = 'mmaction'
env_cfg = dict(cudnn_benchmark=False,dist_cfg=dict(backend='nccl'),mp_cfg=dict(mp_start_method='fork', opencv_num_threads=0))
file_client_args = dict(io_backend='disk')
launcher = 'none'
load_from = '/hy-tmp/mmaction2-main/work_dirs/my_swin-tiny-p244-w877_no-pre_8xb8-amp-32x2x1-30e_hmdb51-rgb/epoch_25.pth'
log_level = 'INFO'
log_processor = dict(by_epoch=True, type='LogProcessor', window_size=20)
model = dict(backbone=dict(arch='tiny',attn_drop_rate=0.0,drop_path_rate=0.1,drop_rate=0.0,mlp_ratio=4.0,patch_norm=True,patch_size=(2,4,4,),pretrained=None,pretrained2d=None,qk_scale=None,qkv_bias=True,type='SwinTransformer3D',window_size=(8,7,7,)),cls_head=dict(average_clips='prob',dropout_ratio=0.5,in_channels=768,num_classes=102,spatial_type='avg',type='I3DHead'),data_preprocessor=dict(format_shape='NCTHW',mean=[123.675,116.28,103.53,],std=[58.395,57.12,57.375,],type='ActionDataPreprocessor'),type='Recognizer3D')
optim_wrapper = dict(constructor='SwinOptimWrapperConstructor',optimizer=dict(betas=(0.9,0.999,), lr=0.001, type='AdamW', weight_decay=0.02),paramwise_cfg=dict(absolute_pos_embed=dict(decay_mult=0.0),backbone=dict(lr_mult=0.1),norm=dict(decay_mult=0.0),relative_position_bias_table=dict(decay_mult=0.0)),type='AmpOptimWrapper')
param_scheduler = [dict(begin=0,by_epoch=True,convert_to_iter_based=True,end=2.5,start_factor=0.1,type='LinearLR'),dict(T_max=26,begin=0,by_epoch=True,end=150,eta_min=0,type='CosineAnnealingLR'),
]
randomness = dict(deterministic=False, diff_rank_seed=False, seed=None)
resume = True
test_cfg = dict(type='TestLoop')
test_dataloader = dict(batch_size=1,dataset=dict(ann_file='/hy-tmp/test_split1.txt',data_prefix=dict(video='/hy-tmp/hy-tmp/hmdb51_sta/hmdb51_sta'),pipeline=[dict(io_backend='disk', type='DecordInit'),dict(clip_len=32,frame_interval=2,num_clips=4,test_mode=True,type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,224,), type='Resize'),dict(crop_size=224, type='ThreeCrop'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),],test_mode=True,type='VideoDataset'),num_workers=8,persistent_workers=True,sampler=dict(shuffle=False, type='DefaultSampler'))
test_evaluator = dict(type='AccMetric')
test_pipeline = [dict(io_backend='disk', type='DecordInit'),dict(clip_len=32,frame_interval=2,num_clips=4,test_mode=True,type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,224,), type='Resize'),dict(crop_size=224, type='ThreeCrop'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),
]
train_cfg = dict(max_epochs=26, type='EpochBasedTrainLoop', val_begin=1, val_interval=1)
train_dataloader = dict(batch_size=4,dataset=dict(ann_file='/hy-tmp/train_split1.txt',data_prefix=dict(video='/hy-tmp/hy-tmp/hmdb51_sta/hmdb51_sta'),pipeline=[dict(io_backend='disk', type='DecordInit'),dict(clip_len=3, frame_interval=2, num_clips=1,type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,256,), type='Resize'),dict(type='RandomResizedCrop'),dict(keep_ratio=False, scale=(224,224,), type='Resize'),dict(flip_ratio=0.5, type='Flip'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),],type='VideoDataset'),num_workers=8,persistent_workers=True,sampler=dict(shuffle=True, type='DefaultSampler'))
train_pipeline = [dict(io_backend='disk', type='DecordInit'),dict(clip_len=3, frame_interval=2, num_clips=1, type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,256,), type='Resize'),dict(type='RandomResizedCrop'),dict(keep_ratio=False, scale=(224,224,), type='Resize'),dict(flip_ratio=0.5, type='Flip'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),
]
val_cfg = dict(type='ValLoop')
val_dataloader = dict(batch_size=4,dataset=dict(ann_file='/hy-tmp/val_split1.txt',data_prefix=dict(video='/hy-tmp/hy-tmp/hmdb51_sta/hmdb51_sta'),pipeline=[dict(io_backend='disk', type='DecordInit'),dict(clip_len=3,frame_interval=2,num_clips=1,test_mode=True,type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,256,), type='Resize'),dict(crop_size=224, type='CenterCrop'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),],test_mode=True,type='VideoDataset'),num_workers=8,persistent_workers=True,sampler=dict(shuffle=False, type='DefaultSampler'))
val_evaluator = dict(type='AccMetric')
val_pipeline = [dict(io_backend='disk', type='DecordInit'),dict(clip_len=3,frame_interval=2,num_clips=1,test_mode=True,type='SampleFrames'),dict(type='DecordDecode'),dict(scale=(-1,256,), type='Resize'),dict(crop_size=224, type='CenterCrop'),dict(input_format='NCTHW', type='FormatShape'),dict(type='PackActionInputs'),
]
vis_backends = [dict(type='LocalVisBackend'),
]
visualizer = dict(type='ActionVisualizer', vis_backends=[dict(type='LocalVisBackend'),])
work_dir = './work_dirs/my_swin-tiny-p244-w877_no-pre_8xb8-amp-32x2x1-30e_hmdb51-rgb'
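
To see how the `param_scheduler` entries above shape the logged lr, the following sketch evaluates a linear warmup over the first 2.5 epochs multiplied by a per-epoch cosine decay with T_max=26. It is a simplified closed-form approximation, not MMEngine's implementation: it assumes the two schedulers combine multiplicatively, ignores the `lr_mult` values in `paramwise_cfg`, and uses a hypothetical 100 iterations per epoch.

```python
import math


def linear_warmup_factor(t, total, start_factor=0.1, end_factor=1.0):
    """Linear ramp from start_factor to end_factor over `total` iterations."""
    progress = min(t, total) / total
    return start_factor + (end_factor - start_factor) * progress


def cosine_factor(epoch, t_max):
    """Cosine decay from 1 to 0 over t_max epochs (eta_min=0)."""
    e = min(epoch, t_max)
    return 0.5 * (1.0 + math.cos(math.pi * e / t_max))


def lr_at(epoch, it_in_epoch, iters_per_epoch,
          base_lr=0.001, warmup_epochs=2.5, t_max=26):
    """Approximate learning rate at a given epoch and iteration."""
    global_iter = epoch * iters_per_epoch + it_in_epoch
    warmup_iters = int(warmup_epochs * iters_per_epoch)
    warmup = linear_warmup_factor(global_iter, warmup_iters)
    return base_lr * warmup * cosine_factor(epoch, t_max)


# Hypothetical 100 iterations per epoch.
for epoch in (0, 1, 3, 13, 26):
    print(epoch, lr_at(epoch, 0, 100))
```

Under these assumptions the learning rate starts at base_lr * start_factor, approaches base_lr once warmup ends, and then decays toward zero by epoch 26.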
