Skip to content
Snippets Groups Projects
  1. Aug 17, 2021
  2. Aug 13, 2021
    • sennnnn's avatar
      [Feature] Add segformer decode head and related train config (#599) · b4fd32d0
      sennnnn authored
      * [Feature]Segformer re-implementation
      
      * Using act_cfg and norm_cfg to control activation and normalization
      
      * Split this PR into several little PRs
      
      * Fix lint error
      
      * Remove SegFormerHead
      
      * [Feature] Add segformer decode head and related train config
      
      * Add ade20K trainval support for segformer
      
      1. Add related train and val configs;
      
      2. Add AlignedResize;
      
      * Set arg: find_unused_parameters = True
      
      * parameters init refactor
      
      * 1. Refactor segformer backbone parameters init;
      
      2. Remove rebundant functions and unit tests;
      
      * Remove rebundant codes
      
      * Replace Linear Layer to 1X1 Conv
      
      * Use nn.ModuleList to refactor segformer head.
      
      * Remove local to_xtuple
      
      * 1. Remove rebundant codes;
      
      2. Modify module name;
      
      * Refactor the backbone of segformer using mmcv.cnn.bricks.transformer.py
      
      * Fix some code logic bugs.
      
      * Add mit_convert.py to match pretrain keys of segformer.
      
      * Resolve some comments.
      
      * 1. Add some assert to ensure right params;
      
      2. Support flexible peconv position;
      
      * Add pe_index assert and fix unit test.
      
      * 1. Add doc string for MixVisionTransformer;
      
      2. Add some unit tests for MixVisionTransformer;
      
      * Use hw_shape to pass shape of feature map.
      
      * 1. Fix doc string of MixVisionTransformer;
      
      2. Simplify MixFFN;
      
      3. Modify H, W to hw_shape;
      
      * Add more unit tests.
      
      * Add doc string for shape convertion functions.
      
      * Add some unit tests to improve code coverage.
      
      * Fix Segformer backbone pretrain weights match bug.
      
      * Modify configs of segformer.
      
      * resolve the shape convertion functions doc string.
      
      * Add pad_to_patch_size arg.
      
      * Support progressive test with fewer memory cost.
      
      * Modify default value of pad_to_patch_size arg.
      
      * Temp code
      
      * Using processor to refactor evaluation workflow.
      
      * refactor eval hook.
      
      * Fix process bar.
      
      * Fix middle save argument.
      
      * Modify some variable name of dataset evaluate api.
      
      * Modify some viriable name of eval hook.
      
      * Fix some priority bugs of eval hook.
      
      * Fix some bugs about model loading and eval hook.
      
      * Add ade20k 640x640 dataset.
      
      * Fix related segformer configs.
      
      * Depreciated efficient_test.
      
      * Fix training progress blocked by eval hook.
      
      * Depreciated old test api.
      
      * Modify error patch size.
      
      * Fix pretrain of mit_b0
      
      * Fix the test api error.
      
      * Modify dataset base config.
      
      * Fix test api error.
      
      * Modify outer api.
      
      * Build a sampler test api.
      
      * TODO: Refactor format_results.
      
      * Modify variable names.
      
      * Fix num_classes bug.
      
      * Fix sampler index bug.
      
      * Fix grammaly bug.
      
      * Add part of benchmark results.
      
      * Support batch sampler.
      
      * More readable test api.
      
      * Remove some command arg and fix eval hook bug.
      
      * Support format-only arg.
      
      * Modify format_results of datasets.
      
      * Modify tool which use test apis.
      
      * Update readme.
      
      * Update readme of segformer.
      
      * Updata readme of segformer.
      
      * Update segformer readme and fix segformer mit_b4.
      
      * Update readme of segformer.
      
      * Clean AlignedResize related config.
      
      * Clean code from pr #709
      
      * Clean code from pr #709
      
      * Add 512x512 segformer_mit-b5.
      
      * Fix lint.
      
      * Fix some segformer head bugs.
      
      * Add segformer unit tests.
      
      * Replace AlignedResize to ResizeToMultiple.
      
      * Modify readme of segformer.
      
      * Fix bug of ResizeToMultiple.
      
      * Add ResizeToMultiple unit tests.
      
      * Resolve conflict.
      
      * Simplify the implementation of ResizeToMultiple.
      
      * Update test results.
      
      * Fix multi-scale test error when resize_ratio=1.75 and input size=640x640.
      
      * Update segformer results.
      
      * Update Segformer results.
      
      * Fix some url bugs and pipelines bug.
      
      * Move ckpt convertion to tools.
      
      * Add segformer official pretrain weights usage.
      
      * Clean redundant codes.
      
      * Remove redundant codes.
      
      * Unfied format.
      
      * Add description for segformer converter.
      
      * Update workers.
      b4fd32d0
  3. Aug 12, 2021
  4. Aug 03, 2021
  5. Jul 28, 2021
  6. Jul 19, 2021
    • sennnnn's avatar
      [Feature] Segformer backbone re-implementation (#594) · 095ed243
      sennnnn authored
      * [Feature]Segformer re-implementation
      
      * Using act_cfg and norm_cfg to control activation and normalization
      
      * Split this PR into several little PRs
      
      * Fix lint error
      
      * Remove SegFormerHead
      
      * parameters init refactor
      
      * 1. Refactor segformer backbone parameters init;
      
      2. Remove rebundant functions and unit tests;
      
      * Remove rebundant codes
      
      * 1. Remove rebundant codes;
      
      2. Modify module name;
      
      * Refactor the backbone of segformer using mmcv.cnn.bricks.transformer.py
      
      * Fix some code logic bugs.
      
      * Add mit_convert.py to match pretrain keys of segformer.
      
      * Resolve some comments.
      
      * 1. Add some assert to ensure right params;
      
      2. Support flexible peconv position;
      
      * Add pe_index assert and fix unit test.
      
      * 1. Add doc string for MixVisionTransformer;
      
      2. Add some unit tests for MixVisionTransformer;
      
      * Use hw_shape to pass shape of feature map.
      
      * 1. Fix doc string of MixVisionTransformer;
      
      2. Simplify MixFFN;
      
      3. Modify H, W to hw_shape;
      
      * Add more unit tests.
      
      * Add doc string for shape convertion functions.
      
      * Add some unit tests to improve code coverage.
      
      * Fix Segformer backbone pretrain weights match bug.
      
      * resolve the shape convertion functions doc string.
      
      * Add pad_to_patch_size arg.
      
      * Modify default value of pad_to_patch_size arg.
      095ed243
    • 谢昕辰's avatar
      [Fix] fix patch_embed and pos_embed mismatch error (#685) · f6246d6e
      谢昕辰 authored
      
      * fix patch_embed and pos_embed mismatch error
      
      * add docstring
      
      * update unittest
      
      * use downsampled image shape
      
      * use tuple
      
      * remove unused parameters and add doc
      
      * fix init weights function
      
      * revise docstring
      
      * Update vit.py
      
      If -> Whether
      
      * fix lint
      
      Co-authored-by: default avatarJunjun2016 <hejunjun@sjtu.edu.cn>
      f6246d6e
  7. Jul 01, 2021
    • Ze Liu's avatar
      [WIP] Add Swin Transformer (#511) · 214d083c
      Ze Liu authored
      
      * add Swin Transformer
      
      * add Swin Transformer
      
      * fixed import
      
      * Add some swin training settings.
      
      * Fix some filename error.
      
      * Fix attribute name: pretrain -> pretrained
      
      * Upload mmcls implementation of swin transformer.
      
      * Refactor Swin Transformer to follow mmcls style.
      
      * Refactor init_weigths of swin_transformer.py
      
      * Fix lint
      
      * Match inference precision
      
      * Add some comments
      
      * Add swin_convert to load official style ckpt
      
      * Remove arg: auto_pad
      
      * 1. Complete comments for each block;
      
      2. Correct weight convert function;
      
      3. Fix the pad of Patch Merging;
      
      * Clean function args.
      
      * Fix vit unit test.
      
      * 1. Add swin transformer unit tests;
      
      2. Fix some pad bug;
      
      3. Modify config to adapt new swin implementation;
      
      * Modify config arg
      
      * Update readme.md of swin
      
      * Fix config arg error and Add some swin benchmark msg.
      
      * Add MeM and ms test content for readme.md of swin transformer.
      
      * Fix doc string of swin module
      
      * 1. Register swin transformer to model list;
      
      2. Modify pth url which keep meta attribute;
      
      * Update swin.py
      
      * Merge config settings.
      
      * Modify config style.
      
      * Update README.md
      
      Add ViT link
      
      * Modify main readme.md
      
      Co-authored-by: default avatarJiarui XU <xvjiarui0826@gmail.com>
      Co-authored-by: default avatarsennnnn <201730271412@mail.scut.edu.cn>
      Co-authored-by: default avatarJunjun2016 <hejunjun@sjtu.edu.cn>
      214d083c
    • 谢昕辰's avatar
      add configs for vit backbone plus decode_heads (#520) · 5245edb0
      谢昕辰 authored
      * add config
      
      * add cityscapes config
      
      * add default value to docstring
      
      * fix lint
      
      * add deit-s and deit-b
      
      * add readme
      
      * add eps at norm_cfg
      
      * add drop_path_rate experiment
      
      * add deit case at init_weight
      
      * add upernet result
      
      * update result and add upernet 160k config
      
      * update upernet result and fix settings
      
      * Update iters number
      
      * update result and delete some configs
      
      * fix import error
      
      * fix drop_path_rate
      
      * update result and restore config
      
      * update benchmark result
      
      * remove cityscapes exp
      
      * remove neck
      
      * neck exp
      
      * add more configs
      
      * fix init error
      
      * fix ffn setting
      
      * update result
      
      * update results
      
      * update result
      
      * update results and fill table
      
      * delete or rename configs
      
      * fix link delimiter
      
      * rename configs and fix link
      
      * rename neck to mln
      5245edb0
  8. Jun 24, 2021
  9. Jun 23, 2021
    • Sixiao Zheng's avatar
      [Feature] Official implementation of SETR (#531) · 5876868a
      Sixiao Zheng authored
      
      * Adjust vision transformer backbone architectures;
      
      * Add DropPath, trunc_normal_ for VisionTransformer implementation;
      
      * Add class token buring intermediate period and remove it during final period;
      
      * Fix some parameters loss bug;
      
      * * Store intermediate token features and impose no processes on them;
      
      * Remove class token and reshape entire token feature from NLC to NCHW;
      
      * Fix some doc error
      
      * Add a arg for VisionTransformer backbone to control if input class token into transformer;
      
      * Add stochastic depth decay rule for DropPath;
      
      * * Fix output bug when input_cls_token=False;
      
      * Add related unit test;
      
      * Re-implement of SETR
      
      * Add two head -- SETRUPHead (Naive, PUP) & SETRMLAHead (MLA);
      
      * * Modify some docs of heads of SETR;
      
      * Add MLA auxiliary head of SETR;
      
      * * Modify some arg of setr heads;
      
      * Add unit test for setr heads;
      
      * * Add 768x768 cityscapes dataset config;
      
      * Add Backbone: SETR -- Backbone: MLA, PUP, Naive;
      
      * Add SETR cityscapes training & testing config;
      
      * * Fix the low code coverage of unit test about heads of setr;
      
      * Remove some rebundant error capture;
      
      * * Add pascal context dataset & ade20k dataset config;
      
      * Modify auxiliary head relative config;
      
      * Modify folder structure.
      
      * add setr
      
      * modify vit
      
      * Fix the test_cfg arg position;
      
      * Fix some learning schedule bug;
      
      * optimize setr code
      
      * Add arg: final_reshape to control if converting output feature information from NLC to NCHW;
      
      * Fix the default value of final_reshape;
      
      * Modify arg: final_reshape to arg: out_shape;
      
      * Fix some unit test bug;
      
      * Add MLA neck;
      
      * Modify setr configs to add MLA neck;
      
      * Modify MLA decode head to remove rebundant structure;
      
      * Remove some rebundant files.
      
      * * Fix the code style bug;
      
      * Remove some rebundant files;
      
      * Modify some unit tests of SETR;
      
      * Ignoring CityscapesCoarseDataset and MapillaryDataset.
      
      * Fix the activation function loss bug;
      
      * Fix the img_size bug of SETR_PUP_ADE20K
      
      * * Fix the lint bug of transformers.py;
      
      * Add mla neck unit test;
      
      * Convert vit of setr out shape from NLC to NCHW.
      
      * * Modify Resize action of data pipeline;
      
      * Fix deit related bug;
      
      * Set find_unused_parameters=False for pascal context dataset;
      
      * Remove arg: find_unused_parameters which is False by default.
      
      * Error auxiliary head of PUP deit
      
      * Remove the minimal restrict of slide inference.
      
      * Modify doc string of Resize
      
      * Seperate this part of code to a new PR #544
      
      * * Remove some rebundant codes;
      
      * Modify unit tests of SETR heads;
      
      * Fix the tuple in_channels of mla_deit.
      
      * Modify code style
      
      * Move detailed definition of auxiliary head into model config dict;
      
      * Add some setr config for default cityscapes.py;
      
      * Fix the doc string of SETR head;
      
      * Modify implementation of SETR Heads
      
      * Remove setr aux head and use fcn head to replace it;
      
      * Remove arg: img_size and remove last interpolate op of heads;
      
      * Rename arg: conv3x3_conv1x1 to kernel_size of SETRUPHead;
      
      * non-square input support for setr heads
      
      * Modify config argument for above commits
      
      * Remove norm_layer argument of SETRMLAHead
      
      * Add mla_align_corners for MLAModule interpolate
      
      * [Refactor]Refactor of SETRMLAHead
      
      * Modify Head implementation;
      
      * Modify Head unit test;
      
      * Modify related config file;
      
      * [Refactor]MLA Neck
      
      * Fix config bug
      
      * [Refactor]SETR Naive Head and SETR PUP Head
      
      * [Fix]Fix the lack of arg: act_cfg and arg: norm_cfg
      
      * Fix config error
      
      * Refactor of SETR MLA, Naive, PUP heads.
      
      * Modify some attribute name of SETR Heads.
      
      * Modify setr configs to adapt new vit code.
      
      * Fix trunc_normal_ bug
      
      * Parameters init adjustment.
      
      * Remove redundant doc string of SETRUPHead
      
      * Fix pretrained bug
      
      * [Fix] Fix vit init bug
      
      * Add some vit unit tests
      
      * Modify module import
      
      * Remove norm from PatchEmbed
      
      * Fix pretrain weights bug
      
      * Modify pretrained judge
      
      * Fix some gradient backward bugs.
      
      * Add some unit tests to improve code cov
      
      * Fix init_weights of setr up head
      
      * Add DropPath in FFN
      
      * Finish benchmark of SETR
      
      1. Add benchmark information into README.MD of SETR;
      
      2. Fix some name bugs of vit;
      
      * Remove DropPath implementation and use DropPath from mmcv.
      
      * Modify out_indices arg
      
      * Fix out_indices bug.
      
      * Remove cityscapes base dataset config.
      
      Co-authored-by: default avatarsennnnn <201730271412@mail.scut.edu.cn>
      Co-authored-by: default avatarCuttlefishXuan <zhaoxinxuan1997@gmail.com>
      5876868a
  10. Jun 19, 2021
    • sennnnn's avatar
      [Fix] Fix some vit init bugs (#609) · 2a9bf2d2
      sennnnn authored
      * [Fix] Fix vit init bug
      
      * Add some vit unit tests
      
      * Modify module import
      
      * Fix pretrain weights bug
      
      * Modify pretrained judge
      
      * Add some unit tests to improve code cov
      
      * Optimize code
      
      * Fix vit unit test
      2a9bf2d2
  11. Jun 17, 2021
    • sennnnn's avatar
      [Refactor] Using mmcv transformer bricks to refactor vit. (#571) · c01abb4f
      sennnnn authored
      * [Refactor] Using mmcv bricks to refactor vit
      
      * Follow the vit code structure from mmclassification
      
      * Add MMCV install into CI system.
      
      * Add  to 'Install MMCV' CI item
      
      * Add 'Install MMCV_CPU' and 'Install MMCV_GPU CI' items
      
      * Fix & Add
      
      1. Fix low code coverage of vit.py;
      
      2. Remove HybirdEmbed;
      
      3. Fix doc string of VisionTransformer;
      
      * Add helpers unit test.
      
      * Add converter to convert vit pretrain weights from timm style to mmcls style.
      
      * Clean some rebundant code and refactor init
      
      1. Use timm style init_weights;
      
      2. Remove to_xtuple and trunc_norm_;
      
      * Add comments for VisionTransformer.init_weights()
      
      * Add arg: pretrain_style to choose timm or mmcls vit pretrain weights.
      c01abb4f
    • Jerry Jiarui XU's avatar
      [Refactor]: Unified parameter initialization (#567) · 9849a8dc
      Jerry Jiarui XU authored
      * [Refactor]: Unified parameter initialization
      
      * fixed pretrained
      9849a8dc
  12. May 06, 2021
    • sennnnn's avatar
      Add option for output shape of ViT (#530) · 5c195db1
      sennnnn authored
      * Add arg: final_reshape to control if converting output feature information from NLC to NCHW;
      
      * Fix the default value of final_reshape;
      
      * Modify arg: final_reshape to arg: out_shape;
      
      * Fix some unit test bug;
      5c195db1
  13. May 05, 2021
  14. Apr 30, 2021
    • sshuair's avatar
      add metric mFscore (#509) · 7fbdd6f1
      sshuair authored
      * add mFscore and refactor the metrics return value
      
      * fix linting
      
      * some docstring and name fix
      7fbdd6f1
    • sennnnn's avatar
      Adjust vision transformer backbone architectures (#524) · c27ef919
      sennnnn authored
      
      * Adjust vision transformer backbone architectures;
      
      * Add DropPath, trunc_normal_ for VisionTransformer implementation;
      
      * Add class token buring intermediate period and remove it during final period;
      
      * Fix some parameters loss bug;
      
      * * Store intermediate token features and impose no processes on them;
      
      * Remove class token and reshape entire token feature from NLC to NCHW;
      
      * Fix some doc error
      
      * Add a arg for VisionTransformer backbone to control if input class token into transformer;
      
      * Add stochastic depth decay rule for DropPath;
      
      * * Fix output bug when input_cls_token=False;
      
      * Add related unit test;
      
      * * Add arg: out_indices to control model output;
      
      * Add unit test for DropPath;
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarJerry Jiarui XU <xvjiarui0826@gmail.com>
      c27ef919
  15. Apr 29, 2021
  16. Apr 25, 2021
  17. Apr 22, 2021
    • 谢昕辰's avatar
      support transformer backbone (#465) · b379b5a5
      谢昕辰 authored
      
      * vit backbone
      
      * fix lint
      
      * add docstrings and fix pretrained pos_embed dim not match prob
      
      * add unittest for vit
      
      * fix lint
      
      * add vit based fcn configs
      
      * fix import error
      
      * support multiple resolution input images
      
      * upsample pos_embed at init_weights
      
      * support resize pos_embed at evaluation
      
      * fix training errors
      
      * add more unitest code for vit backbone
      
      * unitest for uncovered code
      
      * add norm_eval unittest
      
      * refactor _pos_embeding
      
      * minor change
      
      * change var name
      
      * rafactor init_weight
      
      * load weights after resize
      
      * ignore 'module' in pretrain checkpoint
      
      * add with_cp
      
      * add with_cp
      
      Co-authored-by: default avatarJiarui XU <xvjiarui0826@gmail.com>
      b379b5a5
  18. Apr 14, 2021
  19. Mar 31, 2021
  20. Mar 30, 2021
  21. Mar 29, 2021
    • 谢昕辰's avatar
      [Bug fixed]Fix dice_loss errors (#417) · 71be1c27
      谢昕辰 authored
      * fix training bugs
      
      * fix unitest error
      
      * fix error in num_classes==2 case
      
      * delete comments
      71be1c27
    • 谢昕辰's avatar
      pytorch metrics implementation (#430) · d474cfde
      谢昕辰 authored
      * pytorch metrics impl and test
      
      * support list[str] input, delete unused test code and delete numpy version
      
      * modify input data type
      
      * add docstring and unitest of filename inputs
      
      * add indents in docstring and use tempfile lib to create dir
      
      * using with statement
      d474cfde
  22. Mar 11, 2021
    • 谢昕辰's avatar
      dice loss (#396) · a3b523e3
      谢昕辰 authored
      * dice loss
      
      * format code, add docstring and calculate denominator without valid_mask
      
      * minor change
      
      * restore
      a3b523e3
  23. Jan 20, 2021
  24. Jan 19, 2021
  25. Jan 07, 2021
  26. Dec 26, 2020
    • yamengxi's avatar
      [New model] Support MobileNetV3 (#268) · 25d8d77f
      yamengxi authored
      * delete markdownlint
      
      * Support MobileNetV3
      
      * fix import
      
      * add mobilenetv3 head and configs
      
      * Modify MobileNetV3 to semantic segmentation version
      
      * modify mobilenetv3 configs
      
      * add std configs
      
      * fix Conv2dAdaptivePadding bug
      
      * add configs
      
      * add unitest and fix bugs
      
      * fix lraspp unitest bugs
      
      * restore
      
      * fix unitest
      
      * add MobileNetV3 docstring
      
      * add mmcv
      
      * add mmcv
      
      * fix syntax bug
      
      * fix unitest bug
      
      * fix unitest bug
      
      * fix unitest bugs
      
      * fix docstring
      
      * add configs
      
      * restore
      
      * delete unnecessary assert
      
      * modify unitest
      
      * delete benchmark
      25d8d77f
  27. Dec 24, 2020
  28. Dec 18, 2020
    • Junjun2016's avatar
      Support APCNet (#299) · a5d15ae2
      Junjun2016 authored
      * Support APCNet
      
      * code optimization
      
      * add apcnet configs
      
      * add benchmark
      
      * add readme and model zoo
      
      * fix doc
      a5d15ae2
  29. Dec 15, 2020
  30. Dec 14, 2020
  31. Dec 02, 2020
    • yamengxi's avatar
      [Feature]add CLAHE transform (#229) · 26f4bed2
      yamengxi authored
      * add CLAHE transform
      
      * fix syntax error
      
      * fix syntax error
      
      * restore
      
      * add a test
      
      * modify cv2 to mmcv
      
      * add docstring
      
      * modify
      
      * restore
      
      * fix mmcv.clahe error
      
      * change mmcv version to 1.3.0
      
      * fix bugs
      
      * add all data transformers to __init__
      
      * fix __init__
      
      * fix test_transform
      26f4bed2
    • yamengxi's avatar
      [Feature] add AdjustGamma transform (#232) · e8d643fe
      yamengxi authored
      * add AdjustGamma transform
      
      * restore
      
      * change cv2 to mmcv
      
      * simplify AdjustGamma
      
      * fix syntax error
      
      * modify
      
      * fix syntax error
      
      * change mmcv version to 1.3.0
      
      * fix lut function name error
      
      * fix syntax error
      
      * fix range
      e8d643fe
  32. Nov 24, 2020
    • Junjun2016's avatar
      add dice evaluation metric (#225) · 1530af65
      Junjun2016 authored
      * add dice evaluation metric
      
      * add dice evaluation metric
      
      * add dice evaluation metric
      
      * support 2 metrics
      
      * support 2 metrics
      
      * support 2 metrics
      
      * support 2 metrics
      
      * fix docstring
      
      * use np.round once for all
      1530af65
  33. Nov 17, 2020
Loading