doyleconan bsnelling commited on
Commit
93fdd4f
·
verified ·
1 Parent(s): d889bb3

Upload data_summary_card.md (#3)

Browse files

- Upload data_summary_card.md (504c103364dfea461488d8e106543e1b39d47d97)


Co-authored-by: Blake S <bsnelling@users.noreply.huggingface.co>

Files changed (1) hide show
  1. data_summary_card.md +157 -0
data_summary_card.md ADDED
@@ -0,0 +1,157 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+
3
+ # Data Summary for microsoft_renderformer-v1.1-swin-large
4
+
5
+
6
+
7
+
8
+
9
+ ## 1. General information
10
+
11
+ **1.0.1 Version of the Summary:** 1.0
12
+
13
+
14
+
15
+ **1.0.2 Last update:** 16-Dec-2025
16
+
17
+
18
+
19
+ ## 1.1 Model Developer Identification
20
+
21
+ **1.1.1 Model Developer name and contact details:** Microsoft Corporation at One Microsoft Way, Redmond, WA 98052. Tel: 425-882-8080
22
+
23
+
24
+
25
+ ## 1.2 Model Identification
26
+
27
+ **1.2.1 Versioned model name(s):** RenderFormer-V1.1-Large
28
+
29
+
30
+
31
+ **1.2.2 Model release date:** 30-May-2025
32
+
33
+
34
+
35
+ ## 1.3 Overall training data size and characteristics
36
+
37
+ ### 1.3.1 Size of dataset and characteristics
38
+
39
+ **1.3.1.A Text training data size:** Not applicable
40
+
41
+
42
+
43
+ **1.3.1.B Text training data content:** Not applicable
44
+
45
+
46
+
47
+ **1.3.1.C Image training data size:** 1 million to 1 billion images
48
+
49
+
50
+
51
+ **1.3.1.D Image training data content:** HDR rendered images of synthetic 3D scenes composed of triangle meshes from Objaverse objects placed in template environments, with varying materials, lighting (1–8 diffuse triangle lights), and camera viewpoints at 256×256 and 512×512 resolutions
52
+
53
+
54
+
55
+ **1.3.1.E Audio training data size:** Not applicable. Audio data is not part of the training data
56
+
57
+
58
+
59
+ **1.3.1.F Audio training data content:** Not applicable
60
+
61
+
62
+
63
+ **1.3.1.G Video training data size:** Not applicable. Video data is not part of the training data
64
+
65
+
66
+
67
+ **1.3.1.H Video training data content:** Not applicable
68
+
69
+
70
+
71
+ **1.3.1.I Other training data size:** Scenes comprised of triangle meshes up to 4,096 triangles per scene during training.
72
+
73
+
74
+
75
+ **1.3.1.J Other training data content:** Triangle mesh geometry and per-triangle reflectance parameters (diffuse albedo, specular albedo, roughness), per-vertex normals, and light source specifications for synthetic scenes derived from Objaverse objects and template walls
76
+
77
+
78
+
79
+ **1.3.2 Latest date of data acquisition/collection for model training:** Dec 2024
80
+
81
+
82
+
83
+ **1.3.3 Is data collection ongoing to update the model with new data collection after deployment?** There is no further training on this released model.
84
+
85
+
86
+
87
+ **1.3.4 Date the training dataset was first used to train the model:**
88
+
89
+ 01-May-2024
90
+
91
+
92
+
93
+ **1.3.5 Rationale or purpose of data selection:** Synthetic scenes built from objects within templated environments were used to cover diverse geometries, materials, lighting setups, and camera placements while remaining controllable for supervised training with reference renders. This supports learning a generalizable neural rendering pipeline for triangle meshes with global illumination
94
+
95
+
96
+
97
+ ## 2. List of data sources
98
+
99
+ ### 2.1 Publicly available datasets
100
+
101
+ **2.1.1 Have you used publicly available datasets to train the model?** Yes
102
+
103
+
104
+
105
+ ## 2.2 Private non-publicly available datasets obtained from third parties
106
+
107
+ ### 2.2.1 Datasets commercially licensed by rights holders or their representatives
108
+
109
+ **2.2.1.A Have you concluded transactional commercial licensing agreement(s) with rights holder(s) or with their representatives?** Not applicable
110
+
111
+
112
+
113
+ ### 2.2.2 Private datasets obtained from other third-parties
114
+
115
+ **2.2.2.A Have you obtained private datasets from third parties that are not licensed as described in Section 2.2.1, such as data obtained from providers of private databases, or data intermediaries?**
116
+
117
+ No.
118
+
119
+
120
+
121
+ ## 2.3 Personal Information
122
+
123
+ **2.3.1 Was personal data used to train the model?**
124
+
125
+ No.
126
+
127
+
128
+
129
+ ## 2.4 Synthetic data
130
+
131
+ **2.4.1 Was any synthetic AI-generated data used to train the model?**
132
+
133
+ No.
134
+
135
+
136
+
137
+ ## 3. Data processing aspects
138
+
139
+ ### 3.1 Respect of reservation of rights from text and data mining exception or limitation
140
+
141
+ **3.1.1 Does this dataset include any data protected by copyright, trademark, or patent?** Microsoft follows all required regulations and laws for processing data protected by copyright, trademark, or patent
142
+
143
+
144
+
145
+ ## 3.2 Other information
146
+
147
+ **3.2.1 Does the dataset include information about consumer groups without revealing individual consumer identities?** Microsoft follows all required regulations and laws for protecting consumer identities
148
+
149
+
150
+
151
+ **3.2.2 Was the dataset cleaned or modified before model training?**
152
+
153
+ No.
154
+
155
+
156
+
157
+