Rewrite ATen native docs. (#4816)
* Rewrite ATen native docs.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
* Formatting fix
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
* Some of the CR comments
* More CR comments [ci skip]
* One last CR comment
diff --git a/aten/src/ATen/native/README.md b/aten/src/ATen/native/README.md
new file mode 100644
index 0000000..02fe7fa
--- /dev/null
+++ b/aten/src/ATen/native/README.md
@@ -0,0 +1,258 @@
+ATen "native" functions are the modern mechanism for adding operators and
+functions to ATen (they are "native" in contrast to legacy functions, which are bound
+via TH/THC cwrap metadata). Native functions
+are declared in `native_functions.yaml` and have implementations defined
+in one of the `cpp` files in this directory.
+
+Like all ATen methods/functions, native functions are made available
+from both ATen's C++ and Python APIs. In C++, they are made available
+both as methods on `Tensor` (`t.mymeth()`) and as functions in the ATen
+namespace (`at::myfunc()`). In PyTorch, they are made available as
+methods on `Variable` or as functions on `torch._C._FunctionBase`
+(it is the user's responsibility to re-export these functions in
+a more user-facing module.) At the moment, only
+functions which ingest `Variable` are made available; to use a function
+with non-differentiable tensors, wrap your tensors in `Variable` before
+passing them in.
+
+The rest of this document describes how to implement an ATen function.
+
+## Registering a function in `native_functions.yaml`
+
+Every native function must have an entry in
+`native_functions.yaml`. The format can be summarized as:
+
+```
+- func: func_name(ArgType arg0[=default], ArgType arg1[=default], ...) -> ReturnType
+  variants: function, method
+  dispatch:
+    CPU: func_cpu
+    CUDA: func_cuda
+```
+
+Each component is described in more detail below:
+
+### `func`
+
+```
+- func: func_name(ArgType arg0[=default], ArgType arg1[=default], ...) -> ReturnType
+```
+
+The `func` entry is a string describing the name of the function and its type
+signature.
+
+**Argument types.** These types are permissible as ArgType:
+
+- `Tensor`. A `Tensor` argument translates into a C++ argument of type `const Tensor&`.
+ A trailing `?`, as in `Tensor?`, indicates that the tensor argument is optional
+ and may be omitted by passing an undefined tensor. When a function takes multiple
+ `Tensor` arguments, these tensors are assumed to be the same type (e.g.,
+ if one argument is a `FloatTensor`, all other arguments are checked
+ to be `FloatTensor`s.)
+- Tensors of specific types. At the moment, valid type names are:
+  - `IntegerTensor` (a.k.a. `LongTensor`)
+  - `BoolTensor` (a.k.a. `ByteTensor`)
+  - `IndexTensor` (a.k.a. `IntTensor`)
+
+  These type names were inherited from TH, and may be renamed soon, so
+  don't commit them to memory.
+- `TensorList`. A `TensorList` argument translates into a C++ argument of type `ArrayRef<Tensor>`
+ (a.k.a. `TensorList`)
+- `IntList`. `IntList` accepts an optional length specifier, e.g., `IntList[2]`, which
+ has no effect in C++ but extends our Python bindings to accept a bare number, which will be
+ expanded into an appropriately sized list by repeating the number.
+- `int64_t`. There is no `int`; ATen policy is to use `int64_t` in the API anywhere you would
+ have ordinarily passed an `int` or `size_t`.
+- `double`. There is no `float`; ATen policy is to use `double` anywhere you would have used `float`.
+- `bool`
+- `Generator*`, the state for a random number generator.
+- `std::array<bool,N>` (where N is `1-4`). NB: you MUST NOT put a space after the comma, otherwise
+ this argument will not parse correctly. (If you decide to fix this, make sure you fix the
+ argument parser both in ATen and in PyTorch.)
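+
+Putting several of these together, a hypothetical declaration might look
+like this (the function and argument names are made up):
+
+```
+- func: my_gather(Tensor self, IndexTensor index, int64_t dim=0, bool sparse=false) -> Tensor
+```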
+
+**Return types.** These types are permissible as ReturnType:
+
+- `Tensor` and `TensorList`, which translate into the C++ types `Tensor` and `std::vector<Tensor>`,
+ respectively.
+- A tuple of any number of `Tensor`, e.g., `(Tensor, Tensor)`, translating into
+ the C++ `std::tuple<Tensor, Tensor>`.
+
+If you need a type that is not in this list, it may be possible to extend ATen's
+code generation to support it. ATen's philosophy on types to support is that it supports
+only simple, universal types, as well as a handful of fundamental Tensor structures
+(e.g., `Tensor` and `Generator*`), because these types can be easily ported to any language
+bound to ATen (in practice, C++ and Python.)
+
+**Argument names.** Argument names are meaningful; downstream binding code may make use of the specific
+argument name you provide, and renaming an argument is considered a
+BC-breaking change (e.g., you will probably need to update
+`tools/autograd/derivatives.yaml` at least).
+
+TODO: Do argument names affect Python keyword arguments?
+
+**Defaults.** Any suffix of arguments can have a default value defined;
+these default values translate into C++/Python default values which
+are applied when those positional arguments are not specified.
+
+Here are the supported default values:
+
+* Numbers (e.g., `0` or `5.0`) for `int64_t`, `double`, and `IntList`
+  with an explicit length (e.g., `IntList[2]`); in the case of `IntList`,
+  a number is replicated to fill the length (e.g., `IntList[2] x=2`
+  is equivalent to `IntList[2] x={2,2}`).
+* Lists of numbers (e.g., `{0, 0}`) for `IntList`.
+* Booleans (e.g., `true`) for `bool`.
+* Empty initializer lists (e.g., `{}`) for `Tensor` (this implicitly changes
+ a `Tensor` argument to accept undefined tensors).
+* `nullptr` for pointer types (e.g., `Generator*`)
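+
+For example, a hypothetical declaration using several of these default
+forms (all names made up):
+
+```
+- func: my_pool(Tensor self, IntList[2] kernel_size, IntList[2] stride={1,1}, bool ceil_mode=false, Tensor weight={}) -> Tensor
+```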
+
+The declarations also support the following attributes:
+
+### `variants`
+
+```
+variants: function, method
+```
+
+Controls whether a Tensor method (`t.foo()`) or a namespace function
+(`at::foo()`) is generated as a result of this declaration. If the
+declaration has a method variant, it must have an argument named
+`Tensor self` at some position; in the method variant this argument
+will be elided from the argument list. For example, given the
+declaration `where(BoolTensor cond, Tensor self, Tensor other)`, ATen
+generates the function `at::where(cond, self, other)` and the method
+`self.where(cond, other)`.
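+
+Written as a full declaration, this example would be (a sketch; the
+actual entry may differ):
+
+```
+- func: where(BoolTensor cond, Tensor self, Tensor other) -> Tensor
+  variants: function, method
+```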
+
+By default, ATen generates both function and method variants for a native function.
+The function variant is almost always useful; however, you may not wish
+to generate a method variant. Tensor operations as methods are appropriate for "core"
+Tensor operations (e.g., add, sub, etc.), but not for more complicated neural network
+layers (e.g., `conv2d`) or internal functions designed specifically for binding
+(e.g., `cudnn_convolution`).
+
+### `dispatch`
+
+```
+dispatch:
+  CPU: func_cpu
+  CUDA: func_cuda
+```
+
+This specifies the actual name of the function you want to dispatch to, so you
+can dispatch to different functions depending on whether you have CPU or
+CUDA tensors. Technically, it is also possible to write `dispatch: func_name`
+to unconditionally dispatch to a native function whose name differs from
+the name in the public ATen API, but this is generally frowned upon (just name
+them the same thing!)
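+
+Given the `dispatch` entry above, you would then write two C++
+implementations, one per backend. A minimal sketch (assuming a
+hypothetical declaration `- func: func(Tensor self) -> Tensor`):
+
+```
+#include "ATen/ATen.h"
+
+namespace at { namespace native {
+
+// Lives in a file like aten/src/ATen/native/Func.cpp (hypothetical)
+Tensor func_cpu(const Tensor& self) {
+  // CPU-specific implementation goes here
+  return self.clone();
+}
+
+// Lives in a file under aten/src/ATen/native/cuda/ (hypothetical)
+Tensor func_cuda(const Tensor& self) {
+  // CUDA-specific implementation goes here
+  return self.clone();
+}
+
+}} // namespace at::native
+```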
+
+### `python_default_init`
+
+```
+python_default_init:
+  argument_name: initializing_expression
+```
+
+A map from argument names to default initializing expressions written in C++. Such default
+expressions will only be used in the Python API (in the C++ API, these arguments are
+mandatory).
+
+There are a few situations where you might like to use this functionality:
+
+- You want a default value which is fine in Python but would cause ambiguity in C++.
+ TODO: Explain this in more detail.
+
+- You want a value to default to the same value as another argument (this cannot
+  be expressed in C++ default arguments; see the example below).
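+
+For example, a hypothetical declaration where `fft_size` defaults to the
+value of `frame_length` in the Python API only (all names made up):
+
+```
+- func: my_stft(Tensor self, int64_t frame_length, int64_t fft_size) -> Tensor
+  python_default_init:
+    fft_size: frame_length
+```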
+
+If you grep for `python_default_init`, you can find examples of this being used;
+in general, most functions will not need to use this.
+
+## Writing an implementation in C++
+
+Implementations of native functions go in an appropriate C++ file in the
+`native/` directory (they are organized roughly by topic, but there is no
+semantic meaning to their organization aside from the `cuda` directory,
+which is the only place the build system knows how to build `.cu` files).
+To write a native function, you only need to write a C++
+implementation (no header necessary) whose signature matches the
+declaration generated from the ATen metadata in `NativeFunctions.h`. There are many
+simple native functions; take a look at some of them to see what to do.
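+
+For instance, a complete minimal native function might look like this
+(a sketch; `my_op` is a made-up function with a hypothetical declaration
+`- func: my_op(Tensor self) -> Tensor`):
+
+```
+// aten/src/ATen/native/MyOp.cpp (hypothetical file)
+#include "ATen/ATen.h"
+#include "ATen/NativeFunctions.h"
+
+namespace at { namespace native {
+
+// The signature must match the generated declaration: a Tensor argument
+// becomes const Tensor&.
+Tensor my_op(const Tensor& self) {
+  // Implemented purely in terms of other ATen operations, so it is
+  // a candidate for automatic differentiation (see below).
+  return self.mul(2).add(1);
+}
+
+}} // namespace at::native
+```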
+
+For the most part, writing an ATen function is just writing the algorithm
+you want to implement, but there are some less obvious details
+you should also consider.
+
+### Will your function be automatically differentiable?
+
+If you are writing a pair of functions `foo` and `foo_backward`, with
+the intent that `foo_backward` implements the derivative of `foo`, then
+your implementation of `foo` is probably not automatically differentiable:
+it might make use of functions like `data_ptr()`, or it might dispatch
+differently depending on whether it is operating on CPU or CUDA tensors.
+Once you write these two functions, you will have to write an entry
+relating them in `tools/autograd/derivatives.yaml`.
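+
+Such an entry might look roughly like this (a sketch; see
+`tools/autograd/derivatives.yaml` itself for the actual format):
+
+```
+- name: foo(Tensor self)
+  self: foo_backward(grad, self)
+```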
+
+However, in some situations, you can write a function in ATen and it
+will be automatically differentiated! This can be the case if the function implementation
+only calls other operations which are themselves differentiable. In this
+case, you don't have to write an entry in `tools/autograd/derivatives.yaml`.
+
+### Can it handle being passed Variables?
+
+The biggest subtlety of writing an ATen implementation is the fact that
+`Tensor` is not a "final" class: your implementation may be passed objects
+which inherit from `Tensor` (in particular, the `Variable` subclass
+implements automatic differentiation in PyTorch.) This has some
+direct consequences for valid implementations:
+
+* Never create a `Tensor` directly (e.g., via `at::CPU` or `at::CUDA`), as a
+  caller will be expecting to get `Variable`s out if it passes `Variable`s in.
+  Instead, create tensors from the `type()` of one of the input tensors, e.g.,
+  `input.type().tensor()` or `input.type().toScalarType(kByte)` if you need
+  a different scalar type.
+
+* If you need to call other ATen functions, be sure to qualify the call
+  with `at::`; don't call them unqualified (in the `at::native` namespace).
+  Using the qualified name ensures that your invocation is dispatched
+  through the dynamic type of the tensor (e.g., `Variable`), which may
+  override the behavior rather than dispatching directly to `at::native`
+  (see the sketch below).
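+
+A sketch of both rules in practice (`my_scale` is a hypothetical function):
+
+```
+// Hypothetical native function: scales the input into a fresh tensor.
+Tensor my_scale(const Tensor& input, double s) {
+  // Allocate through the input's type, not via at::CPU/at::CUDA, so that
+  // Variable inputs produce Variable outputs.
+  Tensor result = input.type().tensor(input.sizes());
+  // Qualified call: dispatched through the dynamic type of input.
+  result.copy_(at::mul(input, s));
+  return result;
+}
+```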
+
+These are not hard and fast rules: in particular, if you explicitly define
+a derivative for a function, it will only ever be called with `Tensor`
+arguments. However, it is considered good style to abide by these rules,
+since code written in this style is more robust.
+
+NB: There is one downside to following the `at::` qualification rule, which
+is that if you know that you will only ever be called with `Tensor`, a
+direct `at::native` call will be more efficient (as it avoids a dynamic
+dispatch).
+
+### Undefined tensor conventions
+
+By default, `Tensor` arguments to ATen functions are always defined, unless
+you explicitly specify that an undefined tensor is permissible by writing
+`Tensor?` or `Tensor x={}`.
+
+The rules for returning undefined Tensors are a bit more subtle, but there
+is only one case you have to remember:
+
+* If the function in question is a backward function which accepts a
+  `std::array<bool,N> output_mask` argument, you MUST return an undefined
+  `Tensor` at every tuple position `i` for which `output_mask[i]` is false
+  (see the sketch after this list); otherwise,
+
+* you MUST NOT return an undefined tensor.
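+
+A sketch of the `output_mask` convention (`my_op_backward` and its
+derivative formulas are hypothetical):
+
+```
+std::tuple<Tensor, Tensor> my_op_backward(
+    const Tensor& grad_output, const Tensor& self, const Tensor& weight,
+    std::array<bool,2> output_mask) {
+  Tensor grad_self, grad_weight;  // default-constructed, i.e., undefined
+  if (output_mask[0]) {
+    grad_self = at::mul(grad_output, weight);
+  }
+  if (output_mask[1]) {
+    grad_weight = at::mul(grad_output, self);
+  }
+  return std::make_tuple(grad_self, grad_weight);
+}
+```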
+
+The most common situations where you might be tempted to return undefined tensors
+are when:
+
+- You have a forward function that may return a buffer if training is enabled, but does not
+ return the buffer in inference mode. In this case, just return an appropriately
+ typed zero-size tensor.
+
+- You have a backward function where the gradient for an input is zero. In this case, you
+  are expected to create a zero-filled tensor of appropriate size to return for this input.
+  To get the shape, it may be helpful to take a `TensorGeometry` of the input,
+  since the input tensor itself may not be available when the backward runs.
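+
+A sketch of the training-buffer case (`my_forward` and its computation
+are made up):
+
+```
+// Returns (output, buffer); the buffer is only meaningful in training
+// mode, so in inference mode we return a typed zero-size tensor rather
+// than an undefined one.
+std::tuple<Tensor, Tensor> my_forward(const Tensor& input, bool training) {
+  Tensor output = at::mul(input, 2);
+  Tensor buffer = training ? at::mul(input, input)
+                           : input.type().tensor({0});
+  return std::make_tuple(output, buffer);
+}
+```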
+
+### Debugging tips
+
+If you build ATen and get a linker error, that probably means you copy-pasted
+the C++ definition of your function incorrectly. Double check your `Tensor`
+arguments, and make sure you wrote `const Tensor&` in your signature.
diff --git a/aten/src/ATen/native/native_functions.yaml b/aten/src/ATen/native/native_functions.yaml
index bfb1bfb..e40b3ec 100644
--- a/aten/src/ATen/native/native_functions.yaml
+++ b/aten/src/ATen/native/native_functions.yaml
@@ -1,36 +1,4 @@
-# ATen native functions are a mechanism to write ATen methods which only
-# make use of other ATen operations (e.g., it is not necessary to bind into
-# TH/THC code). These functions are declared in this file and then folded
-# into the ATen code generation process.
-#
-# The simple format is as follows:
-# - func: func_name(ArgType arg0[=default], ArgType arg1[=default], ...) -> ReturnType
-# ArgType(s) are allowed to be simple types understood by ATen
-# (e.g. Tensor, TensorList, IntList, int64_t, double).
-# ReturnType is allowed to be any ArgType or tuple combination of ArgTypes(s), e.g. (Tensor, Tensor)
-# defaults are optional and are only allowed to be numbers (e.g. '0' for int64_t, '5.0' for double)
-#
-# The declarations also support the following attributes:
-# variants: [function, method] by default; controls whether Tensor method or namespace Function is generated
-# as a result of this declaration.
-# dispatch: equal to the func_name by default; this can be overridden by providing a
-# backend-specific name to dispatch to, e.g.:
-# CPU: func_cpu
-# CUDA: func_cuda
-# python_default_init: a map from argument names to default initialize expressions in C++. Such default
-# expressions will only be used in Python API. This allows us to write argument with
-# a default value that can either cause ambiguity in c++ (e.g., `Scalar p` argument
-# in `norm`) or have a type that doesn't allow default value None/NULL/nullptr (e.g.,
-# `int64_t fft_size` argument in stft, which we want to default to value of another
-# argument if not provided).
-#
-# In addition to the variants generation, these declarations will also generate a C++ function declaration
-# in NativeFunctions.h; it is up to you to write the corresponding definitions in the correct file under /native
-# (the exact file depends on if you are writing a generic (all backend), or CPU/CUDA specific implementation).
-# Note that these generated C++ function declarations won't match the declaration here because they will
-# undergo the standard ATen C++ transformations, e.g. use of const-ref for non-inplace Tensor arguments. It is
-# recommended that you copy the generated C++ function declaration to your definition so that the two
-# match.
+# See README.md in this directory for more guidance
- func: adaptive_avg_pool1d(Tensor self, IntList[1] output_size) -> Tensor
variants: function