This repository was archived by the owner on Aug 7, 2024. It is now read-only.

Float8 + DTensor Integration #194

@drisspg

Summary

This issue tracks progress and updates on the integration of Float8Tensor with DTensor.

Background

DTensor is the PyTorch-native solution for tensor parallelism and sequence parallelism (TP/SP) and is designed to work with torch.compile. It uses tensor subclasses and module hooks to extend existing models.
Documentation: https://github.com/pytorch/pytorch/blob/main/torch/distributed/_tensor/README.md
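For orientation, a minimal sketch of basic DTensor usage (an illustrative example, not from this issue: it assumes torchrun has already initialized the default process group, and uses the DeviceMesh, Shard, and distribute_tensor APIs from torch.distributed._tensor):

```python
import torch
import torch.distributed as dist
from torch.distributed._tensor import DeviceMesh, Shard, distribute_tensor

# Assumes the default process group is already initialized (e.g. via torchrun).
mesh = DeviceMesh("cuda", torch.arange(dist.get_world_size()))

# Shard a plain tensor along dim 0; each rank materializes only its shard.
weight = torch.randn(1024, 1024)
dweight = distribute_tensor(weight, mesh, placements=[Shard(0)])

# DTensor is a tensor subclass, so ordinary ops dispatch through it;
# elementwise ops run directly on the local shard.
doubled = dweight * 2
local = doubled.to_local()  # recover this rank's shard as a plain tensor
```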

What is needed:

  • refactor activation/grad casts into module hooks: [wip] add option to do activation/grad cast from hooks #170 (see the hook sketch after this list)
  • torch.compile support for Float8Linear + DTensor
  • support hook reordering
  • allgather/reduce_scatter: the Float8Tensor subclass needs to implement these collectives
  • cast_to_float8(DTensor(fp32/fp16 shard)) should produce DTensor(Float8Tensor) (see the cast sketch after this list)
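To make the first and last items concrete, here is a hedged sketch of how the hook-based cast and the subclass layering could compose. cast_to_float8 below is a hypothetical per-tensor scaled cast standing in for the repo's Float8Tensor constructor (the real one also carries the scale for the backward pass); the point being illustrated is the desired nesting order, DTensor(Float8Tensor), obtained by casting the local shard and rewrapping:

```python
import torch
import torch.nn as nn
from torch.distributed._tensor import DTensor

FP8_MAX = 448.0  # largest finite value representable in float8_e4m3fn

def cast_to_float8(x: torch.Tensor) -> torch.Tensor:
    """Hypothetical per-tensor scaled cast; stands in for the Float8Tensor ctor."""
    if isinstance(x, DTensor):
        # Desired composition: cast the local shard, then rewrap, so the
        # result is DTensor(Float8Tensor) rather than Float8Tensor(DTensor).
        return DTensor.from_local(
            cast_to_float8(x.to_local()), x.device_mesh, x.placements
        )
    scale = FP8_MAX / x.abs().max().clamp(min=1e-12)
    return (x * scale).clamp(-FP8_MAX, FP8_MAX).to(torch.float8_e4m3fn)

def _cast_activation_hook(module: nn.Module, args):
    # Forward pre-hook (first list item): cast floating-point inputs on the
    # way into the module, instead of casting inside Float8Linear.forward.
    return tuple(
        cast_to_float8(a) if torch.is_floating_point(a) else a for a in args
    )

linear = nn.Linear(16, 16)
linear.register_forward_pre_hook(_cast_activation_hook)
```

The "support hook reordering" item presumably matters here: the float8 cast hook has to run in a well-defined order relative to the module hooks DTensor itself installs.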

Related Issues:

Pull Requests:
