# ask-ai
I'm saving a csv file to my s3 bucket using S3PickleIOManager. It appears as if the content is not saved as a regular csv utf8 string.
Based on the documentation, the `S3PickleIOManager` is designed to manage op outputs by pickling Python objects and saving them to S3. Pickling serializes Python objects, so the data is stored not as plain text (like a UTF-8 encoded CSV file) but in a binary format that can later be deserialized (unpickled) back into a Python object.

If you want to save data to S3 as a regular UTF-8 encoded CSV file, you need a different approach. You could create a custom IO manager that writes the data to S3 as CSV, or use an existing IO manager that supports the desired format. The documentation does not provide a direct example of an IO manager for writing CSV files to S3, but you could adapt the existing `S3PickleIOManager` by overriding the `handle_output` method to write CSV data instead of pickled data. Alternatively, you could look for community-contributed IO managers or examples that handle CSV files specifically.

To produce the CSV itself, you would typically use Python's built-in `csv` module or the pandas `DataFrame.to_csv` method, then upload the result to S3 with the `boto3` library (the AWS SDK for Python). When uploading, you can set the `ContentType` metadata to `text/csv; charset=utf-8` to indicate that the object is a UTF-8 encoded CSV file.

If you need further assistance creating a custom IO manager or adapting an existing one to handle CSV files, refer to the Dagster community or documentation for more specific guidance and examples.
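As a rough illustration of the upload step, here is a minimal sketch. The helper serializes rows to UTF-8 CSV bytes with the standard-library `csv` module, and the upload function passes those bytes to `boto3`'s `put_object` with the `ContentType` set as described above. A custom Dagster IO manager's `handle_output` could call helpers like these; the function names and the bucket/key values are hypothetical, not from any Dagster API.

```python
import csv
import io

def rows_to_csv_bytes(rows, header):
    """Serialize a header plus rows of values to UTF-8 CSV bytes."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue().encode("utf-8")

def upload_csv_to_s3(body, bucket, key):
    """Upload CSV bytes to S3, labeling the object as UTF-8 CSV.

    boto3 is imported inside the function so the serialization helper
    above can be used without boto3 installed.
    """
    import boto3
    s3 = boto3.client("s3")
    s3.put_object(
        Bucket=bucket,
        Key=key,
        Body=body,
        # Tells S3 (and downstream consumers) the object is UTF-8 CSV.
        ContentType="text/csv; charset=utf-8",
    )

# Example (hypothetical bucket/key):
# upload_csv_to_s3(rows_to_csv_bytes([["a", 1]], ["col", "n"]),
#                  bucket="my-bucket", key="data/output.csv")
```

If you are working from a pandas DataFrame instead of plain rows, `df.to_csv(index=False).encode("utf-8")` produces an equivalent byte payload for the same upload call.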